parsnip.load_dataset
- parsnip.load_dataset(path, kind=None, in_memory=True, reject_invalid=True, require_redshift=True, label_map=None, valid_classes=None, verbose=True)
Load a dataset using the lcdata package.
This can be any lcdata HDF5 dataset. We use parse_dataset to clean things up for ParSNIP by rejecting irrelevant light curves (e.g. galactic ones) and updating class labels.
We try to guess the dataset type from the filename. If this doesn’t work, specify the kind explicitly instead.
- Parameters:
path (str) – Path to the dataset on disk
kind (str, optional) – Kind of dataset, by default we will attempt to determine it from the filename
in_memory (bool, optional) – If False, don’t load the light curves into memory, and only load the metadata. See lcdata.Dataset for details.
reject_invalid (bool, optional) – Whether to reject invalid light curves, by default True
label_map (dict, optional) – Custom dict that overrides the default classification label mapping
verbose (bool, optional) – If True, print parsing information, by default True
- Returns:
Loaded dataset
- Return type:
Dataset
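- Example:
A minimal usage sketch, not taken from the ParSNIP documentation. It forwards only keyword arguments that appear in the signature above. The wrapper name load_light_curves, its loader argument, and the file path in the trailing comment are hypothetical and exist only for illustration; loader is there so the call can be exercised without ParSNIP installed.

```python
def load_light_curves(path, kind=None, label_map=None, loader=None):
    """Load an lcdata HDF5 dataset through parsnip.load_dataset.

    `loader` is a hypothetical hook (not part of ParSNIP) that replaces
    parsnip.load_dataset, e.g. with a stub in tests.
    """
    if loader is None:
        # Imported lazily so this function can be defined without ParSNIP.
        import parsnip
        loader = parsnip.load_dataset

    return loader(
        str(path),
        kind=kind,            # None lets ParSNIP guess the kind from the filename
        in_memory=True,       # load the light curves, not only the metadata
        reject_invalid=True,  # drop invalid light curves
        label_map=label_map,  # None keeps the default label mapping
        verbose=False,        # suppress parsing output
    )


# Hypothetical call; the path is a placeholder, not a real file:
# dataset = load_light_curves("path/to/dataset.h5")
```

If the dataset kind cannot be guessed from the filename, pass it through the kind argument. Which kind strings are accepted depends on the installed ParSNIP version.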