Datasets¶
- PyRapidML.datasets.extract_data(dataset='index', save_copy=False, profile=False, verbose=True)¶
This function loads sample datasets from git repository. List of available datasets can be checked using
get_data('index').Example
>>> from PyRapidML.datasets import get_data >>> all_datasets = extract_data('index') >>> juice = extract_data('juice')
- dataset: str, default = ‘index’
Index value of dataset.
- save_copy: bool, default = False
When set to true, it saves a copy in current working directory.
- profile: bool, default = False
When set to true, an interactive EDA report is displayed.
- verbose: bool, default = True
When set to False, head of data is not displayed.
- Returns
pandas.DataFrame
Warning
Use of
extract_datarequires internet connection.