Datasets

PyRapidML.datasets.extract_data(dataset='index', save_copy=False, profile=False, verbose=True)

This function loads sample datasets from git repository. List of available datasets can be checked using get_data('index').

Example

>>> from PyRapidML.datasets import get_data
>>> all_datasets = extract_data('index')
>>> juice = extract_data('juice')
dataset: str, default = ‘index’

Index value of dataset.

save_copy: bool, default = False

When set to true, it saves a copy in current working directory.

profile: bool, default = False

When set to true, an interactive EDA report is displayed.

verbose: bool, default = True

When set to False, head of data is not displayed.

Returns

pandas.DataFrame

Warning

  • Use of extract_data requires internet connection.