explabox.explore

Functions/classes for exploring your data (dataset descriptives).

class explabox.explore.Explorer(data=None, ingestibles=None, **kwargs)

Bases: Readable, IngestiblesMixin

The Explorer explores your data by providing descriptive statistics.

The Explorer requires ‘data’ defined. It is included in the Explabox under the .explore property.

Examples

Get dataset descriptives:

>>> from explabox.explore import Explorer
>>> explorer = Explorer(data=data)
>>> explorer()

Show the first 10 instances of the test split

>>> from explabox.explore import Explorer
>>> explorer = Explorer(data=data)
>>> explorer.instances(split="test")[:10]
Parameters:
  • data (Optional[Environment], optional) – Data for ingestibles. Defaults to None.

  • ingestibles (Optional[Ingestible], optional) – Ingestible. Defaults to None.

descriptives(**kwargs)

Describe features such as the amount per label for the train, test and model predictions and text data specific features such as the maximum/minimum/mean amount of words in a sample and the standard deviation.

Returns:

Descriptive statistics of each split.

Return type:

Descriptives

instances(split='test', **kwargs)

Get the instances of the given split.

Parameters:

split (str, optional) – Split to select. Defaults to “test”.

Returns:

Instances in the split.

Return type:

Dataset