Loads an OnyxDataset from the Onyx Engine. Uses the local cache when available.
Parameters
The name of the dataset to load.
The specific version ID to load. If None, loads the latest version.
Returns
The loaded dataset with:
- dataframe: pandas DataFrame with the data
- config: OnyxDatasetConfig with features and dt
Raises
- Exception: If the dataset is not found in the Engine
- Exception: If the dataset status is not “active”
Example
Load Specific Version
Caching Behavior
The SDK caches datasets locally in ~/.onyx/datasets/:
- If the dataset exists locally and matches the requested version, it’s loaded from cache
- If the local version is outdated, the new version is downloaded
- If requesting None (latest) and the local version matches the latest, the cache is used
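The caching rules above can be read as a small decision function. The following is only a sketch of that logic, not the SDK's actual implementation: `should_use_cache` and its parameter names are hypothetical, and version IDs are assumed to be plain strings that can be compared for equality.

```python
from typing import Optional


def should_use_cache(
    local_version: Optional[str],
    requested_version: Optional[str],
    latest_version: str,
) -> bool:
    """Return True when the locally cached copy can be used as-is.

    Hypothetical helper illustrating the documented rules; the real SDK
    may resolve and compare versions differently.
    """
    # Nothing cached yet: a download is always needed.
    if local_version is None:
        return False
    # None means "latest", so resolve it to the Engine's latest version ID.
    target = latest_version if requested_version is None else requested_version
    # Cache hit only when the cached version is exactly the one wanted.
    return local_version == target


if __name__ == "__main__":
    # Cached copy already matches the latest version.
    print(should_use_cache("v2", None, "v2"))
    # Cached copy is outdated, so the new version would be downloaded.
    print(should_use_cache("v1", None, "v2"))
```

Treating None as "resolve to the latest version first" keeps the three documented cases in a single comparison instead of three separate branches.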
Notes
- Datasets must have status “active” to be loaded
- The returned DataFrame preserves the original column order and data types