OnyxDataset to the Onyx Engine, making it available for model training.
Parameters
The name for the new dataset. Must be a non-empty string.
The
OnyxDataset object containing the dataframe and metadata.Source datasets used to create this dataset, for lineage tracking. Each dictionary should have:
name(str): Name of the source datasetversion_id(str, optional): Specific version, or latest if not provided
The time format of your data. Options:
"s"- Seconds (default)"ms"- Milliseconds"us"- Microseconds"ns"- Nanoseconds"datetime"- Python datetime objects"none"- No time column
Returns
None. Prints a confirmation message when upload completes.Raises
Exception: If the dataset dataframe is emptyException: If the name is an empty stringException: If a source dataset is not found in the Engine
Example
With Source Tracking
Notes
- The dataset is saved locally before uploading (in
~/.onyx/datasets/) - Float64 columns are automatically converted to float32 for efficiency
- The dataset becomes available for training once processing completes