The ax datasets commands are currently in BETA. The API may change without notice. A one-time warning is emitted on first use.
The ax datasets commands let you create, retrieve, and manage datasets on the Arize platform. Supported file formats: CSV, JSON, JSON Lines, Parquet.
ax datasets list
List all datasets in a space.
ax datasets list [--space <id>] [--name <filter>] [--limit <n>] [--cursor <cursor>]
| Option | Description |
|---|
--space | Filter by space name or ID |
--name | Case-insensitive substring filter on dataset name |
--limit | Maximum number of results to return (default: 15) |
--cursor | Pagination cursor for the next page |
Examples:
ax datasets list --space sp_abc123
ax datasets list --space sp_abc123 --output datasets.json
ax datasets create
Create a new dataset from a local file.
ax datasets create --name <name> --space <id> (--file <path> | --json <json-array>)
| Option | Description |
|---|
--name | Name for the new dataset |
--space | Space name or ID to create the dataset in |
--file | Path to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin |
--json | Inline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]' |
Examples:
ax datasets create --name "my-eval-set" --space sp_abc123 --file ./examples.csv
ax datasets create --name "my-eval-set" --space my-space --json '[{"q": "What is AI?"}]'
ax datasets get
Retrieve a dataset by name or ID.
ax datasets get <name-or-id> [--space <id>]
| Option | Description |
|---|
--space | Space name or ID (required when using a dataset name instead of ID) |
Example:
ax datasets get ds_xyz789
ax datasets get my-eval-set --space my-space
ax datasets delete
Delete a dataset.
ax datasets delete <name-or-id> [--space <id>] [--force]
| Option | Description |
|---|
--space | Space name or ID (required when using a dataset name instead of ID) |
--force | Skip the confirmation prompt |
Examples:
ax datasets delete ds_xyz789
ax datasets delete my-eval-set --space my-space --force
ax datasets export
Export examples from a dataset to a JSON file.
ax datasets export <name-or-id> [--space <id>] [--version-id <id>] [--output-dir <path>] [--stdout] [--all]
| Option | Description |
|---|
--space | Space name or ID (required when using a dataset name instead of ID) |
--version-id | Export examples from a specific dataset version (default: latest) |
--output-dir | Directory to write the output file (default: current directory) |
--stdout | Print JSON to stdout instead of saving to a file |
--all | Use Arrow Flight for bulk export — streams all examples |
Examples:
ax datasets export ds_xyz789
ax datasets export my-eval-set --space my-space --version-id v_123 --output-dir ./exports
ax datasets export ds_xyz789 --all
ax datasets export ds_xyz789 --stdout | jq 'length'
ax datasets append
Append examples to an existing dataset. Provide examples via --json (inline) or --file. Exactly one input source is required.
ax datasets append <name-or-id> (--json <json-array> | --file <path>) [--space <id>] [--version-id <id>]
| Option | Description |
|---|
--json | Inline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]' |
--file | Path to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin |
--space | Space name or ID (required when using a dataset name instead of ID) |
--version-id | Dataset version to append to (default: latest version) |
Examples:
# Append from a file
ax datasets append ds_xyz789 --file ./new_examples.csv
# Append inline JSON
ax datasets append ds_xyz789 --json '[{"question": "What is AI?", "answer": "..."}]'
# Append to a specific version
ax datasets append my-eval-set --space my-space --file ./more.jsonl --version-id v_123