The ax datasets commands are currently in BETA. The API may change without notice. A one-time warning is emitted on first use.
The ax datasets commands let you create, retrieve, and manage datasets on the Arize platform. Supported file formats: CSV, JSON, JSON Lines, Parquet.

ax datasets list

List all datasets in a space.
ax datasets list [--space <id>] [--name <filter>] [--limit <n>] [--cursor <cursor>]
Option      Description
--space     Filter by space name or ID
--name      Case-insensitive substring filter on dataset name
--limit     Maximum number of results to return (default: 15)
--cursor    Pagination cursor for the next page
Examples:
ax datasets list --space sp_abc123
ax datasets list --space sp_abc123 --output datasets.json

ax datasets create

Create a new dataset from a local file or an inline JSON array.
ax datasets create --name <name> --space <id> (--file <path> | --json <json-array>)
Option      Description
--name      Name for the new dataset
--space     Space name or ID to create the dataset in
--file      Path to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin
--json      Inline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]'
Examples:
ax datasets create --name "my-eval-set" --space sp_abc123 --file ./examples.csv
ax datasets create --name "my-eval-set" --space my-space --json '[{"q": "What is AI?"}]'
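For larger inputs, a file is usually easier to manage than inline JSON. The sketch below writes a small JSON Lines file and passes it to ax datasets create. The field names question and answer, the file name examples.jsonl, and the space ID sp_abc123 are illustrative placeholders, not required values; the CLI call is skipped when ax is not installed so the file-building steps can be tried on their own.

```shell
#!/bin/sh
set -eu

# Write two examples in JSON Lines format (one JSON object per line).
# The field names "question" and "answer" are illustrative only.
cat > examples.jsonl <<'EOF'
{"question": "What is AI?", "answer": "The study of intelligent agents."}
{"question": "What is ML?", "answer": "Learning patterns from data."}
EOF

# Sanity check: count the examples before uploading.
count=$(wc -l < examples.jsonl | tr -d ' ')
echo "examples: $count"

# sp_abc123 is a placeholder space ID.
# The call is skipped when the ax CLI is not on PATH.
if command -v ax >/dev/null 2>&1; then
  ax datasets create --name "my-eval-set" --space sp_abc123 --file ./examples.jsonl
fi
```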

ax datasets get

Retrieve a dataset by name or ID.
ax datasets get <name-or-id> [--space <id>]
Option      Description
--space     Space name or ID (required when using a dataset name instead of ID)
Examples:
ax datasets get ds_xyz789
ax datasets get my-eval-set --space my-space

ax datasets update

Rename a dataset.
ax datasets update <name-or-id> --name <new-name> [--space <id>]
Option      Description
--name      New name for the dataset
--space     Space name or ID (required when using a dataset name instead of ID)
Examples:
ax datasets update ds_xyz789 --name "renamed-eval-set"
ax datasets update my-eval-set --space my-space --name "renamed-eval-set"

ax datasets delete

Delete a dataset.
ax datasets delete <name-or-id> [--space <id>] [--force]
Option      Description
--space     Space name or ID (required when using a dataset name instead of ID)
--force     Skip the confirmation prompt
Examples:
ax datasets delete ds_xyz789
ax datasets delete my-eval-set --space my-space --force

ax datasets export

Export examples from a dataset to a JSON file.
ax datasets export <name-or-id> [--space <id>] [--version-id <id>] [--output-dir <path>] [--stdout] [--all]
Option          Description
--space         Space name or ID (required when using a dataset name instead of ID)
--version-id    Export examples from a specific dataset version (default: latest)
--output-dir    Directory to write the output file (default: current directory)
--stdout        Print JSON to stdout instead of saving to a file
--all           Use Arrow Flight for bulk export; streams all examples
Examples:
ax datasets export ds_xyz789
ax datasets export my-eval-set --space my-space --version-id v_123 --output-dir ./exports
ax datasets export ds_xyz789 --all
ax datasets export ds_xyz789 --stdout | jq 'length'

ax datasets append

Append examples to an existing dataset. Provide examples via --json (inline) or --file. Exactly one input source is required.
ax datasets append <name-or-id> (--json <json-array> | --file <path>) [--space <id>] [--version-id <id>]
Option          Description
--json          Inline JSON array of examples, e.g. '[{"question": "...", "answer": "..."}]'
--file          Path to the data file (CSV, JSON, JSONL, or Parquet), or - for stdin
--space         Space name or ID (required when using a dataset name instead of ID)
--version-id    Dataset version to append to (default: latest version)
Examples:
# Append from a file
ax datasets append ds_xyz789 --file ./new_examples.csv

# Append inline JSON
ax datasets append ds_xyz789 --json '[{"question": "What is AI?", "answer": "..."}]'

# Append to a specific version
ax datasets append my-eval-set --space my-space --file ./more.jsonl --version-id v_123

ax datasets annotate-examples

Annotate a batch of examples in a dataset. Provide annotations via --file (JSON, JSONL, CSV, or Parquet; use - for stdin). Each record must have a record_id (the dataset example ID) and values (a list of annotation dicts with at least name, plus optionally score, label, or text). Annotations are upserted; up to 1000 examples may be annotated per request.
ax datasets annotate-examples <name-or-id> --file <path> [--space <id>]
Option      Description
--file      Path to a file containing annotation records (JSON, JSONL, CSV, Parquet), or - for stdin
--space     Space name or ID (required when using a dataset name instead of ID)
Examples:
ax datasets annotate-examples ds_xyz789 --file ./annotations.jsonl
ax datasets annotate-examples my-eval-set --space my-space --file ./annotations.json
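
The record shape described above can be sketched as a JSON Lines file. This is a minimal illustration under stated assumptions: the example IDs (ex_001, ex_002), the annotation name correctness, and the numeric form of score are hypothetical, and it assumes that one CLI invocation corresponds to one request for the purposes of the 1000-example limit. The file is split into parts of at most 1000 records, and each part is given a .jsonl extension in case the format is inferred from the file name.

```shell
#!/bin/sh
set -eu

# Each line is one annotation record: record_id is the dataset example ID,
# values is a list of annotations with a required name and an optional
# score, label, or text. IDs and annotation names here are hypothetical.
cat > annotations.jsonl <<'EOF'
{"record_id": "ex_001", "values": [{"name": "correctness", "score": 1.0, "label": "correct"}]}
{"record_id": "ex_002", "values": [{"name": "correctness", "label": "incorrect", "text": "Cites the wrong source."}]}
EOF

# Split into parts of at most 1000 records to stay within the documented limit.
split -l 1000 annotations.jsonl annotations_part_

# split names the parts annotations_part_aa, annotations_part_ab, ...
for part in annotations_part_??; do
  mv "$part" "$part.jsonl"
  echo "$part.jsonl: $(wc -l < "$part.jsonl" | tr -d ' ') records"
  # ds_xyz789 is a placeholder dataset ID; skipped when ax is not installed.
  if command -v ax >/dev/null 2>&1; then
    ax datasets annotate-examples ds_xyz789 --file "$part.jsonl"
  fi
done
```

Because annotations are upserted, re-running a part after a failed upload should update the same annotations rather than duplicate them.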