Create a new dataset with JSON examples. Empty datasets are not allowed. Dataset names must be unique within a given space.
Example payload
examples[] may contain any user-defined fields.id, created_at, updated_at.
Requests that contain these fields in any example will be rejected.{} is invalid).Valid example (create)
{
"name": "my-dataset",
"spaceId": "spc_123",
"examples": [
{ "question": "What is 2+2?", "answer": "4", "topic": "math" }
]
}
Invalid example (contains system fields)
{
"name": "my-dataset",
"spaceId": "spc_123",
"examples": [
{ "id": "ex_1", "input": "Hello" } // ❌ 'id' not allowed on create
]
}
⚠️ Beta Warning: This endpoint is in beta, read more here.
Most Arize AI endpoints require authentication. For those endpoints that require authentication, include your API key in the request header using the format
Authorization: Bearer <api-key>Body containing dataset creation parameters
A dataset object
A dataset is a structured collection of examples used to test and evaluate LLM applications. Datasets allow you to test models consistently across any real-world scenarios and edge cases, quickly identify regressions, and track measurable improvements.
Unique identifier for the dataset
Name of the dataset
Unique identifier for the space this dataset belongs to
Timestamp for when the dataset was created
Timestamp for the last update of the dataset
List of versions associated with this dataset