Phoenix
Datasets & Experiments

Cookbooks

Iteratively improve your LLM task by building datasets, running experiments, and evaluating performance with code-based and LLM-as-a-judge evaluators.
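
A minimal sketch of that loop using the Phoenix Python client is shown below. It assumes a running Phoenix server and the `arize-phoenix` package; the dataset name, example rows, and the stubbed task are hypothetical placeholders for your own data and LLM call.

```python
# Sketch of the dataset -> experiment -> evaluation loop.
# Assumes a running Phoenix instance and the `arize-phoenix` package.
import pandas as pd
import phoenix as px
from phoenix.experiments import run_experiment

# 1. Build a dataset from a few labeled examples (hypothetical data).
client = px.Client()
dataset = client.upload_dataset(
    dataset_name="qa-examples",  # hypothetical name
    dataframe=pd.DataFrame(
        {
            "question": ["What is Phoenix?", "What is an experiment?"],
            "answer": [
                "An open-source LLM observability tool.",
                "A scored run of a task over a dataset.",
            ],
        }
    ),
    input_keys=["question"],
    output_keys=["answer"],
)

# 2. Define the task under test. `input` is the example's input dict;
#    a stub stands in here for a real LLM call.
def task(input) -> str:
    return "An open-source LLM observability tool."

# 3. Define a code-based evaluator; `expected` is the example's output dict.
def matches_expected(output: str, expected: dict) -> bool:
    return output.strip().lower() == expected["answer"].strip().lower()

# 4. Run the experiment; per-example scores are recorded in Phoenix
#    so runs can be compared side by side.
experiment = run_experiment(dataset, task, evaluators=[matches_expected])
```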

Comprehensive Use Cases

  • Text2SQL
  • Summarization Service
  • Email Text Extraction
  • Pairwise Evaluator

RAG Use Cases

  • Answer and Context Relevancy Evals
  • Response Guideline Evals
  • LlamaIndex RAG with Reranker
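
The relevancy-style evals above are typically run as LLM-as-a-judge classifications. The following is a rough sketch, assuming the `phoenix.evals` module with its built-in RAG relevancy template and an OpenAI API key; the judge model choice and sample rows are assumptions.

```python
# Rough sketch of an LLM-as-a-judge relevancy evaluation,
# assuming `phoenix.evals` and an OpenAI API key.
import pandas as pd
from phoenix.evals import (
    OpenAIModel,
    RAG_RELEVANCY_PROMPT_RAILS_MAP,
    RAG_RELEVANCY_PROMPT_TEMPLATE,
    llm_classify,
)

# Each row pairs a query ("input") with a retrieved document ("reference").
df = pd.DataFrame(
    {
        "input": ["What is Phoenix?"],
        "reference": ["Phoenix is an open-source AI observability platform."],
    }
)

# The judge labels each row with one of the template's rails
# (e.g. "relevant" / "unrelated").
rails = list(RAG_RELEVANCY_PROMPT_RAILS_MAP.values())
relevance = llm_classify(
    dataframe=df,
    model=OpenAIModel(model="gpt-4o-mini"),  # hypothetical model choice
    template=RAG_RELEVANCY_PROMPT_TEMPLATE,
    rails=rails,
    provide_explanation=True,  # ask the judge to justify each label
)
print(relevance[["label", "explanation"]])
```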
