Agent Cookbooks

Tracing and Evaluating Agents

Agent Cookbook

Build a customer support agent to trace activity, assess performance, and experiment with prompts and models.

Evaluate an Agent

Trace and evaluate a "talk-to-your-data" agent. Includes evaluations for function calling accuracy, SQL query generation, code generation, and agent execution path.

OpenAI Agents SDK Cookbook

Create an agent with the OpenAI Agents SDK, trace its activity, benchmark with datasets, run experiments, and evaluate traces in production.

Using Ragas to Evaluate a Math Problem-Solving Agent

Create an agent using the OpenAI Agents SDK, trace its interactions, and evaluate performance using Ragas.

Tracing and Evaluating a LangChain OpenAI Agent

Build your own LangChain OpenAI agent using the function-calling API and inspect the agent's internals—all in a minimal setup with conversation and tool use.

Tracing and Evaluating a LlamaIndex OpenAI Agent

Use the function-calling API to create a LlamaIndex OpenAI agent capable of conversation and tool use, and explore its behavior with Phoenix.

PreviousFeatured Tutorials NextAgent Demos

Last updated 22 days ago

Was this helpful?