Only this pageAll pages
Powered by GitBook
1 of 38

Release Notes

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

Loading...

04.30.2025: Span Querying & Data Extraction for Phoenix Client 📊

Available in Phoenix 8.30+

The Phoenix client now includes the SpanQuery DSL, enabling more advanced and flexible span querying for distributed tracing and telemetry data. This allows users to perform complex queries on span data, improving trace analysis and debugging.

In addition, the get_spans_dataframe method has been migrated, offering an easy-to-use way to extract span-related information as a Pandas DataFrame. This simplifies data processing and visualization, making it easier to analyze trace data within Python-based environments.

Improvements and Bug Fixes 🐛

: Add "Copy Name" button to project menu

: Add independent flags for whether TLS is enabled for HTTP and gRPC servers

: Log playground subscription errors

: New RBAC primitives have been introduced for FastAPI and REST APIs

Projects
TLS
Playground
API

04.28.2025: Improved Shutdown Handling 🛑

Available in Phoenix 8.28+

When stopping the Phoenix server via Ctrl+C, the shutdown process now exits cleanly without displaying a traceback or returning a non-zero exit code. Previously, a KeyboardInterrupt and CancelledError traceback could appear, ending the process with status code 130. The server now swallows the interrupt for a smoother shutdown experience, exiting with code 0 by default to reflect intentional termination.

Improvements and Bug Fixes 🐛

04.28.2025: TLS Support for Phoenix Server 🔐

Available in Phoenix 8.29+

Phoenix now supports Transport Layer Security (TLS) for both HTTP and gRPC connections, enabling encrypted communication and optional mutual TLS (mTLS) authentication. This enhancement provides a more secure foundation for production deployments.

Highlights:

  • Secure HTTP & gRPC Connections: Phoenix can now serve over HTTPS and secure gRPC.

  • Flexible TLS Configuration: TLS settings are managed via environment variables.

  • Optional Client Verification: Support for mTLS with configurable client certificate validation.

  • Improved Testing: TLS-aware infrastructure added to integration tests.

  • Better Visibility: Server startup logs now display TLS status.

Configuration Options

Set the following environment variables to enable and customize TLS:

Variable
Type
Description

PHOENIX_TLS_ENABLED

boolean

Enable or disable TLS (true/false)

PHOENIX_TLS_CERT_FILE

string

Path to TLS certificate file

PHOENIX_TLS_KEY_FILE

string

Path to private key file

PHOENIX_TLS_KEY_FILE_PASSWORD

string

Password for encrypted private key file

PHOENIX_TLS_CA_FILE

string

Path to CA certificate (for client verification)

PHOENIX_TLS_VERIFY_CLIENT

boolean

Enable client cert verification

Note: Encrypted private keys require the cryptography Python package for decryption.

04.18.2025: Tracing for MCP Client-Server Applications 🔌

Available in Phoenix 8.26+

What’s New?

This release introduces automatic distributed tracing for Anthropic’s Model Context Protocol (MCP). Using OpenTelemetry, you can now:

  • Propagate context across MCP client-server boundaries

  • Generate end-to-end traces of your AI system across services and languages

  • Gain full visibility into how models access and use external context

The openinference-instrumentation-mcp package handles this for you by:

  • Creating spans for MCP client operations

  • Injecting trace context into MCP requests

  • Extracting and continuing the trace context on the server

  • Associating the context with OTEL spans on the server side

Set up

  1. Instrument both MCP client and server with OpenTelemetry.

  2. Add the openinference-instrumentation-mcp package.

  3. Spans will propagate across services, appearing as a single connected trace in Phoenix.

Full example usage is available:

Walkthrough Video

Acknowledgments

Big thanks to Adrian Cole and Anuraag Agrawal for their contributions to this feature.

04.01.2025: Support for MCP Span Tool Info in OpenAI Agents SDK 🔨

Available in Phoenix 8.20+

Newly added to the OpenAI Agent SDK is support for MCP Span Info, allowing for the tracing and extraction of useful information about MCP tool listings. Use the Phoenix OpenAI Agents SDK for powerful agent tracing.

05.09.2025: Annotations, Data Retention Policies, Hotkeys 📓

Phoenix v9.0.0 release brings major updates to annotation support, and a whole host of other improvements.

Up until now, Phoenix has only supported one annotation of a given type on each trace. We've now unlocked that limit, allowing you to capture multiple values of an annotation label on each span.

In addition, we've added:

  • API support for annotations - create, query, and update annotations through the REST API

  • Additional support for code evaluations as annotations

  • Support for arbitrary metadata on annotations

  • Annotation configurations to structure your annotations within and across projects

Data Retention 💿

Now you can create custom global and per-project data retention polices to remove traces after a certain window of time, or based on number of traces. Additionally, you can now view your disk usage in the Settings page of Phoenix.

Hotkeys 🔥

We've added hotkeys to Phoenix!

You can now use j and k to quickly page through your traces, and e and n to add annotations and notes - you never have to lift your hands off the keyboard again!

Full v9.0.0 Release

04.15.2025: Display Tool Call and Result IDs in Span Details 🫆

Available in Phoenix 8.25+

Improvements and Bug Fixes 🐛

  • Performance: Do not refetch tables when trace and span details closed

  • UI: Redirect /v1/traces to root path

  • Playground: Update GPT-4.1 models in Playground

05.20.2025: Datasets and Experiment Evaluations in the JS Client

We've added a host of new methods to the JS client:

Full list of supported JS/TS Client Methods:

04.16.2025: API Key Generation via API 🔐

Available in Phoenix 8.26+

Phoenix now supports programmatic API key creation through a new endpoint, making it easier to automate project setup and trace logging. To enable this, set the PHOENIX_ADMIN_SECRET environment variable in your deployment.

Improvements and Bug Fixes 🐛

03.19.2025: Access to New Integrations in Projects 🔌

Available in Phoenix 8.15+

03.14.2025: OpenAI Agents Instrumentation 📡

Available in Phoenix 8.13+

We've introduced the OpenAI Agents SDK for Python which provides enhanced visibility into agent behavior and performance.

Installation

pip install openinference-instrumentation-openai-agents openai-agents
  • Includes an OpenTelemetry Instrumentor that traces agents, LLM calls, tool usage, and handoffs.

  • With minimal setup, use the register function to connect your app to Phoenix and view real-time traces of agent workflows.

For more details on a quick setup, check out our integration documentation:

Walkthrough Video

Improvements and Bug Fixes 🐛

03.20.2025: Delete Experiment from Action Menu 🗑️

Available in Phoenix 8.19+

Improvements and Bug Fixes 🐛

03.27.2025 Span View Improvements 👀

Available in Phoenix 8.20+

Improvements and Bug Fixes 🐛

03.18.2025: Resize Span, Trace, and Session Tables 🔀

Available in Phoenix 8.14+

We've added the ability to resize Span, Trace, and Session tables. Resizing preferences are now persisted in the tracing store, ensuring settings are maintained per-project and per-table.

Improvements and Bug Fixes 🐛

03.06.2025: Project Improvements 📽️

Available in Phoenix 8.5+

We’ve introduced several enhancements to Projects, providing greater flexibility and control over how you interact with data. These updates include:

  • Custom Time Ranges: You can now specify custom time ranges to filter traces and spans.

  • Root Span Filter for Spans: Improved filtering options allow you to filter by root spans, helping to isolate and debug issues more effectively.

Improvements and Bug Fixes 🐛

03.07.2025: Model Config Enhancements for Prompts 💡

Available in Phoenix 8.11+

  • Save and Load from Prompts: You can now save and load configurations directly from prompts.

  • Save and Load from Default Model Config: Default model configurations can be saved and loaded.

  • Budget Token Management: Added the ability to adjust the budget token value.

  • Thinking Configuration Toggle: You can now enable or disable the “thinking” feature.

Important Note: The default model config does not automatically apply to saved prompts. To include default thinking settings, ensure they are saved within the specific prompt.

Improvements and Bug Fixes 🐛

07.11.2024: Hosted Phoenix and LlamaTrace 💻

Phoenix is now available for deployment as a fully hosted service.

With hosted instances, your data is stored between sessions, and you can easily share your work with team members.

We are partnering with LlamaIndex to power a new observability platform in LlamaCloud: LlamaTrace. LlamaTrace will automatically capture traces emitted from your LlamaIndex applications, and store them in a persistent, cloud- accessible Phoenix instance.

04.09.2025: Project Management API Enhancements ✨

Available in Phoenix 8.24+

This update enhances the Project Management API with more flexible project identification:

  • Enhanced project identification: Added support for identifying projects by both ID and hex-encoded name and introduced a new _get_project_by_identifier helper function

Also includes streamlined operations, better validation & error handling, and expanded test coverage.

Improvements and Bug Fixes 🐛

04.03.2025: Phoenix Client Prompt Tagging 🏷️

Available in Phoenix 8.22+

We’ve added support for Prompt Tagging in the Phoenix client. This new feature gives you more control and visibility over your prompts throughout the development lifecycle.

  • Tag prompts directly in your code and see those tags reflected in the Phoenix UI.

  • Label prompt versions as development, staging, or production — or define your own custom tags.

  • Add tag descriptions to provide additional context or list out all tags.

Improvements and Bug Fixes 🐛

04.25.2025: Scroll Selected Span Into View 🖱️

Available in Phoenix 8.27+

Improved trace navigation by automatically scrolling the selected span into view when a user navigates to a specific trace. This enhancement eliminates the need for manual searching or scrolling, allowing users to immediately focus on the span of interest. It's especially useful when navigating from links or alerts that point to a specific span, improving debugging efficiency. This change contributes to a smoother and more intuitive trace exploration experience.

Improvements and Bug Fixes 🐛

04.02.2025 Improved Span Annotation Editor ✍️

Available in Phoenix 8.21+

The new span aside moves the Span Annotation editor into a dedicated panel, providing a clearer view for adding annotations and enhancing customization of your setup. Read this documentation to learn how annotations can be used.

Improvements and Bug Fixes 🐛

05.30.2025: xAI and Deepseek Support in Playground

Phoenix v10.5.0 now supports Deepseek and xAI models in Playground natively. Previous versions of Phoenix supported these as custom model endpoints, but that process has now been streamlined to offer these model providers from the main Playground dropdown.

03.24.2025: Tracing Configuration Tab 🖌️

Available in Phoenix 8.19+

Within each project, there is now a Config tab to enhance customization. The default tab can now be set per project, ensuring the preferred view is displayed.

Improvements and Bug Fixes 🐛

  • Experiments: Included delete experiment option to action menu

  • Feature: Added support for specifying admin users via an environment variable at startup

  • Annotation: Now displays metadata

  • Settings Page: Now split across tabs for improved navigation and easier access

  • Feedback: Added full metadata

  • Projects: Improved performance

  • UI: Added date format descriptions to explanations

03.21.2025: Environment Variable Based Admin User Configuration 🗝️

Available in Phoenix 8.17+

You can now specify one or more admin users at startup using an environment variable. This is especially useful for managed deployments, allowing you to define admin access in a manifest or configuration file. The specified users will be automatically seeded into the database, enabling immediate login without manual setup.

Improvements and Bug Fixes 🐛

: Use Float for token count summaries

: Improve browser compatibility for table sizing

: Simplify homeLoaderQuery to prevent idle timeout errors

We’re excited to announce a powerful capability in the OSS library openinference-instrumentation-mcp — seamless OTEL context propagation for MCP clients and servers.

🏷️

Tool call and result IDs are now shown in the span details view. Each ID is placed within a collapsible header and can be easily copied. This update also supports spans with multiple tool calls. Get started with tracing your tool calls .

- allows you to retrieve an Experiment to view its results, and run evaluations on it

- allows you to evaluate previously run Experiments using LLM as a Judge or Code-based evaluators

- allows you to create Datasets in Phoenix using the client

- allows you to append additional examples to a Dataset

: Add load more and loading state to the infinite scroll

: Hide menu for changing role for self in UsersTable

: Prevent admins from changing their own roles

: Remove WebSocket dependency and migrate to Multipart Subscriptions

In the New Project tab, we've added quick setup to instrument your application for , , and the .

Easily configure all integrations with streamlined instructions. Check out all Phoenix here.

: Azure API key made optional, included specialized UI for thinking budget parameter

: Make the spans table the default tab

: Added react-aria Tabs components

: Download experiment runs and annotations as CSV

You can now delete experiments directly from the action menu, making it quicker to manage and clean up your workspace. This update streamlines experiment management by reducing the steps needed to remove outdated or unnecessary runs. Get started with experiments .

: Show the date format in the explanation

You can now toggle the option to treat orphan spans as root when viewing your spans. Additionally, we've enhanced the UI with an icon view in span details for better visibility in smaller displays. Learn more in our .

: Disable streaming when a dialog is open

: Removed unpredictable playground transformations

: Remove shadow on button group

: Fixed broken popovers

: Your selected columns will now remain consistent across sessions, ensuring a more seamless workflow.

: Easily filter data directly from the table view using metadata attributes.

: Quickly apply common metadata filters for faster navigation.

: Major speed improvements in project tracing views & visibility into database usage in settings

: Query to get number of spans for each trace

: Show + n more spans in trace table

: Add Token component

: Remove double fetching of spans

: Don't fetch new traces when the traces slideover is visible

: Fix scrolling on trace tree

: Added annotations to experiment JSON downloads

: Add none as option for tool choice for anthropic 0.49.0

: Port slider component to react-aria

In addition to our existing notebook, CLI, and self-hosted deployment options, we’re excited to announce that Phoenix is now available as a .

Hosted Phoenix is 100% free-to-use, .

: Restore streaming

: update Gemini models

: Route user to forgot-password page in welcome email url

Check out documentation on .

: Add aiohttp to container for azure-identity

: Add /readyz endpoint to confirm database connectivity

: Allow scroll on settings page

: Allow the option to have no configured working directory when using Postgres

: Cache project table results when toggling the details slide-over for improved performance

: Add chat and message components for note-taking

Learn more in .

: Use correlated subquery for orphan spans

: Add toggle to treat orphan spans as root

: Upgrade react-router, vite, vitest

: Smaller page sizes

: Improved performance on projects page

: Allow hover anywhere on experiment cell

: Show metadata

: Show full metadata

Fix
Enhancement
UX
OpenInference
Annotations
here
getExperiment
evaluateExperiment
createDataset
appendDatasetExamples
Tracing
UI
Security
Infrastructure
Prompt Playground
Performance
Components
Enhancement
UI
Performance
Playground
UI
UI
Persistent Column Selection on Tables
Metadata Filters from the Table
Metadata Quick Filters
Performance
GraphQL
Performance
Components
Performance
Performance
UI
Experiments
Playground
UI
fully hosted service
check it out today
Performance
Playground
Enhancement
prompt tags
Infrastructure
Enhancement
Enhancement
Enhancement
Performance
UI
Tracing
Spans
Performance
Performance
Projects
Experiments
Annotations
Feedback

04.09.2025: New REST API for Projects with RBAC 📽️

Available in Phoenix 8.23+

This release introduces a REST API for managing projects, complete with full CRUD functionality and access control. Key features include:

  • CRUD Operations: Create, read, update, and delete projects via the new API endpoints.

  • Role-Based Access Control:

    • Admins can create, read, update, and delete projects

    • Members can create and read projects, but cannot modify or delete them.

  • Additional Safeguards: Immutable Project Names, Default Project Protection, Comprehensive Integration Tests

Improvements and Bug Fixes 🐛

12.09.2024: Sessions 💬

Available in Phoenix 7.0+

Sessions allow you to group multiple responses into a single thread. Each response is still captured as a single trace, but each trace is linked together and presented in a combined view.

Bug Fixes and Improvements 🐛

  • Prompt Playground: Added support for arbitrary string model names Added support for Gemini 2.0 Flash Improved template editor ergonomics

  • Evals: Added multimodal message template support

  • Tracing: Added JSON pretty printing for structured data outputs (thank you sraibagiwith100x!) Added a breakdown of token types in project summary

  • Bug Fixes: Changed trace latency to be computed every time, rather than relying on root span latency, Added additional type checking to handle non-string values when manually instrumenting (thank you Manuel del Verme!)

05.14.2025: Experiments in the JS Client

You can now run Experiments using the Phoenix JS client! Use Experiments to test different iterations of your applications over a set of test cases, then evaluate the results.

This release includes:

  • Native tracing of tasks and evaluators

  • Async concurrency queues

  • Support for any evaluator (including bring your own evals)

Code Implementation

import { createClient } from "@arizeal/phoenix-client";
import {
  asEvaluator,
  runExperiment,
} from "@arizeal/phoenix-client/experiments";
import type { Example } from "@arizeal/phoenix-client/types/datasets";
import { Factuality } from "autoevals";
import OpenAI from "openai";

const phoenix = createClient();
const openai = new OpenAI();

/** Your AI Task */
const task = async (example: Example) => {
  const response = await openai.chat.completions.create({
    model: "gpt-4o",
    messages: [
      { role: "system", content: "You are a helpful assistant." },
      { role: "user", content: JSON.stringify(example.input, null, 2) },
    ],
  });
  return response.choices[0]?.message?.content ?? "No response";
};

await runExperiment({
  dataset: "dataset_id",
  experimentName: "experiment_name",
  client: phoenix,
  task,
  evaluators: [
    asEvaluator({
      name: "Factuality",
      kind: "LLM",
      evaluate: async (params) => {
        const result = await Factuality({
          output: JSON.stringify(params.output, null, 2),
          input: JSON.stringify(params.input, null, 2),
          expected: JSON.stringify(params.expected, null, 2),
        });
        return {
          score: result.score,
          label: result.name,
          explanation: (result.metadata?.rationale as string) ?? "",
          metadata: result.metadata ?? {},
        };
      },
    }),
  ],
});

02.19.2025: Prompts 📃

Available in Phoenix 8.0+

Phoenix prompt management will now let you create, modify, tag, and version control prompts for your applications. Some key highlights from this release:

  • Versioning & Iteration: Seamlessly manage prompt versions in both Phoenix and your codebase.

  • New TypeScript Client: Sync prompts with your JavaScript runtime, now with native support for OpenAI, Anthropic, and the Vercel AI SDK.

  • New Python Client: Sync templates and apply them to AI SDKs like OpenAI, Anthropic, and more.

  • Standardized Prompt Handling: Native normalization for OpenAI, Anthropic, Azure OpenAI, and Google AI Studio.

  • Enhanced Metadata Propagation: Track prompt metadata on Playground spans and experiment metadata in dataset runs.

03.07.2025: New Prompt Playground, Evals, and Integration Support 🦾

Available in Phoenix 8.9+

New update overview:

  • Evals: o3 support, Audio & Multi-Modal Evaluations

Improvements and Bug Fixes 🐛

07.03.2024: Datasets & Experiments 🧪

Available in Phoenix 4.6+

Datasets: Datasets are a new core feature in Phoenix that live alongside your projects. They can be imported, exported, created, curated, manipulated, and viewed within the platform, and should make a few flows much easier:

  • Fine-tuning: You can now create a dataset based on conditions in the UI, or by manually choosing examples, then export these into csv or jsonl formats readymade for fine-tuning APIs.

  • Experimentation: External datasets can be uploaded into Phoenix to serve as the test cases for experiments run in the platform.

Experiments: Our new Datasets and Experiments feature enables you to create and manage datasets for rigorous testing and evaluation of your models. You can now run comprehensive experiments to measure and analyze the performance of your LLMs in various scenarios.

02.18.2025: One-Line Instrumentation⚡️

Available in Phoenix 8.0+

Phoenix has made it even simpler to get started with tracing by introducing one-line auto-instrumentation. By using register(auto_instrument=True), you can enable automatic instrumentation in your application, which will set up instrumentors based on your installed packages.

from phoenix.otel import register

register(auto_instrument=True)

07.02.2024: Function Call Evaluations ⚒️

Available in Phoenix 4.6+

We are introducing a new built-in function call evaluator that scores the function/tool-calling capabilities of your LLMs. This off-the-shelf evaluator will help you ensure that your models are not just generating text but also effectively interacting with tools and functions as intended.

This evaluator checks for issues arising from function routing, parameter extraction, and function generation.

07.18.2024: Guardrails AI Integrations💂

Available in Phoenix 4.11+

Our integration with Guardrails AI allows you to capture traces on guard usage and create datasets based on these traces. This integration is designed to enhance the safety and reliability of your LLM applications, ensuring they adhere to predefined rules and guidelines.

01.18.2025: Automatic & Manual Span Tracing ⚙️

Available in Phoenix 7.9+

In addition to using our automatic instrumentors and tracing directly using OTEL, we've now added our own layer to let you have the granularity of manual instrumentation without as much boilerplate code.

You can now access a tracer object with streamlined options to trace functions and code blocks. The main two options are:

  • Using the decorator @tracer.chain traces the entire function automatically as a Span in Phoenix. The input, output, and status attributes are set based on the function's parameters and return value.

  • Using the tracer in a with clause allows you to trace specific code blocks within a function. You manually define the Span name, input, output, and status.

05.05.2025: OpenInference Google GenAI Instrumentation

We’ve added a Python auto-instrumentation library for the Google GenAI SDK. This enables seamless tracing of GenAI workflows with full OpenTelemetry compatibility. Traces can be exported to any OpenTelemetry collector.

Installation

For more details on how to set up the tracing integration seamlessly:

Additionally, the Google GenAI instrumentor is now supported and works seamlessly with Span Replay in Phoenix, enabling deep trace inspection and replay for more effective debugging and observability.

Acknowledgements

Big thanks to Harrison Chu for his contributions.

Check out our to test these features.

: add PHOENIX_ALLOWED_ORIGINS env

: Delete annotations in the feedback table, Make feedback table scrollable

: Allow scrolling the entire experiment compare table

: Make time range selector more accessible

: Don't close model settings dialog when picking Azure version

: improve PostgreSQL error message in launch_app

Sessions make it easier to visual multi-turn exchanges with your chatbot or agent Sessions launches with Python and TS/JS support. For more on sessions, check out and the .

Check out the docs and this for more on prompts.📝

Prompt Playground: Now supports & Anthropic Sonnet 3.7 and Thinking Budgets

Instrumentation: to trace smolagents by Hugging Face

Integrations: Phoenix now supports &

: Show percent used of DB

: Add environment variable for allocated DB storage capacity

: Delete selected traces

: Make trace tree more readable on smaller sizes

: Ensure type is correct on run_experiment

: Allow experiment run JSON downloads

: Add anthropic thinking config param

: Add ToggleButton

For more details on using datasets see our or .

For more details, check out our full .

For more details, you can check the docs and explore further options.

Check out a .

Check out the here.

Check out the for more on how to use tracer objects.

new documentation
Phoenix Server
Tracing
Experiments
Projects
Playground
Session
a walkthrough video
docs
walkthrough
GPT-4.5
Admin
Configuration
Tracing
Tracing
Experiments
Experiments
Python Client
Components
documentation
example notebook
walkthrough
full walkthrough of the evaluator
Cookbook
docs
pip install openinference-instrumentation-google-genai
Annotation Improvements
Annotation Configs

09.26.2024: Authentication & RBAC 🔐

Available in Phoenix 5.0+

We've added Authentication and Rules-based Access Controls to Phoenix. This was a long-requested feature set, and we're excited for the new uses of Phoenix this will unlock!

The auth feature set includes:

  • Secure Access: All of Phoenix’s UI & APIs (REST, GraphQL, gRPC) now require access tokens or API keys. Keep your data safe!

  • RBAC (Role-Based Access Control): Admins can manage users; members can update their profiles—simple & secure.

  • API Keys: Now available for seamless, secure data ingestion & querying.

  • OAuth2 Support: Easily integrate with Google, AWS Cognito, or Auth0. ✉ Password Resets via SMTP to make security a breeze.

Bug Fixes and Improvements 🐛

  • Numerous stability improvements to our hosted Phoenix instances accessed on app.phoenix.arize.com

  • Added a new command to easily launch a Phoenix client from the cli: phoenix serve

  • Implemented simple email sender to simplify dependencies

  • Improved error handling for imported spans

  • Replaced hdbscan with fast-hdbscan Added PHOENIX_CSRF_TRUSTED_ORIGINS environment variable to set trusted origins

  • Added support for Mistral 1.0

  • Fixed an issue that caused px.Client().get_spans_dataframe() requests to time out

11.18.2024: Prompt Playground 🛝

Available in Phoenix 6.0+

Sessions allow you to group multiple responses into a single thread. Each response is still captured as a single trace, but each trace is linked together and presented in a combined view.

Bug Fixes and Improvements 🐛

  • Added support for FastAPI and GraphQL extensions

  • Fixed a bug where Anthropic LLM as a Judge responses would be labeled as unparseable

  • Fixed a bug causing 500 errors on client.get_traces_dataset() and client.get_spans_dataframe()

  • Added the ability for authentication to work from behind a proxy

  • Added an environment variable to set default admin passwords in auth

For all the details on authentication, view our .

Sessions make it easier to visual multi-turn exchanges with your chatbot or agent Sessions launches with Python and TS/JS support. For more on sessions, check out and the .

docs
a walkthrough video
docs
@arizeai/phoenix-client
feat: Add `SpanQuery` DSL to phoenix client and include `get_spans_dataframe` to client by anticorrelator · Pull Request #7071 · Arize-ai/phoenixGitHub
feat: gracefully handle ctrl-c by codefromthecrypt · Pull Request #7305 · Arize-ai/phoenixGitHub
phoenix/tutorials/mcp/tracing_between_mcp_client_and_server at main · Arize-ai/phoenixGitHub
Release arize-phoenix: v9.0.0 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.25.0 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.26.0 · Arize-ai/phoenixGitHub
feat: capture result from MCPListToolsSpanData by RogerHYang · Pull Request #1458 · Arize-ai/openinferenceGitHub
feat: environment variables for TLS by RogerHYang · Pull Request #7296 · Arize-ai/phoenixGitHub

Release Notes

The latest from the Phoenix team.

Available in Phoenix v10.5.0+

Deepseek and xAI models are now available in Prompt Playground!

We've added a host of new methods to the JS client:

You can now run Experiments using the Phoenix JS client! Use Experiments to test different iterations of your applications over a set of test cases, then evaluate the results.

This release includes:

  • Native tracing of tasks and evaluators

  • Async concurrency queues

  • Support for any evaluator (including bring your own evals)


Major Release: Phoenix v9.0.0

Available in Phoenix 9.0.0+

Phoenix's v9.0.0 release brings with it:

  • Customizable data retention policies


We’ve added a Python auto-instrumentation library for the Google GenAI SDK. This enables seamless tracing of GenAI workflows with full OpenTelemetry compatibility. Additionally, the Google GenAI instrumentor is now supported and works seamlessly with Span Replay in Phoenix.


Available in Phoenix 8.30+

The Phoenix client now includes the SpanQuery DSL for more advanced span querying. Additionally, a get_spans_dataframe method has been added to facilitate easier data extraction for span-related information.


Available in Phoenix 8.29+

Phoenix now supports Transport Layer Security (TLS) for both HTTP and gRPC connections, enabling encrypted communication and optional mutual TLS (mTLS) authentication. This enhancement provides a more secure foundation for production deployments.


Available in Phoenix 8.28+

When stopping the Phoenix server via Ctrl+C, the shutdown process now exits cleanly with code 0 to reflect intentional termination. Previously, this would trigger a traceback with KeyboardInterrupt, misleadingly indicating a failure.


Available in Phoenix 8.27+

Improved trace navigation by automatically scrolling the selected span into view when a user navigates to a specific trace. This enhances usability by making it easier to locate and focus on the relevant span without manual scrolling.


Available in Phoenix 8.26+

We’ve released openinference-instrumentation-mcp, a new package in the OpenInference OSS library that enables seamless OpenTelemetry context propagation across MCP clients and servers. It automatically creates spans, injects and extracts context, and connects the full trace across services to give you complete visibility into your MCP-based AI systems.

Big thanks to Adrian Cole and Anuraag Agrawal for their contributions to this feature.


Available in Phoenix 8.26+

Phoenix now supports programmatic API key creation through a new endpoint, making it easier to automate project setup and trace logging. To enable this, set the PHOENIX_ADMIN_SECRET environment variable in your deployment.


Available in Phoenix 8.25+


Available in Phoenix 8.24+

This update enhances the Project Management API with more flexible project identification We've added support for identifying projects by both ID and hex-encoded name and introduced a new _get_project_by_identifier helper function.


Available in Phoenix 8.23+


Available in Phoenix 8.22+


Available in Phoenix 8.21+

The new span aside moves the Span Annotation editor into a dedicated panel, providing a clearer view for adding annotations and enhancing customization of your setup. Read this documentation to learn how annotations can be used.


Available in Phoenix 8.20+

Newly added to the OpenAI Agent SDK is support for MCP Span Info, allowing for the tracing and extraction of useful information about MCP tool listings. Use the Phoenix OpenAI Agents SDK for powerful agent tracing.


Available in Phoenix 8.20+


Available in Phoenix 8.19+


Available in Phoenix 8.17+

You can now preconfigure admin users at startup using an environment variable, making it easier to manage access during deployment. Admins defined this way are automatically seeded into the database and ready to log in.


03.20.2025: Delete Experiment from Action Menu 🗑️

Available in Phoenix 8.16+

You can now delete experiments directly from the action menu, making it quicker to manage and clean up your workspace.


Available in Phoenix 8.15+


Available in Phoenix 8.14+

We've added the ability to resize Span, Trace, and Session tables. Resizing preferences are now persisted in the tracing store, ensuring settings are maintained per-project and per-table.


Available in Phoenix 8.13+

pip install openinference-instrumentation-openai-agents openai-agents

Available in Phoenix 8.11+

You can now save and load configurations directly from prompts or default model settings. Additionally, you can adjust the budget token value and enable/disable the "thinking" feature, giving you more control over model behavior and resource allocation.


Available in Phoenix 8.9+

Prompt Playground now supports new GPT and Anthropic models new models with enhanced configuration options. Instrumentation options have been improved for better traceability, and evaluation capabilities have expanded to cover Audio & Multi-Modal Evaluations. Phoenix also introduces new integration support for LiteLLM Proxy & Cleanlabs evals.


Available in Phoenix 8.8+

We’ve rolled out several enhancements to Projects, offering more flexibility and control over your data. Key updates include persistent column selection, advanced filtering options for metadata and spans, custom time ranges, and improved performance for tracing views. These changes streamline workflows, making data navigation and debugging more efficient.


Available in Phoenix 8.0+

Phoenix prompt management will now let you create, modify, tag, and version control prompts for your applications. Some key highlights from this release:

  • Versioning & Iteration: Seamlessly manage prompt versions in both Phoenix and your codebase.

  • New TypeScript Client: Sync prompts with your JavaScript runtime, now with native support for OpenAI, Anthropic, and the Vercel AI SDK.

  • New Python Client: Sync templates and apply them to AI SDKs like OpenAI, Anthropic, and more.

  • Standardized Prompt Handling: Native normalization for OpenAI, Anthropic, Azure OpenAI, and Google AI Studio.

  • Enhanced Metadata Propagation: Track prompt metadata on Playground spans and experiment metadata in dataset runs.


Available in Phoenix 8.0+

Phoenix has made it even simpler to get started with tracing by introducing one-line auto-instrumentation. By using register(auto_instrument=True), you can enable automatic instrumentation in your application, which will set up instrumentors based on your installed packages.

from phoenix.otel import register

register(auto_instrument=True)

Available in Phoenix 7.9+

In addition to using our automatic instrumentors and tracing directly using OTEL, we've now added our own layer to let you have the granularity of manual instrumentation without as much boilerplate code.

You can now access a tracer object with streamlined options to trace functions and code blocks. The main two options are using the decorator @tracer.chain and using the tracer in a with clause.


Available in Phoenix 7.0+

Sessions allow you to group multiple responses into a single thread. Each response is still captured as a single trace, but each trace is linked together and presented in a combined view.


Available in Phoenix 6.0+

Prompt Playground is now available in the Phoenix platform! This new release allows you to test the effects of different prompts, tools, and structured output formats to see which performs best.

  • Replay individual spans with modified prompts, or run full Datasets through your variations.

  • Easily test different models, prompts, tools, and output formats side-by-side, directly in the platform.


Available in Phoenix 5.0+

We've added Authentication and Rules-based Access Controls to Phoenix. This was a long-requested feature set, and we're excited for the new uses of Phoenix this will unlock!


Available in Phoenix 4.11.0+

Our integration with Guardrails AI allows you to capture traces on guard usage and create datasets based on these traces. This integration is designed to enhance the safety and reliability of your LLM applications, ensuring they adhere to predefined rules and guidelines.


Phoenix is now available for deployment as a fully hosted service.

We are partnering with LlamaIndex to power a new observability platform in LlamaCloud: LlamaTrace. LlamaTrace will automatically capture traces emitted from your LlamaIndex application.


Available in Phoenix 4.6+

Datasets: Datasets are a new core feature in Phoenix that live alongside your projects. They can be imported, exported, created, curated, manipulated, and viewed within the platform, and make fine-turning and experimentation easier.n


Available in Phoenix 4.6+

OpenAI Agents SDK | Phoenix
feat(tracing): scroll selected span into view when navigating to a trace by mikeldking · Pull Request #7227 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.22.0 · Arize-ai/phoenixGitHub
feat: allow project name as identifier in REST path by RogerHYang · Pull Request #7064 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.17.0 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.16.0 · Arize-ai/phoenixGitHub
feat: Move Span Annotation Editor into Span Aside by cephalization · Pull Request #6937 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.14.0 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.11.0 · Arize-ai/phoenixGitHub
feat(tracing): add a config tab by mikeldking · Pull Request #6857 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.20.0 · Arize-ai/phoenixGitHub

- allows you to retrieve an Experiment to view its results, and run evaluations on it

- allows you to evaluate previously run Experiments using LLM as a Judge or Code-based evaluators

- allows you to create Datasets in Phoenix using the client

- allows you to append additional examples to a Dataset

A host of improvements to , including one-to-many support, API access, annotation configs, and custom metadata

Hotkeys!

🔌

Tool call and result IDs are now shown in the span details view. Each ID is placed within a collapsible header and can be easily copied. This update also supports spans with multiple tool calls. Get started with tracing your tool calls .

This release introduces a REST API for managing projects, complete with full CRUD functionality and access control. Key features include CRUD Operations and Role-Based Access Control. Check out our to test these features.

We’ve added support for Prompt Tagging in the Phoenix client. This new feature gives you more control and visibility over your prompts throughout the development lifecycle. Tag prompts directly in code, label prompt versions, and add tag descriptions. Check out documentation on .

You can now toggle the option to treat orphan spans as root when viewing your spans. Additionally, we've enhanced the UI with an icon view in span details for better visibility in smaller displays. Learn more .

Within each project, there is now a Config tab to enhance customization. The default tab can now be set per project, ensuring the preferred view is displayed. Learn more in .

In the New Project tab, we've added quick setup to instrument your application for BeeAI, SmolAgents, and the OpenAI Agents SDK. Easily configure these integrations with streamlined instructions. Check out all Phoenix here.

We've introduced the OpenAI Agents SDK for Python which provides enhanced visibility into agent behavior and performance. For more details on a quick setup, check out our .

💡

Check out docs for more.

Check out the docs and this for more on prompts!📝

Check out the for more on how to use tracer objects.

Sessions make it easier to visual multi-turn exchanges with your chatbot or agent Sessions launches with Python and TS/JS support. For more on sessions, check out and the .

Automatically capture traces as Experiment runs for later debugging. See for more information on Prompt Playground, or jump into the platform to try it out for yourself.

The auth feature set includes secure access, RBAC, API keys, and OAuth2 Support. For all the details on authentication, view our .

Check out the here.

In addition to our existing notebook, CLI, and self-hosted deployment options, we’re excited to announce that Phoenix is now available as a . With hosted instances, your data is stored between sessions, and you can easily share your work with team members.

Hosted Phoenix is 100% free-to-use, !

For more details on using datasets see our or .

Experiments: Our new Datasets and Experiments feature enables you to create and manage datasets for rigorous testing and evaluation of your models. Check out our full .

We are introducing a new built-in function call evaluator that scores the function/tool-calling capabilities of your LLMs. This off-the-shelf evaluator will help you ensure that your models are not just generating text but also effectively interacting with tools and functions as intended. Check out a .

🔥
05.30.2025: xAI and Deepseek Support in Playground
05.20.2025: Datasets and Experiment Evaluations in the JS Client
getExperiment
evaluateExperiment
createDataset
appendDatasetExamples
Annotations
04.30.2025: Span Querying & Data Extraction for PX Client 📊
04.28.2025: TLS Support for Phoenix Server 🔐
04.28.2025: Improved Shutdown Handling 🛑
04.25.2025: Scroll Selected Span Into View 🖱️
04.18.2025: Tracing for MCP Client-Server Applications
04.16.2025: API Key Generation via API 🔐
04.15.2025: Display Tool Call and Result IDs in Span Details 🫆
here
04.09.2025: Project Management API Enhancements ✨
04.09.2025: New REST API for Projects with RBAC 📽️
new documentation
04.03.2025: Phoenix Client Prompt Tagging 🏷️
prompt tags
04.02.2025 Improved Span Annotation Editor ✍️
04.01.2025: Support for MCP Span Tool Info in OpenAI Agents SDK 🔨
03.27.2025 Span View Improvements 👀
03.24.2025: Tracing Configuration Tab 🖌️
03.21.2025: Environmental Variable Based Admin User Configuration 🗝️
03.19.2025: Access to New Integrations in Projects 🔌
03.18.2025: Resize Span, Trace, and Session Tables 🔀
03.14.2025: OpenAI Agents Instrumentation 📡
03.07.2025: Model Config Enhancements for Prompts
03.07.2025: New Prompt Playground, Evals, and Integration Support 🦾
03.06.2025: Project Improvements 📽️
02.19.2025: Prompts 📃
walkthrough
02.18.2025: One-Line Instrumentation⚡️
01.18.2025: Automatic & Manual Span Tracing ⚙️
docs
12.09.2024: Sessions 💬
a walkthrough video
docs
11.18.2024: Prompt Playground 🛝
here
09.26.2024: Authentication & RBAC 🔐
docs
07.18.2024: Guardrails AI Integrations💂
Cookbook
07.11.2024: Hosted Phoenix and LlamaTrace 💻
fully hosted service
check it out today
07.03.2024: Datasets & Experiments 🧪
documentation
example notebook
walkthrough
07.02.2024: Function Call Evaluations ⚒️
full walkthrough of the evaluator
Logo
Experiments CLI output
Google GenAI | Phoenix
openinference-instrumentation-google-genaiPyPI
Logo
Logo
Logo
Logo
Logo
Logo
feat: REST API for CRUD operations on projects by RogerHYang · Pull Request #7006 · Arize-ai/phoenixGitHub
Release arize-phoenix: v8.0.0 · Arize-ai/phoenixGitHub
Logo
Logo
Logo
Logo
Logo
Logo
Logo
Experiments CLI output
Annotation Improvements
Releases · Arize-ai/phoenixGitHub
Logo
Logo
Logo
Logo
Logo
Logo
Logo
Logo
Logo
SmolAgents
BeeAI
OpenAI Agents SDK
05.05.2025: OpenInference Google GenAI Instrumentation
Logo
tracing integrations
05.09.2025: Annotations, Data Retention Policies, Hotkeys
05.14.2025: Experiments in the JS Client
here
tracing documentation
projects docs
SmolagentsInstrumentor
LiteLLM Proxy
Cleanlabs evals
tracing
projects docs
docs
tracing integrations
projects
here