06.2025

Arize Database (ADB)

June 25, 2025

We’re excited to introduce Arize Database (ADB), the powerful engine behind all Arize AX instances. Built for massive scale and speed, ADB processes billions of traces and petabytes of data with high efficiency.

Its robust architecture supports real-time ingestion, bulk updates, and fast querying, powering even the heaviest AI workloads reliably. ADB has long been the unsung hero of our platform, and we’re proud to bring it to light.

Playground Views

June 25, 2025

The new Prompt Playground lets you save views including prompts, dataset selections, comparison views, messages, and model selections. You can iterate and test variations seamlessly in one environment and share optimal views with your team to accelerate prompt development and evaluation.

Prompt Learning

June 25, 2025

We’re excited to launch Prompt Learning, a new workflow in Arize to accelerate prompt iteration and evaluation. With Prompt Learning, you can:

  • Run prompt optimization experiments directly in Arize

  • Incorporate text-based judgments from humans and LLMs

  • Tune and compare prompt variants to systematically improve agent behavior

More on Prompt Learning.

Agent Trajectory Evaluations

June 25, 2025

With Agent Trajectory Evaluation you can assesses the sequence of tool calls and reasoning steps your agent takes to solve a task. Key benefits:

  • Path Quality: See if your agent is following expected, efficient problem-solving paths.

  • Tool Usage Insights: Detect redundant, inefficient, or incorrect tool call patterns.

  • Debugging Visibility: Understand internal decision-making to resolve unexpected behaviors, even when outcomes appear correct.

More on Agent Trajectory Evaluations.

Session-level Evaluations

June 25, 2025

You can now evaluate your agents across entire sessions with new session-level evaluations, enabling deeper insight beyond trace-level metrics. Assess:

  • Coherence: Does the agent maintain logical consistency throughout the session?

  • Context Retention: Is it effectively remembering and building on prior exchanges?

  • Goal Achievement: Does the conversation accomplish the user’s intended outcome?

  • Conversational Progression: Is the agent navigating multi-step tasks in a natural, helpful way?

These evaluations help ensure your agents are effective not just at each step, but across the full journey. More information on Session-level Evaluations.

Agent and Multi-Agent Visualization

June 25, 2025

Easily inspect and debug multi-agent workflows with the new Agent Visibility feature. Alongside Traces and Spans, the new Agents tab auto-generates an interactive flowchart showing how agents, tools, and components interact step-by-step. With Agent Visibility, you can:

  • Visualize agent workflows end-to-end

  • Debug bottlenecks and errors with clarity

  • Link agents to traces and spans for deeper insights

  • Accelerate orchestration iteration and refinement

It works automatically across frameworks like Agno, Autogen, CrewAI, LangGraph, OpenAI Agents, and SmolAgents. More on agent tracing.

Alyx MCP Assistant

June 25, 2025

All Alyx skills are accessible via MCP, allowing seamless integration into your existing workflows. You can leverage the full suite of Alyx debugging and analysis tools wherever you build, without needing to switch contexts.

This means you can debug traces directly from your IDE while building in environments like Cursor, or connect through Claude Code to identify improvement areas. Refer to the video below for setting up Alyx via MCP in Cursor.

Arize Copilot v3: Alyx & Trace Troubleshooting

June 25, 2025

We are excited to introduce Alyx, our major upgrade to our Copilot UI. You can now drop context anywhere across the app and open copilot with the magic of ctrl+L to instantly pull context for smarter, faster help.

We’re also introducing Trace Troubleshooting — a new Copilot skill that lets you navigate the entire trace to pinpoint issues. Built with O3 under the hood, you can now:

  • @ specific spans

  • Use existing span skills for span questions or evals

  • Let Copilot traverse and diagnose like a pro

  • Ability to customize the hot key if you don’t want to use Ctrl + L

New Homepage & Onboarding Experience

June 20, 2025

We’ve just rolled out a revamped onboarding flow to guide first-time users smoothly into either Tracing or Experiments.

Last updated

Was this helpful?