The Evaluator
Your go-to blog for insights on AI observability and evaluation.

Introducing ADB: Arize’s Proprietary OLAP Database
Earlier this month, we rolled out real‑time ingestion support to every Arize AX workspace—paid and free. With that launch, Arize now ingests terabytes of data every day across hundreds of…

Arize Observe 2025 – Product Releases
Arize Observe 2025 brought a wealth of new product releases, including a redesigned copilot, agent eval options, and state-of-the-art prompt optimization techniques. Check them all out below! Copilot v3: Alyx…

The Illusion of Thinking: What the Apple AI Paper Says About LLM Reasoning
A recent paper from Apple researchers—The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity—has stirred up significant discussion in the AI…

Introducing GraphQL for Humans – Building a Text-To-GraphQL Agent In a Weekend
Working with GraphQL can often feel overwhelming, especially when you’re navigating massive schemas with tens of thousands of lines. Writing GraphQL queries is often a time-consuming task prone to errors,…

Accurate KV Cache Quantization with Outlier Tokens Tracing
Deploying large language models (LLMs) at scale is expensive—especially during inference. One of the biggest memory and performance bottlenecks? The KV Cache. In a new research paper, Accurate KV Cache…

New in Arize: Realtime Trace Ingestion, Prompt Playground Upgrades & More
In May, we expanded access to realtime trace ingestion across all Arize AX tiers, making it easier than ever to monitor LLM performance live. We also rolled out major usability…

Harnessing Databricks Mosaic AI Agent Framework and Arize for Next-Level GenAI Applications
Co-authored by Prasad Kona, Lead Partner Solutions Architect at Databricks. Building production-ready AI agents that can reliably handle complex tasks remains one of the biggest challenges in generative AI today….

Arize AI Now Generally Available As Part of Azure Native Integrations
Arize AI, a leading platform for AI observability and LLM evaluation, today announced the general availability of its platform to developers as part of Azure Native Integrations. The debut follows…

Arize AI Accelerates Enterprise AI Adoption On-Premises With NVIDIA
Arize AI, a leader in large language model (LLM) evaluation and AI observability, today announced it is delivering a high-performance, on-premises AI for enterprises seeking to deploy and scale AI…

Scalable Chain of Thoughts via Elastic Reasoning
This paper introduces Elastic Reasoning, a novel framework designed to enhance the efficiency and scalability of large reasoning models (LRMs) by explicitly separating the reasoning process into two distinct phases:…