Sarah Welsh, Author at Arize AI

Sarah Welsh

Recent posts by Sarah Welsh

Scalable Chain of Thoughts via Elastic Reasoning

This paper introduces Elastic Reasoning, a novel framework designed to enhance the efficiency and scalability of large reasoning models (LRMs) by explicitly separating the reasoning process into two distinct phases:…

Sleep-time Compute: Beyond Inference Scaling at Test-time

We recently discussed “Sleep-time Compute: Beyond Inference Scaling at Test-time,” new research from the team at Letta. The paper addresses a key challenge in using powerful AI models:…

Paper Readings Research

LibreEval: A Smarter Way to Detect LLM Hallucinations

Over the past few weeks, the Arize team has generated the largest public dataset of hallucinations, as well as a series of fine-tuned evaluation models. We wanted to create a…

Paper Readings Research