Should I Use the Same LLM for My Eval as My Agent? Testing Self-Evaluation Bias

Published October 8, 2025