AI that improves itself.

See what we shipped at Observe

How to build LLM-as-a-Judge evaluators that hold up in production

Published May 21, 2026