AI that improves itself.

See what we shipped at Observe

Judging the Judges: Evaluating Alignment and Vulnerabilities in LLMs-as-Judges

Published August 16, 2024