Arize Research

LibreEval: The Open-Source Benchmark for RAG Hallucination Detection

  • The largest open-source RAG hallucination dataset (LibreEval1.0)

  • A fine-tuned evaluation model that outperforms baseline LLMs in hallucination recall

  • Open-source tools for generating, labeling, and benchmarking hallucination datasets

Get Started

LibreEval is fully open-source and ready for use by researchers, AI engineers, and developers.