Classes of LLM Evaluations: A Deep Dive

Aparna Dhinakaran is Co-Founder and Chief Product Officer of Arize AI; Dat Ngo is an ML Solutions Architect at Arize AI. This video is part three in a series on unpacking advanced LLM evaluation techniques and best practices formulated through rigorous testing — spanning retrieval, summarization, and hallucination — to help ensure production readiness. A must-attend for AI & ML engineers and data scientists. This session covers classes of LLM evaluations.

Subscribe to our resources and blogs