Glossary of AI Terminology

What Is Prompt Evaluation?

Prompt evaluation

Prompt evaluation measures how changes to prompts affect output quality. It can compare prompt versions across the same dataset, score outputs with evaluators, and track regressions in behavior.

Prompt evaluation turns prompt engineering into an evidence-driven workflow. Instead of arguing about which wording "feels" better, teams can run experiments and compare correctness, relevance, safety, cost, latency, and task success.

Bi-weekly AI Research Paper Readings

Stay on top of emerging trends and frameworks.