Pass/fail criteria define the conditions under which an AI output, eval run, experiment, or deployment is considered acceptable. They translate quality expectations into operational gates.
Good pass/fail criteria are specific enough to act on. "The answer should be good" is not a criterion. "Fails if the response includes unsupported medical advice, omits required disclaimer text, or contradicts retrieved policy documents" is.