Pre-Built Evals
The following are simple functions built on top of the LLM evals building blocks. Each one has been pre-tested against benchmark data.
Supported Models
Models are instantiated once and then passed to the LLM Eval function. They can also be called directly with a string.
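# Import path depends on the installed Phoenix version (older releases use phoenix.experimental.evals)
from phoenix.evals import OpenAIModel
model = OpenAIModel(model_name="gpt-4", temperature=0.6)
model("What is the largest coastal city in France?")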
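The snippet above relies on two behaviors: a model object can be called like a function, and the same object can be handed to an eval function. The following is a minimal sketch of that pattern only. `StubModel` and `classify` are hypothetical names invented for illustration and are not part of the library; the stub returns a formatted string instead of calling any provider API, so it runs without credentials.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class StubModel:
    """Hypothetical stand-in for a model wrapper (not the library's class)."""

    model_name: str
    temperature: float = 0.0

    def __call__(self, prompt: str) -> str:
        # A real wrapper would send the prompt to the provider's API here.
        # The stub just tags the prompt with the model name.
        return f"[{self.model_name}] {prompt}"


def classify(model: Callable[[str], str], template: str, text: str) -> str:
    """Hypothetical eval helper: fill in the template, then call the model."""
    return model(template.format(text=text))


model = StubModel(model_name="stub-model", temperature=0.6)

# Direct call with a string, mirroring the usage shown above.
direct = model("What is the largest coastal city in France?")

# The same instance passed into an eval-style function.
via_helper = classify(model, "Is this a question? {text}", "Hello")
```

Because the eval helper only requires something callable with a string, any model wrapper that follows the same calling convention could be swapped in without changing the helper.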
We currently support a growing set of models for LLM Evals. See the Eval Models section for usage details.