Quickstart: Experiments
This guide shows you how to run experiments that test and validate changes to your LLM application against a curated dataset. For background, see the documentation on experiment concepts.
Upload a CSV as a dataset
Download the sample CSV and upload it in the UI. The CSV must have an id column, as in this example:
id,topic
1,zebras
2,clouds
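If you build the CSV yourself, it can help to check it before uploading. The following is a minimal sketch using only the Python standard library; the function name validate_dataset_csv is hypothetical, and the empty-id check is an extra assumption on top of the one requirement stated above (that an id column exists).

```python
import csv
import io


def validate_dataset_csv(text: str) -> list[dict[str, str]]:
    """Parse CSV text and confirm it has an 'id' column.

    Hypothetical helper for illustration; not part of any product API.
    """
    reader = csv.DictReader(io.StringIO(text))
    # The guide's one stated requirement: an 'id' column must be present.
    if reader.fieldnames is None or "id" not in reader.fieldnames:
        raise ValueError("CSV must have an 'id' column")

    rows = list(reader)
    # Assumption (not stated in the guide): every row should carry a non-empty id.
    for line_no, row in enumerate(rows, start=2):  # line 1 is the header
        if not (row["id"] or "").strip():
            raise ValueError(f"row at line {line_no} has an empty id")
    return rows


# The example CSV from this guide.
sample = "id,topic\n1,zebras\n2,clouds\n"
rows = validate_dataset_csv(sample)
print(rows)
```

To check a file on disk, read it first and pass the contents in, for example validate_dataset_csv(open("dataset.csv", newline="").read()), where dataset.csv is a placeholder file name.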
If you want to use your application data, you can also create a dataset from your spans.
Test a prompt in playground
Load the dataset you created into the prompt playground and run it to see your results. Once the run finishes, you can save it as an experiment to track your changes.
Run an evaluator on your playground experiments
Create a task to run evaluations on your experiment results. The evaluator task starts running in the background as soon as you create it.
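To make the idea of an evaluator concrete, here is a conceptual sketch in plain Python of a check that scores each experiment output. It does not use the product's evaluator API: EvalResult, mentions_topic, and the sample outputs are all made up for illustration, and real evaluators are often LLM-based rather than simple string checks.

```python
from dataclasses import dataclass


@dataclass
class EvalResult:
    """Hypothetical result shape: a label plus a numeric score."""
    label: str
    score: float


def mentions_topic(topic: str, output: str) -> EvalResult:
    """Toy evaluator: does the output mention the dataset row's topic?"""
    hit = topic.lower() in output.lower()
    return EvalResult(label="on_topic" if hit else "off_topic",
                      score=1.0 if hit else 0.0)


# Made-up experiment runs: dataset rows paired with invented model outputs.
runs = [
    {"id": "1", "topic": "zebras", "output": "Zebras have black and white stripes."},
    {"id": "2", "topic": "clouds", "output": "I enjoy rainy days."},
]

# Score every run and key the results by dataset row id.
results = {run["id"]: mentions_topic(run["topic"], run["output"]) for run in runs}
for row_id, result in results.items():
    print(row_id, result.label, result.score)
```

Keying results by the dataset id is what lets scores from different experiment runs be compared row by row.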
