Phoenix: Evaluating Tool Calls In LLM Pipelines

Using tool calling with LLMs, but struggling to evaluate their performance? Check out this latest video where we walkthrough our newest out of the box evaluator for Phoenix: Function Calling Evals. We'll show you how you can quickly understand where your pipeline is struggling, isolate issues, and experiment with different approaches. Example notebook: https://github.com/Arize-ai/phoenix/blob/b107d9bc848efd38f030a8c72954e89616c43723/tutorials/evals/evaluate_tool_calling.ipynb ⭐️ Star Phoenix on GitHub: https://github.com/Arize-ai/phoenix

Subscribe to our resources and blogs

Subscribe