- Run your agent on a set of similar queries
- Record the number of steps taken for each
- Calculate the convergence score:
avg(minimum steps taken for this query / steps in the run)
Evaluate agent path convergence to measure consistency in multi-step agent workflows
avg(minimum steps taken for this query / steps in the run)