What we learned testing 7 models under the same agent harness

Published May 20, 2026