The most inspiring discoveries in agent evals
Evaluate AI agent skills with this TypeScript-based tool that provides objective performance assessments.