Benchmarks
Automated LLM evaluation results for skills, plugins, and instructions. Runs weekly and on every pull request that modifies assets.
0
Total Assets Evaluated
—
Global Pass Rate
March 23, 2026
Last Run Date
🔬
No Evaluations Yet
Evaluation results will appear here after the first pipeline run. The pipeline runs automatically every Monday at 06:00 UTC, or when assets change in a pull request.
📅 Weekly on Mondays 🔀 On PR changes 💬 Via /evaluate comment 🖐 Manual dispatch