Benchmarks

Automated LLM evaluation results for skills, plugins, and instructions. Runs weekly and on every pull request that modifies assets.

0
Total Assets Evaluated
Global Pass Rate
March 23, 2026
Last Run Date
🔬

No Evaluations Yet

Evaluation results will appear here after the first pipeline run. The pipeline runs automatically every Monday at 06:00 UTC, or when assets change in a pull request.

📅 Weekly on Mondays 🔀 On PR changes 💬 Via /evaluate comment 🖐 Manual dispatch