Claude Eval Runner Plugin
Claude Code plugin for scaffolding, running, documenting, and publishing AI evaluations — with a curated ground-truth list of open-source eval tools baked in.
View on GitHubClaude Code plugin for scaffolding, running, documenting, and publishing AI evaluations — with a curated ground-truth list of open-source eval tools baked in.
View on GitHub