Agent Skills: sc-evaluate
LLM pipeline evaluation with oracle judge scoring. Runs prompts against gold standard datasets, evaluates output quality via LLM-as-judge, and generates scored reports with improvement recommendations.
UncategorizedID: tony363/superclaude/sc-evaluate
171
Install this agent skill to your local
Skill Files
Browse the full folder contents for sc-evaluate.
Loading file tree…
Select a file to preview its contents.