Agent Skills: judge-verification

An independent LLM judge evaluates task completion separately from the executing agent. It catches false success claims by reviewing the task goal, the actions taken, the final state, and the supporting evidence, then produces a PASS/FAIL verdict with a confidence score and reasoning.
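
To make the inputs and outputs described above concrete, here is a minimal Python sketch of the judge's data flow. It is not the skill's actual implementation. Every name in it (`JudgeInput`, `Verdict`, `build_judge_prompt`, `parse_verdict`, `judge`, `ask_llm`) is hypothetical, and the JSON reply format and the 0.0 to 1.0 confidence range are assumptions made for illustration.

```python
import json
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class JudgeInput:
    """What the judge reviews (field names are illustrative)."""
    task_goal: str
    actions_taken: List[str]
    final_state: str
    evidence: List[str]


@dataclass
class Verdict:
    """What the judge produces: PASS/FAIL, confidence, reasoning."""
    passed: bool
    confidence: float  # assumed to lie in [0.0, 1.0]
    reasoning: str


def build_judge_prompt(item: JudgeInput) -> str:
    """Assemble a prompt that never includes the agent's own success claim."""
    actions = "\n".join(f"- {a}" for a in item.actions_taken)
    evidence = "\n".join(f"- {e}" for e in item.evidence)
    return (
        "You are an independent judge. Decide whether the task was completed.\n\n"
        f"Task goal:\n{item.task_goal}\n\n"
        f"Actions taken:\n{actions}\n\n"
        f"Final state:\n{item.final_state}\n\n"
        f"Evidence:\n{evidence}\n\n"
        # Assumed reply format; plain string so the braces are literal.
        'Reply with JSON: {"verdict": "PASS" or "FAIL", '
        '"confidence": <number from 0.0 to 1.0>, "reasoning": "<text>"}'
    )


def parse_verdict(reply: str) -> Verdict:
    """Parse the judge's JSON reply and reject malformed verdicts."""
    data = json.loads(reply)
    verdict = str(data["verdict"]).strip().upper()
    if verdict not in ("PASS", "FAIL"):
        raise ValueError(f"unexpected verdict: {verdict!r}")
    confidence = float(data["confidence"])
    if not 0.0 <= confidence <= 1.0:
        raise ValueError(f"confidence out of range: {confidence}")
    return Verdict(verdict == "PASS", confidence, str(data.get("reasoning", "")))


def judge(item: JudgeInput, ask_llm: Callable[[str], str]) -> Verdict:
    """Run one judgment; ask_llm is any caller-supplied prompt-to-reply function."""
    return parse_verdict(ask_llm(build_judge_prompt(item)))
```

Passing the model call in as `ask_llm` keeps the sketch independent of any particular LLM client, and it lets the judge run on a different model or session than the executing agent, which is the separation the description calls for.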

Category: Uncategorized
ID: oimiragieo/agent-studio/judge-verification

Install this agent skill locally:

pnpm dlx add-skill https://github.com/oimiragieo/agent-studio/judge-verification
