Agent Skills: judge-verification
Independent LLM judge evaluates task completion separately from the executing agent, catching false success claims by reviewing task goal, actions taken, final state, and evidence. Produces PASS/FAIL with confidence score and reasoning.
UncategorizedID: oimiragieo/agent-studio/judge-verification
19
Install this agent skill to your local
Skill Files
Browse the full folder contents for judge-verification.
Loading file tree…
Select a file to preview its contents.