Agent Skills: eval-harness
Eval-driven development (EDD): define pass/fail criteria before coding, then measure results with pass@k metrics. Use when defining completion criteria or measuring agent reliability.
Uncategorized
ID: xbklairith/kisune/eval-harness
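The description mentions pass@k metrics but this page does not show how the skill computes them. As a rough sketch only, one widely used definition is the unbiased estimator 1 - C(n-c, k) / C(n, k), where n attempts were sampled and c of them passed. The function name `pass_at_k` and its parameters are illustrative assumptions, not part of eval-harness.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate the chance that at least one of k attempts passes.

    n: total attempts sampled for a task
    c: how many of those attempts met the pass criteria
    k: attempt budget being evaluated (1 <= k <= n)

    Hypothetical helper for illustration; not taken from eval-harness.
    """
    if n <= 0 or not 0 <= c <= n:
        raise ValueError("need n > 0 and 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("need 1 <= k <= n")
    # With fewer than k failures, every size-k subset includes a pass.
    if n - c < k:
        return 1.0
    # 1 minus the fraction of size-k subsets made up entirely of failures.
    return 1.0 - comb(n - c, k) / comb(n, k)


if __name__ == "__main__":
    # 10 attempts with 3 passes, evaluated at k = 1 and k = 5.
    print(pass_at_k(10, 3, 1))
    print(pass_at_k(10, 3, 5))
```

Under this definition, pass@1 equals the raw pass rate c / n, and larger k values show how much retrying improves the odds of at least one success.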