Agent Skills: ml-model-eval-benchmark

Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.

UncategorizedID: openclaw/skills/ml-model-eval-benchmark

Repository

openclawLicense: MIT
3,7881,049

Install this agent skill to your local

pnpm dlx add-skill https://github.com/openclaw/skills/ml-model-eval-benchmark

Skill Files

Browse the full folder contents for ml-model-eval-benchmark.

Download Skill

Loading file tree…

Select a file to preview its contents.