Agent Skills: ml-model-eval-benchmark

Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.

UncategorizedID: openclaw/skills/ml-model-eval-benchmark

Author

Repository

openclawLicense: MIT

3,7881,049

pnpm dlx add-skill https://github.com/openclaw/skills/ml-model-eval-benchmark

Skill Files

Browse the full folder contents for ml-model-eval-benchmark.

Loading file tree…

Select a file to preview its contents.