Agent Skills: ml-model-eval-benchmark
Compare model candidates using weighted metrics and deterministic ranking outputs. Use for benchmark leaderboards and model promotion decisions.
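The description above does not say how the skill computes scores, so the following is only a minimal sketch of weighted-metric scoring with a deterministic ranking. It assumes every metric is "higher is better" and already on a comparable scale. The function name `rank_candidates`, the metric names, and the candidate names are hypothetical and are not taken from the skill itself.

```python
from typing import Dict, List, Tuple


def rank_candidates(
    candidates: Dict[str, Dict[str, float]],
    weights: Dict[str, float],
) -> List[Tuple[str, float]]:
    """Return (name, score) pairs sorted best-first.

    Hypothetical helper: the score is the weight-normalized sum of each
    candidate's metrics. Assumes every metric is higher-is-better.
    """
    total = sum(weights.values())
    if total <= 0:
        raise ValueError("weights must sum to a positive value")

    scored = []
    for name, metrics in candidates.items():
        score = sum(weights[m] * metrics[m] for m in weights) / total
        # Round so tiny floating-point differences do not reorder ties.
        scored.append((name, round(score, 6)))

    # Score descending, then name ascending: the same inputs always
    # produce the same order, regardless of dict insertion order.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored


# Illustrative data only, not real benchmark results.
candidates = {
    "model_b": {"accuracy": 0.90, "f1": 0.80},
    "model_a": {"accuracy": 0.80, "f1": 0.90},
    "model_c": {"accuracy": 0.70, "f1": 0.70},
}
weights = {"accuracy": 0.5, "f1": 0.5}

leaderboard = rank_candidates(candidates, weights)
for position, (name, score) in enumerate(leaderboard, start=1):
    print(position, name, score)
```

Breaking ties on the candidate name is one simple way to keep a leaderboard reproducible across runs, which matters when the ranking feeds a promotion decision.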
Category: Uncategorized
ID: openclaw/skills/ml-model-eval-benchmark