Agent Skills: advanced-evaluation

This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.

UncategorizedID: hainamchung/agent-assistant/advanced-evaluation

Install this agent skill to your local

pnpm dlx add-skill https://github.com/hainamchung/agent-assistant/advanced-evaluation

Skill Files

Browse the full folder contents for advanced-evaluation.

Download Skill

Loading file tree…

Select a file to preview its contents.