Agent Skills: advanced-evaluation
This skill should be used when the user asks to "implement LLM-as-judge", "compare model outputs", "create evaluation rubrics", "mitigate evaluation bias", or mentions direct scoring, pairwise comparison, position bias, evaluation pipelines, or automated quality assessment.
UncategorizedID: hainamchung/agent-assistant/advanced-evaluation
4622
Install this agent skill to your local
Skill Files
Browse the full folder contents for advanced-evaluation.
Loading file tree…
Select a file to preview its contents.