bloom_integrity_verification
>
UncategorizedView skill →
crisis_persistence_eval
>
UncategorizedView skill →
healthbench_evaluation
>
UncategorizedView skill →
phi_detection
>
UncategorizedView skill →
evaluation_v2
Anthropic-aligned medical safety evaluation with pass^k metrics, failure taxonomy, and anti-gaming graders
UncategorizedView skill →
evaluator-brief-generator
|
UncategorizedView skill →
scribegoat2-healthcare-eval
|
UncategorizedView skill →