cognitive-baseline-eval
Execute the Joseph Cognitive Baseline v2.1 (JC B-v2.1) 5-Scenario Test Suite to quantify AI alignment, friction maintenance, and protocol adherence.
evaluation-framework, AI-alignment, protocol-adherence, scenario-testing
starwreckntx
1
evaluation-metrics
LLM evaluation frameworks, benchmarks, and quality metrics for production systems.
llm-evaluation, evaluation-framework, benchmarks, quality-metrics
pluginagentmarketplace
1
evaluation-rubrics
Use when you need explicit quality criteria and scoring scales to evaluate work consistently, compare alternatives objectively, set acceptance thresholds, or reduce subjective bias, or when the user mentions rubric, scoring criteria, quality standards, evaluation framework, inter-rater reliability, or grading/assessing work.
rubric-creation, evaluation-framework, quality-standards, acceptance-criteria
lyndonkl
0