Agent Skills: Solution Comparator Skill

Compare multiple solutions for correctness and performance

UncategorizedID: a5c-ai/babysitter/solution-comparator

Install this agent skill to your local

pnpm dlx add-skill https://github.com/a5c-ai/babysitter/tree/HEAD/plugins/babysitter/skills/babysit/process/specializations/algorithms-optimization/skills/solution-comparator

Skill Files

Browse the full folder contents for solution-comparator.

Download Skill

Loading file tree…

plugins/babysitter/skills/babysit/process/specializations/algorithms-optimization/skills/solution-comparator/SKILL.md

Skill Metadata

Name
solution-comparator
Description
Compare multiple solutions for correctness and performance

Solution Comparator Skill

Purpose

Compare multiple algorithm solutions against the same test cases to verify correctness and benchmark performance.

Capabilities

  • Run solutions against same test cases
  • Performance benchmarking and comparison
  • Output diff analysis
  • Find minimal failing test case
  • Memory usage comparison
  • Time complexity validation

Target Processes

  • correctness-proof-testing
  • complexity-optimization
  • upsolving
  • algorithm-implementation

Comparison Modes

  1. Correctness: Compare outputs against a known-correct solution
  2. Performance: Benchmark execution time across solutions
  3. Stress Testing: Run with random large inputs to find discrepancies
  4. Minimal Counter-example: Binary search to find smallest failing case

Input Schema

{
  "type": "object",
  "properties": {
    "solutions": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "name": { "type": "string" },
          "code": { "type": "string" },
          "language": { "type": "string" }
        }
      }
    },
    "testCases": { "type": "array" },
    "mode": {
      "type": "string",
      "enum": ["correctness", "performance", "stress", "minimal"]
    },
    "oracleSolution": { "type": "string" },
    "timeout": { "type": "integer", "default": 5000 }
  },
  "required": ["solutions", "mode"]
}

Output Schema

{
  "type": "object",
  "properties": {
    "success": { "type": "boolean" },
    "results": { "type": "array" },
    "discrepancies": { "type": "array" },
    "performance": { "type": "object" },
    "minimalFailingCase": { "type": "object" }
  },
  "required": ["success"]
}