Agent Skills: Ralph Loop

Autonomous iteration loop with dual-mode support. Standalone mode uses Stop hooks (RALPH_ACTIVE=1). Multi-agent mode uses router-managed iteration. Never traps the host/router session.

UncategorizedID: oimiragieo/agent-studio/ralph-loop

Install this agent skill to your local

pnpm dlx add-skill https://github.com/oimiragieo/agent-studio/tree/HEAD/.claude/skills/ralph-loop

Skill Files

Browse the full folder contents for ralph-loop.

Download Skill

Loading file tree…

.claude/skills/ralph-loop/SKILL.md

Skill Metadata

Name
ralph-loop
Description
Autonomous iteration loop with dual-mode support. Standalone mode uses Stop hooks (RALPH_ACTIVE=1). Multi-agent mode uses router-managed iteration. Never traps the host/router session.

Ralph Loop

Autonomous iteration loop for Claude Code with dual-mode support. Named after the Ralph Wiggum technique popularized in the Claude Code community (Dec 2025 - Feb 2026).

Purpose

Enable Claude Code to work autonomously on well-defined tasks until genuine completion, without manual re-prompting. The skill provides:

  1. Dual-mode architecture — Standalone mode (Stop hook) or multi-agent mode (router-managed iteration)
  2. RALPH_ACTIVE env var guard — Stop hook only activates when RALPH_ACTIVE=1 is set, preventing host/router trapping
  3. State persistence — JSON state file tracks iteration count, timestamps, findings
  4. Circuit breaker — Detects repeated failures and exits gracefully
  5. Verification-first exit — Completion signal only accepted when validation commands pass
  6. Guardrails — Accumulated lessons from past failures prevent repeated mistakes

When to Use

  • Well-defined tasks with clear, testable success criteria
  • Iterative work (get tests passing, fix lint errors, audit codebase)
  • Overnight or background autonomous runs
  • Multi-phase audits with structured findings logs

When NOT to Use

  • Subjective goals ("make the code better")
  • One-off fixes that don't need iteration
  • Tasks requiring human judgment at each step
  • Exploratory research without clear deliverables

Architecture — Two Modes

Mode 1: Standalone (Stop Hook)

For single-session use. The Stop hook keeps the session alive until completion. Requires RALPH_ACTIVE=1 env var (set by the launcher scripts).

User runs ralph-audit.sh/bat (sets RALPH_ACTIVE=1)
    |
    v
Claude works on task (reads PROMPT.md)
    |
    v
Claude attempts to exit
    |
    v
Stop hook intercepts (ralph-stop-hook.cjs)
    |
    +-- RALPH_ACTIVE != '1'? --> YES --> exit(0) immediately (no-op)
    |
    +-- Completion signal found? --> YES --> Clear state, exit(0)
    |
    +-- Max iterations reached? --> YES --> Clear state, exit(0)
    |
    +-- NO --> Increment iteration, save state, re-inject prompt, block exit

Mode 2: Router-Managed (Multi-Agent) -- Primary mode for agent-studio

This is the primary mode within agent-studio. The router manages iteration by spawning and re-spawning QA agents via Task(). No Stop hook is involved -- the router itself controls the loop lifecycle. The state file at .claude/context/runtime/ralph-state.json within agent-studio tracks iteration progress.

Router receives /ralph-loop command
    |
    v
Router spawns QA agent with audit prompt via Task()
    |
    v
QA agent completes, reports findings via TaskUpdate()
    |
    v
Router checks audit state file (.claude/context/runtime/ralph-state.json)
    |
    +-- RALPH_AUDIT_COMPLETE_NO_FINDINGS? --> Done
    |
    +-- RALPH_ITERATION_COMPLETE? --> Spawn another QA agent
    |
    +-- Max iterations? --> Report and stop

Why two modes: Stop hooks fire on the host session, not on subagents. In multi-agent setups (like agent-studio), the Stop hook would trap the router. The RALPH_ACTIVE guard ensures the hook is a no-op unless explicitly activated by a standalone launcher. In agent-studio, Mode 2 is used exclusively -- the router orchestrates iteration without any Stop hook registration.

Components

Skill Bundle (included in this skill directory)

| File | Relative Path | Purpose | | ----------------------- | -------------------------------------- | ----------------------------------------------- | | Main script | scripts/main.cjs | CLI for status/reset/config of ralph loop state | | Pre-execute hook | hooks/pre-execute.cjs | Input validation before skill execution | | Post-execute hook | hooks/post-execute.cjs | Output validation after skill execution | | Input schema | schemas/input.schema.json | Input validation schema | | Output schema | schemas/output.schema.json | Output contract schema | | Implementation template | templates/implementation-template.md | PROMPT.md and launcher templates | | Skill rule | rules/ralph-loop.md | Skill-specific rules |

Project-Level Files (set up per-project by the user)

These files are NOT part of the skill bundle. They live in the hosting project's .claude/ directory and must be created/configured by the user or via the implementation template.

| File | Project Path | Purpose | | --------------- | ------------------------------------------ | ---------------------------------------------------------- | | Stop hook | .claude/hooks/ralph-stop-hook.cjs | Loop controller (stdin -> transcript check -> re-inject) | | Prompt | .claude/ralph/PROMPT.md | Audit/task instructions re-injected each iteration | | State | .claude/context/runtime/ralph-state.json | Iteration count, timestamps, findings count (auto-created) | | Guardrails | .claude/ralph/guardrails.md | Learned lessons from past failures | | Launcher (bash) | .claude/ralph/ralph-audit.sh | Unix/macOS launcher script (sets RALPH_ACTIVE=1) | | Launcher (bat) | .claude/ralph/ralph-audit.bat | Windows launcher script (sets RALPH_ACTIVE=1) | | Settings | .claude/settings.json | Stop hook registration (Mode 1 only) |

Usage

Standalone Mode (Stop Hook)

# From your project's workspace root — sets RALPH_ACTIVE=1 automatically
.claude/ralph/ralph-audit.sh      # Unix/macOS
.claude\ralph\ralph-audit.bat     # Windows

Manual Standalone Start

# Must set RALPH_ACTIVE=1 yourself for the Stop hook to activate
export RALPH_ACTIVE=1
claude --print-output-format text < .claude/ralph/PROMPT.md

Multi-Agent Mode (Router-Managed)

Use the /ralph-loop command within agent-studio, or have the router spawn QA agents with the audit prompt. The router manages iteration; no Stop hook is involved.

Custom Prompt

Create a custom PROMPT.md with:

  1. Mission — What the agent must accomplish
  2. Scope — Specific areas to audit/fix
  3. Validation commands — Commands that must pass
  4. Completion condition — The exact completion signal string

Completion Signals

| Signal | Meaning | | --------------------------------------------- | ---------------------------------------- | | RALPH_AUDIT_COMPLETE_NO_FINDINGS | All validations pass, zero open findings | | RALPH_ITERATION_COMPLETE: N findings remain | Progress made, N findings still open |

Stop Hook Protocol

The stop hook (ralph-stop-hook.cjs) follows this protocol:

  1. GUARD 0: Check RALPH_ACTIVE env var — if not '1', exit 0 immediately (no stdin read, no file checks, no-op). This is the critical protection that prevents the hook from trapping the host/router session.
  2. Read stdin (Claude Code transcript JSON)
  3. Check stop_hook_active guard (prevent infinite re-triggering)
  4. Check if ralph-state.json exists (no state = no active loop)
  5. Check transcript for RALPH_AUDIT_COMPLETE_NO_FINDINGS
    • Found → clear state file, exit 0 (allow exit)
  6. Load state from ralph-state.json, increment iteration
  7. Check for progress signal (RALPH_ITERATION_COMPLETE)
  8. Check circuit breaker (findings count stuck for N iterations)
  9. Check max iterations (default: 25)
    • Reached → clear state, exit 0 (force stop)
  10. Save updated state
  11. Read PROMPT.md and write to stdout as JSON { decision: 'block', reason: <prompt> }
  12. Exit 0 (with block decision in stdout)

Exit Codes

| Code | Meaning | | ---- | ----------------------------------------------- | | 0 | Allow exit (complete, max iterations, or error) | | 2 | Block exit, stdout fed back as next prompt |

Configuration

Environment Variables

| Variable | Default | Description | | --------------------------------- | ---------------------------------- | -------------------------------------------------------------------------------------------------- | | RALPH_ACTIVE | (unset) | Must be 1 to activate the Stop hook. Set by launcher scripts. Without this, the hook is a no-op. | | RALPH_MAX_ITERATIONS | 25 | Maximum loop iterations | | RALPH_COMPLETION_SIGNAL | RALPH_AUDIT_COMPLETE_NO_FINDINGS | String that signals completion | | RALPH_CIRCUIT_BREAKER_THRESHOLD | 3 | Consecutive iterations with unchanged findings count before circuit breaker trips |

State File Schema

{
  "iteration": 3,
  "startedAt": "2026-02-28T10:00:00Z",
  "lastRunAt": "2026-02-28T10:15:00Z",
  "lastFindingsCount": 5
}

TDD State Schema (for TDD-mode loops)

When using ralph-loop for TDD workflows, use a separate TDD-specific state file at .claude/context/runtime/tdd-state.json. This schema tracks per-scenario RED/GREEN evidence across session interruptions:

{
  "scenarios": [
    {
      "id": "sc-001",
      "description": "routing-guard blocks Write on creator paths",
      "status": "pending|red|green|refactored"
    }
  ],
  "completedScenarios": [
    {
      "id": "sc-001",
      "evidenceCommand": "node --test tests/hooks/routing-guard.test.cjs",
      "redEvidence": "AssertionError: expected exit code 2, got 0",
      "greenEvidence": "✓ routing-guard blocks Write (4ms)",
      "passedAt": "2026-03-12T10:00:00Z"
    }
  ],
  "currentScenario": "sc-002",
  "evidenceLog": [
    {
      "scenarioId": "sc-001",
      "phase": "red|green|refactored",
      "output": "<verbatim test runner output>",
      "timestamp": "2026-03-12T09:58:00Z"
    }
  ]
}

TDD Session Resumption: On each loop iteration, before picking a scenario:

const tddState = JSON.parse(
  fs.readFileSync('.claude/context/runtime/tdd-state.json', 'utf-8') || '{}'
);
const completedIds = (tddState.completedScenarios || []).map(s => s.id);
const remaining = (tddState.scenarios || []).filter(s => !completedIds.includes(s.id));
if (remaining.length === 0) {
  // All scenarios complete — emit RALPH_AUDIT_COMPLETE_NO_FINDINGS
  console.log('RALPH_AUDIT_COMPLETE_NO_FINDINGS');
  process.exit(0);
}
const nextScenario = remaining[0];

Critical rule: Never re-execute scenarios already in completedScenarios. The evidenceLog is append-only — each phase (red/green/refactored) adds a new entry. Circuit breaker trips if currentScenario is unchanged for 3+ iterations.

Integration with TDP: When spawning the developer agent for a TDD loop iteration, extract the red evidence from evidenceLog and inject verbatim into the spawn prompt (see tdd skill — Test-Driven Prompting pattern).

Writing Effective Prompts

Structure

# Mission

One-line directive.

## Before Doing Anything

Step 1: Read previous findings (if exists)
Step 2: Load context/skills

## Scope

Numbered list of areas to audit/fix.

## Validation Commands

Commands that must pass for completion.

## Findings Log

Where to write findings (path + format).

## Completion Condition

Exact signal strings and when to use each.

Best Practices

  1. Binary criteria — "All tests pass" not "code is good"
  2. Validation commands — Include runnable commands (pnpm test, pnpm lint)
  3. Findings format — Structured findings with severity, file, status
  4. Idempotent — Prompt must work correctly on any iteration (read state first)
  5. Context management — Include token-saver invocation for long sessions

Guardrails Pattern

The guardrails.md file accumulates "Signs" — lessons learned from failures:

### Sign: [Name]

- **Trigger**: When this applies
- **Instruction**: What to do
- **Added after**: What failure taught this

Agents should read guardrails.md at the start of each iteration and add new Signs when they encounter novel failure modes.

Enforcement Hooks

Input validated against schemas/input.schema.json before execution. Output contract defined in schemas/output.schema.json.

Anti-Patterns

| Anti-Pattern | Why It Fails | Correct Approach | | ------------------------------------ | ----------------------------------------------- | ----------------------------------------------- | | No max iterations | Infinite loop burns tokens | Always set maxIterations (default 25) | | Vague completion criteria | Agent claims "done" prematurely | Use binary pass/fail validation commands | | No state persistence | Progress lost on context reset | Write findings to file, read at iteration start | | Stop hook without RALPH_ACTIVE guard | Traps host/router session in multi-agent setups | Check RALPH_ACTIVE=1 before any other logic | | Running standalone mode from router | Router gets trapped by Stop hook | Use router-managed iteration (Mode 2) instead | | No guardrails | Same mistakes repeated each iteration | Maintain guardrails.md with learned Signs |

Iron Laws

  1. NO LOOP WITHOUT VERIFICATION — Completion signal must be backed by passing validation commands
  2. NO LOOP WITHOUT MAX ITERATIONS — Every loop must have a safety cap
  3. NO LOOP WITHOUT STATE FILE — Progress must be persisted to survive context resets
  4. NO LOOP WITHOUT GUARDRAILS — Failures must be recorded to prevent repetition
  5. RALPH_ACTIVE GUARD FIRST — Stop hook must check RALPH_ACTIVE=1 env var before reading stdin, checking files, or any other logic. This is the primary protection against trapping the host/router.

Memory Protocol (MANDATORY)

Before starting:

Read .claude/context/memory/learnings.md using the Read tool.

Check for:

  • Previously run ralph loops and their outcomes
  • Known audit patterns and failure modes
  • User preferences for iteration limits

After completing:

  • Loop completed successfully → Append summary to learnings.md
  • New guardrail discovered → Add Sign to .claude/ralph/guardrails.md
  • Architecture decision → Append to decisions.md

ASSUME INTERRUPTION: Your context may reset. If it's not in memory, it didn't happen.