Smithers Workflow Engine Skill

Smithers Workflow Engine

TypeScript framework for deterministic, resumable AI workflows using JSX. Runtime: Bun >= 1.3. State: SQLite via Drizzle ORM. Validation: Zod schemas. Version: 0.8.2.

Init CLI

uv run /path/to/skills/smithers/scripts/skill_smithers.py <target-dir>
uv run scripts/skill_smithers.py ./scripts/my-workflow --name my-workflow --no-install

Copies the full template into <target-dir>, runs bun install, and initializes git + jj (skips each if already present). Requires jj on PATH: brew install jj.

Project Config Setup

The workflow engine (<target-dir>/) is project-agnostic — never edit it per project. All domain-specific content lives in a separate config directory:

projects/<my-project>/smithers-config/
  config.ts            # ProjectConfig (name, phases, cwd, instructions, agents)
  instructions/        # MDX files injected into component prompts
    research.mdx, plan.mdx, implement.mdx, test.mdx,
    code-review.mdx, prd-review.mdx, review-fix.mdx,
    final-review.mdx, update-progress.mdx
  output/
    context/           # agents write context docs here (create empty)
    plans/             # agents write plan docs here (create empty)
  run.sh               # launcher

config.ts must satisfy the ProjectConfig interface (see <target-dir>/types/project.ts). Key fields:

phases — ordered list of { id, name, description, metadata? } — the outer loop iterates over these
cwd — absolute path to the repo root where agents run
instructions — map of step name → imported MDX function (e.g. research: ResearchInstructions)
agents.systemPromptContent — project context injected into every agent's system prompt

instructions/*.mdx are additive — they fill the {props.projectInstructions} slot in each generic component prompt. Write only what's domain-specific (spec references, file paths, rules). Phase metadata fields are available as {props.fieldName}.

run.sh sets SMITHERS_PROJECT and invokes the engine:

#!/usr/bin/env bash
SCRIPT_DIR="$(cd "$(dirname "${BASH_SOURCE[0]}")" && pwd)"
cd "<target-dir>"
export SMITHERS_PROJECT="$SCRIPT_DIR/config.js"
bunx smithers run workflow.tsx

Run with: ./projects/<my-project>/smithers-config/run.sh

Core Execution Model

Render-schedule-execute loop:

Render — JSX tree → available tasks
Schedule — determine runnable tasks (dependencies + concurrency)
Execute — agent runs, output validates against Zod, persists to SQLite
Re-render — tree updates with new outputs, unblocking dependents
Repeat until done/failed

JSX Primitives

| Component | Purpose | |-------------|------------------------------------------------| | <Workflow> | Root container | | <Task> | Single agent work unit with schema validation | | <Sequence> | Sequential execution | | <Parallel> | Concurrent execution with maxConcurrency | | <Ralph> | Loop with until condition + maxIterations | | <Branch> | Conditional routing | | <Worktree> | Run tasks in an isolated git/jj worktree (auto-created if missing). Optional branch prop creates/resets a named branch in the worktree, making restarts idempotent. |

Task Props

| Prop | Type | Purpose | |----------------|------------------------------|---------------------------------------------------------------------| | id | string | Unique node ID (phase-prefix for multi-phase loops) | | output | ZodObject | Required. Pass outputs.xxx from createSmithers. Enables schema validation, auto-retry, and JSON instructions. Never pass a string — that API was removed in v0.7.1. | | agent | AgentLike \| AgentLike[] | Agent or array of agents [primary, fallback1, fallback2, ...]. Tries in order: attempt 1 uses agents[0], attempt 2 uses agents[1], etc. (capped at last). Replaces the old fallbackAgent prop. | | retries | number | Retry budget on failure | | timeoutMs | number | Per-attempt timeout in milliseconds | | skipIf | boolean | Skip execution when true | | continueOnFail| boolean | Don't block siblings when this task fails |

Available Agents

| Agent | Import | CLI Required | Notes | |------------------|-------------------------------|---------------|--------------------------------------------| | ClaudeCodeAgent | smithers-orchestrator | claude | Default reviewer/researcher | | CodexAgent | smithers-orchestrator | codex | Default implementer | | GeminiAgent | smithers-orchestrator | gemini | Google Gemini CLI; json output format by default (v0.8.2, changed from text) | | KimiAgent | smithers-orchestrator | kimi | Kimi CLI; thinking=true and text output by default (v0.8.2, reverted from stream-json); --final-message-only auto-enabled; parallel runs use isolated share dirs; supports agent: "okabe", MCP configs | | AmpAgent | smithers-orchestrator | amp | Amp CLI; supports thread (continue existing thread), visibility, mcpConfig, dangerouslyAllowAll | | PiAgent | smithers-orchestrator | pi | Pi CLI |

Key Rules

Always pass output={outputs.xxx} to <Task> — enables schema validation, auto-retry, and auto-appended JSON output instructions. Pass the ZodObject from createSmithers's outputs return — string keys were removed in v0.7.1.
Use ctx.outputMaybe() not ctx.output() — gracefully handles missing outputs during first render
Use ctx.latest() for cross-iteration decisions — outputMaybe is scoped to the current Ralph iteration; use ctx.latest(table, nodeId) for skipIf, loop until, and allPhasesComplete checks
Use .nullable() never .optional() in Zod schemas — OpenAI structured outputs rejects .optional() fields
Phase-prefix nodeIds when multiple phases share a loop — ${phaseId}:step-name
Set continueOnFail on review Tasks — one reviewer failing shouldn't block the other
Use agent arrays for rate-limit resilience — set agent={[primary, fallback]} on heavy tasks so retry attempts switch to a different model automatically. The old fallbackAgent prop was removed in v0.8.0.

Workflow Philosophy

Atomic unit pipeline: run the full pipeline (implement → test → review → fix → final-review) for each logically complete unit of work. Do NOT implement everything then review once.

"Atomic" = one logical concern, not smallest diff: 20 similar operators = ONE unit. Porting 14 similar test cases = ONE unit. The outer Ralph loop iterates until all phases pass the FinalReview gate.

Resources

references/atomic-workflow.md — logical unit pipeline, nextLogicalUnit chaining, batching heuristics
references/patterns.md — outer Ralph, gating, data threading, pass tracking, dual-model review
references/troubleshooting.md — dirty git, OpenAI schema errors, stale runs, SQLite debugging

Agent Skills: Smithers Workflow Engine

Install this agent skill to your local

Skill Files