Agent Skills: PaperBanana Pipeline Skill

|

UncategorizedID: kwiggen/claude-code-plugin/paper-banana

Install this agent skill to your local

pnpm dlx add-skill https://github.com/kwiggen/claude-code-plugin/tree/HEAD/skills/paper-banana

Skill Files

Browse the full folder contents for paper-banana.

Download Skill

Loading file tree…

skills/paper-banana/SKILL.md

Skill Metadata

Name
paper-banana
Description
|

PaperBanana Pipeline Skill

You are the orchestrator for a 5-agent image generation pipeline. Your role is to coordinate specialized agents that work together to produce publication-quality illustrations.

Architecture

| Agent | Role | Responsibility | |-------|------|---------------| | retriever | Reference Scout | Searches codebase and available resources for reference images, examples, and style inspiration matching the requested output | | planner | Content Architect | Converts the user's description into a detailed, structured image specification with composition, elements, and requirements | | stylist | Style Enforcer | Synthesizes style guidelines from references, defines color palette, typography, layout rules, and visual consistency standards | | visualizer | Image Creator | Uses the image-generator skill to generate the actual image based on the spec and style guide | | critic | Quality Reviewer | Evaluates output against the specification, provides structured refinement feedback, decides if another iteration is needed |

Pipeline Flow

User Request
    → Retriever (find references)
    → Planner (create spec)
    → Stylist (define style)
    → Visualizer (generate image)
    → Critic (evaluate)
    → [Loop back to Visualizer if needed, max 3 iterations]
    → Final Output

Workflow

Step 1: Understand the Request

Parse the user's request to extract:

  • Subject: What needs to be illustrated
  • Purpose: Where it will be used (blog, docs, presentation, paper)
  • Style hints: Any style preferences mentioned
  • Constraints: Size, format, color scheme requirements

If the request is vague, ask 2-3 clarifying questions using AskUserQuestion.

Step 2: Create the Team

Use TeamCreate to create a team named paper-banana:

Then create 5 tasks with TaskCreate — one for each agent's work:

  1. Retrieve References — Find relevant visual references and style examples
  2. Create Image Specification — Write detailed composition and content spec
  3. Define Style Guide — Create style rules, palette, and visual standards
  4. Generate Image — Produce the image using the image-generator skill
  5. Review and Critique — Evaluate quality and provide feedback

Set up dependencies:

  • Task 2 (Planner) is blocked by Task 1 (Retriever)
  • Task 3 (Stylist) is blocked by Task 1 (Retriever)
  • Task 4 (Visualizer) is blocked by Tasks 2 and 3
  • Task 5 (Critic) is blocked by Task 4

Step 3: Spawn Agents

Spawn teammates using the Task tool with team_name: "paper-banana":

Retriever (general-purpose agent):

Search the current project and codebase for visual references, existing images,
or style examples that match: [user request].

Look for:
- Existing images in the project (PNG, JPG, SVG files)
- Style guides or brand guidelines
- Color schemes in CSS/config files
- Similar illustrations in docs/

Return a structured report with:
- Found references (file paths)
- Recommended style direction
- Color palette suggestions

Planner (general-purpose agent):

Create a detailed image specification for: [user request]

Using references from the Retriever, write a spec covering:
- Composition layout (rule of thirds, symmetry, etc.)
- Primary and secondary elements
- Background treatment
- Text placement (if any)
- Required visual elements
- Mood and tone
- Dimensions and aspect ratio

Stylist (general-purpose agent):

Create a style guide for the illustration based on references and spec.

Define:
- Color palette (primary, secondary, accent — hex values)
- Visual style (flat, gradient, 3D, hand-drawn, etc.)
- Line weight and treatment
- Typography style (if text is needed)
- Consistency rules for maintaining visual coherence
- Do's and Don'ts

Visualizer (general-purpose agent):

Generate the image using the image-generator skill.

Use the spec from Planner and style guide from Stylist to create
an enhanced prompt. Then invoke the CLI:

node {pluginDir}/dist/image-gen/cli.js \
  --prompt "[enhanced prompt from spec + style]" \
  --output "[output path]" \
  --size 4K

If reference images were found by the Retriever, include them
with --reference flags.

Critic (general-purpose agent):

Evaluate the generated image against the specification.

Check:
- Does it match the composition spec?
- Does it follow the style guide?
- Is the quality sufficient for the intended purpose?
- Are there artifacts, distortions, or unwanted elements?

Provide a verdict:
- APPROVED: Image meets spec, ready for delivery
- NEEDS_REVISION: List specific issues and suggest prompt adjustments
- REJECTED: Fundamental problems, needs new approach

If NEEDS_REVISION, provide specific prompt modifications for the Visualizer.

Step 4: Monitor and Iterate

  • Wait for agents to complete in dependency order
  • If Critic returns NEEDS_REVISION, send feedback to Visualizer and regenerate
  • Maximum 3 iterations before delivering the best result
  • If Critic returns APPROVED, proceed to delivery

Step 5: Deliver

Present the final result to the user:

Image generated: [path]

Specification: [brief summary of what was created] Style: [brief style description] Iterations: [number of iterations taken] Critic verdict: [final verdict]

Want me to make any adjustments?

Step 6: Clean Up

Shut down all teammates and delete the team when done.

Error Handling

  • If Retriever finds no references, proceed with Planner and Stylist using the user's description alone
  • If Visualizer fails (API key, rate limit), report the error and suggest fixes
  • If Critic rejects after 3 iterations, deliver the best result with a note about limitations
  • If any agent fails, report which step failed and offer to retry or adjust

When to Use This vs /generate-image

  • Use /generate-image for quick, single-shot image generation
  • Use /paper-banana when you need:
    • Style consistency with existing project visuals
    • Iterative refinement with quality review
    • Publication-quality output
    • Complex illustrations with multiple elements