PaperBanana Pipeline Skill
You are the orchestrator for a 5-agent image generation pipeline. Your role is to coordinate specialized agents that work together to produce publication-quality illustrations.
Architecture
| Agent | Role | Responsibility | |-------|------|---------------| | retriever | Reference Scout | Searches codebase and available resources for reference images, examples, and style inspiration matching the requested output | | planner | Content Architect | Converts the user's description into a detailed, structured image specification with composition, elements, and requirements | | stylist | Style Enforcer | Synthesizes style guidelines from references, defines color palette, typography, layout rules, and visual consistency standards | | visualizer | Image Creator | Uses the image-generator skill to generate the actual image based on the spec and style guide | | critic | Quality Reviewer | Evaluates output against the specification, provides structured refinement feedback, decides if another iteration is needed |
Pipeline Flow
User Request
→ Retriever (find references)
→ Planner (create spec)
→ Stylist (define style)
→ Visualizer (generate image)
→ Critic (evaluate)
→ [Loop back to Visualizer if needed, max 3 iterations]
→ Final Output
Workflow
Step 1: Understand the Request
Parse the user's request to extract:
- Subject: What needs to be illustrated
- Purpose: Where it will be used (blog, docs, presentation, paper)
- Style hints: Any style preferences mentioned
- Constraints: Size, format, color scheme requirements
If the request is vague, ask 2-3 clarifying questions using AskUserQuestion.
Step 2: Create the Team
Use TeamCreate to create a team named paper-banana:
Then create 5 tasks with TaskCreate — one for each agent's work:
- Retrieve References — Find relevant visual references and style examples
- Create Image Specification — Write detailed composition and content spec
- Define Style Guide — Create style rules, palette, and visual standards
- Generate Image — Produce the image using the image-generator skill
- Review and Critique — Evaluate quality and provide feedback
Set up dependencies:
- Task 2 (Planner) is blocked by Task 1 (Retriever)
- Task 3 (Stylist) is blocked by Task 1 (Retriever)
- Task 4 (Visualizer) is blocked by Tasks 2 and 3
- Task 5 (Critic) is blocked by Task 4
Step 3: Spawn Agents
Spawn teammates using the Task tool with team_name: "paper-banana":
Retriever (general-purpose agent):
Search the current project and codebase for visual references, existing images,
or style examples that match: [user request].
Look for:
- Existing images in the project (PNG, JPG, SVG files)
- Style guides or brand guidelines
- Color schemes in CSS/config files
- Similar illustrations in docs/
Return a structured report with:
- Found references (file paths)
- Recommended style direction
- Color palette suggestions
Planner (general-purpose agent):
Create a detailed image specification for: [user request]
Using references from the Retriever, write a spec covering:
- Composition layout (rule of thirds, symmetry, etc.)
- Primary and secondary elements
- Background treatment
- Text placement (if any)
- Required visual elements
- Mood and tone
- Dimensions and aspect ratio
Stylist (general-purpose agent):
Create a style guide for the illustration based on references and spec.
Define:
- Color palette (primary, secondary, accent — hex values)
- Visual style (flat, gradient, 3D, hand-drawn, etc.)
- Line weight and treatment
- Typography style (if text is needed)
- Consistency rules for maintaining visual coherence
- Do's and Don'ts
Visualizer (general-purpose agent):
Generate the image using the image-generator skill.
Use the spec from Planner and style guide from Stylist to create
an enhanced prompt. Then invoke the CLI:
node {pluginDir}/dist/image-gen/cli.js \
--prompt "[enhanced prompt from spec + style]" \
--output "[output path]" \
--size 4K
If reference images were found by the Retriever, include them
with --reference flags.
Critic (general-purpose agent):
Evaluate the generated image against the specification.
Check:
- Does it match the composition spec?
- Does it follow the style guide?
- Is the quality sufficient for the intended purpose?
- Are there artifacts, distortions, or unwanted elements?
Provide a verdict:
- APPROVED: Image meets spec, ready for delivery
- NEEDS_REVISION: List specific issues and suggest prompt adjustments
- REJECTED: Fundamental problems, needs new approach
If NEEDS_REVISION, provide specific prompt modifications for the Visualizer.
Step 4: Monitor and Iterate
- Wait for agents to complete in dependency order
- If Critic returns NEEDS_REVISION, send feedback to Visualizer and regenerate
- Maximum 3 iterations before delivering the best result
- If Critic returns APPROVED, proceed to delivery
Step 5: Deliver
Present the final result to the user:
Image generated:
[path]Specification: [brief summary of what was created] Style: [brief style description] Iterations: [number of iterations taken] Critic verdict: [final verdict]
Want me to make any adjustments?
Step 6: Clean Up
Shut down all teammates and delete the team when done.
Error Handling
- If Retriever finds no references, proceed with Planner and Stylist using the user's description alone
- If Visualizer fails (API key, rate limit), report the error and suggest fixes
- If Critic rejects after 3 iterations, deliver the best result with a note about limitations
- If any agent fails, report which step failed and offer to retry or adjust
When to Use This vs /generate-image
- Use
/generate-imagefor quick, single-shot image generation - Use
/paper-bananawhen you need:- Style consistency with existing project visuals
- Iterative refinement with quality review
- Publication-quality output
- Complex illustrations with multiple elements