Agent Workflow Designer
Overview
This skill guides the design and architecture of AI agent workflows using proven methodologies. When a user presents a problem, this skill helps structure an agent-based solution following the 9-step building process and 8-layer architecture framework validated at Meta.
Workflow Decision Tree
When a user shares a problem or requests agent design help:
-
Assess the problem scope
- Is the problem clearly defined? → Proceed to Problem Analysis
- Is the problem vague? → Ask clarifying questions about desired outcomes and constraints
-
Determine architecture complexity
- Simple task (single action)? → Single agent with basic tools
- Complex task (multiple sub-tasks)? → Consider multi-agent orchestration
- Integration task (connecting systems)? → Focus on Layer 4 (Tooling) design
-
Follow the appropriate workflow
- New agent from scratch → Apply 9-Step Building Process
- Existing agent improvement → Focus on specific layers needing enhancement
- Tool integration problem → Apply MCP and tooling patterns
9-Step Agent Building Process
Use this sequential workflow when designing a new agent from scratch:
Step 1: Define Purpose and Scope
Key principle: Start with job-to-be-done, not technology.
Ask the user:
- What specific outcome does the end user need?
- What are the constraints (budget, time, resources)?
- What's the success metric?
Bad scope example: "An AI assistant for customer service"
Good scope example: "An agent that takes customer complaints, pulls order history from Shopify API, and drafts refund approvals for orders under $200"
Decision point: Narrow scope = better performance. Resist building Swiss Army knives.
Step 2: Structure Inputs and Outputs
Treat the agent as a function with structured interfaces:
Inputs:
- Use JSON schemas or Pydantic models, not free text
- Define required vs. optional fields
- Specify data types and validation rules
Outputs:
- Return data objects, not prose
- Define clear error states
- Include confidence scores when relevant
Example structure:
Input: {
"complaint_text": "string",
"customer_id": "string",
"order_id": "string (optional)"
}
Output: {
"action": "approve_refund | escalate | request_info",
"refund_amount": "number",
"reasoning": "string",
"confidence": "number"
}
Step 3: Write System Instructions
Critical: Spend 80% of design time here.
Include in system prompt:
- Role definition: "You are a sales qualification specialist..."
- Behavioral guidelines: "Always ask for budget before proposing solutions"
- Output format requirements: Specify JSON structure, word limits, tone
- Edge case handling: What to do when data is missing or ambiguous
Testing strategy: A great system prompt can make GPT-3.5 outperform poorly prompted GPT-4.
Step 4: Enable Reasoning and External Actions
ReAct Framework Pattern:
- Reason: Analyze the current state and decide next action
- Act: Call an API, use a tool, or make a decision
- Observe: Review the result and determine if goal is achieved
Start simple:
- Begin with if/then logic before complex reasoning chains
- Add tools incrementally (don't overwhelm with 50 tools at once)
- Test each tool integration independently
Common tools to integrate:
- Calculators for math operations
- Web browsers for research
- Database queries for data retrieval
- API calls to external systems
Step 5: Orchestrate Multiple Agents (When Needed)
When to use multi-agent architecture:
- Task has clearly separable sub-tasks
- Different sub-tasks require different expertise
- Parallel processing would improve speed
When NOT to use multi-agent:
- Simple linear workflows
- Tasks that require continuous context
- When handoff complexity exceeds benefit
Common 4-agent pattern:
- Research Agent: Gathers information from sources
- Analysis Agent: Processes and synthesizes data
- Writing Agent: Creates structured outputs
- QA Agent: Reviews quality and accuracy
Keep handoffs simple: Complex orchestration = complex failures.
Step 6: Implement Memory and Context
Three types of memory to consider:
Conversation history:
- What happened this session
- Recent user interactions
- Current task state
User context:
- User preferences and settings
- Past interaction patterns
- Historical decisions
Knowledge retrieval:
- Relevant information from knowledge base
- Similar past cases
- Domain-specific context
Implementation guidance:
- Start with simple conversation buffers
- Add vector databases only when needing semantic search across large datasets
- Consider memory retrieval latency in architecture
Step 7: Add Multimedia Capabilities
Modern agents should handle:
- Voice input/output for accessibility
- Image understanding for visual tasks
- Document processing (PDF, DOCX, spreadsheets)
Strategic approach: Add capabilities based on actual user needs, not "nice-to-haves."
Step 8: Format and Deliver Results
Output is your product's UX. Design outputs for:
Human consumption:
- Clear formatting and structure
- Scannable with headers and bullets
- Professional appearance
System consumption:
- Valid JSON/XML
- Consistent field names
- Error codes for handling
Quality standard: Great agent outputs look like a human created them.
Step 9: Build Interface or API
Delivery method options:
- Chat interface for conversational tasks
- API endpoints for system integration
- Integration with existing tools (Slack, email, CRM)
Best practice: The best agents feel invisible—they just make things happen.
8-Layer Architecture Framework
When analyzing agent architecture needs, consider which layers require attention:
Layer 1: Infrastructure
Foundation: Cloud, databases, APIs, compute resources
Key considerations:
- GPU/TPU requirements for inference
- Data storage and retrieval speed
- Load balancing for scale
- Monitoring and observability
Common mistake: Underestimating compute needs—agents make more API calls than traditional apps.
Layer 2: Agent Internet
Operating system for agents: Identity, state management, inter-agent communication
Current state: Mostly custom-built, but platforms like LangChain and CrewAI are emerging.
Layer 3: Protocol
Standards for interoperability: MCP (Model Context Protocol) is becoming the standard
Key principle: Bet on open standards, not proprietary solutions. MCP allows any tool to work with any agent.
Layer 4: Tooling Enrichment
Agent superpowers: RAG systems, function calling, external integrations
Quality over quantity: 5 rock-solid tools > 50 flaky integrations
Tool categories:
- Data retrieval (databases, APIs)
- Computation (calculators, processors)
- Communication (email, messaging)
- Content creation (documents, reports)
Layer 5: Cognition Reasoning
The brain: Planning, decision-making, error handling
Critical elements:
- Guardrails to prevent hallucinations
- Error recovery strategies
- Confidence scoring
- Graceful degradation
User forgiveness: Users forgive agents that fail gracefully, not ones that spiral into nonsense.
Layer 6: Memory Personalization
Human touch: Personal context, preferences, conversation history
Start simple: Store user preferences and conversation context before building complex personalization.
Layer 7: Application
User-facing products: The actual agent functionality users interact with
Focus strategy: Nail one use case before expanding to others.
Layer 8: Ops Governance
Risk management: Monitoring, cost control, privacy, oversight
Build from day one: Retrofitting governance is expensive and painful.
Key components:
- Cost tracking per agent action
- Privacy enforcement and data handling
- Human-in-the-loop for critical decisions
- Audit logs and compliance
Problem-to-Solution Workflow
When a user presents a problem:
Step 1: Clarify the problem
- What's the current manual process?
- What's the desired outcome?
- What are the constraints (time, cost, technical)?
- What data sources are available?
Step 2: Assess agent appropriateness Not every problem needs an agent. Consider:
- Is the task repetitive and rule-based?
- Does it require decision-making with context?
- Would automation provide significant value?
- Is the problem scope clear and bounded?
Step 3: Map to architecture Using the 8 layers, identify which need focus:
- Simple task → Focus on Layers 4, 5, 7 (tools, reasoning, application)
- Complex integration → Add Layer 3 (protocol) emphasis
- Scalability concern → Prioritize Layers 1, 8 (infrastructure, ops)
Step 4: Design workflow Apply the 9-step building process, calling out:
- Critical decision points
- Tool integration requirements
- Multi-agent needs (if any)
- Memory and context strategy
Step 5: Identify implementation path Based on user's role and resources:
- For PMs: High-level architecture and tool selection
- For engineers: Detailed technical implementation with code patterns
- For product teams: Full stack from requirements to monitoring
Tool Integration Patterns
MCP (Model Context Protocol) Integration
When tools support MCP:
- Agent discovers available tools
- Agent calls tools using standardized interface
- Tool returns structured response
- Agent processes and continues workflow
Advantage: Write once, use with any agent.
Custom API Integration
When building custom integrations:
- Define clear API contract (inputs/outputs)
- Implement error handling and retries
- Add rate limiting and caching
- Monitor usage and costs
- Document for agent consumption
Common Integration Scenarios
CRM Integration (Salesforce, HubSpot):
- Read customer data
- Create/update records
- Search across objects
- Trigger workflows
Communication Tools (Slack, Email):
- Send messages/notifications
- Read incoming requests
- Monitor channels
- Respond to mentions
Data Sources (Databases, APIs):
- Query structured data
- Retrieve documents
- Search knowledge bases
- Aggregate information
Decision Framework: Single vs. Multi-Agent
Use Single Agent When:
- Task is linear and sequential
- Context must be maintained throughout
- Decision-making is unified
- Complexity of orchestration > benefit
Use Multi-Agent When:
- Clear task separation exists
- Sub-tasks need different expertise
- Parallel processing improves performance
- Quality benefits from specialization
Example - Customer Support:
Single agent sufficient for: "Take customer complaint, pull order history, draft refund approval"
Multi-agent beneficial for: "Monitor social media, categorize issues, research solutions, generate responses, escalate critical cases, track resolution"
Common Pitfalls and Solutions
Pitfall 1: Scope Creep
Problem: Trying to build a general-purpose assistant Solution: Define narrow, specific job-to-be-done with clear success metrics
Pitfall 2: Tool Overload
Problem: Giving agent 50+ tools upfront Solution: Start with 5 essential tools, add incrementally based on actual needs
Pitfall 3: Skipping System Prompt
Problem: Generic or minimal instructions Solution: Invest 80% of time crafting detailed system prompt with examples and edge cases
Pitfall 4: No Error Handling
Problem: Agent breaks on unexpected inputs Solution: Design graceful degradation, clear error states, and fallback behaviors
Pitfall 5: Ignoring Costs
Problem: Runaway API costs from inefficient agent design Solution: Build cost monitoring from day one, implement caching, optimize prompt length
Pitfall 6: Over-Engineering Architecture
Problem: Building all 8 layers simultaneously Solution: Start with Layers 4, 5, 7 (tools, reasoning, application), add others as needed
Output Format
When providing agent workflow solutions, structure the response as:
- Problem Restatement: Confirm understanding of the user's need
- Agent Architecture Recommendation: Single vs. multi-agent, with rationale
- Step-by-Step Workflow: Apply relevant steps from the 9-step process
- Tool Integration Plan: Specific tools needed and integration approach
- Layer Analysis: Which of the 8 layers need focus and why
- Implementation Guidance: Prioritized next steps based on user's role
- Success Metrics: How to measure if the agent is working
Agent Taxonomy Quick Reference
When users ask about existing tools:
Category 1: Consumer Agents (Built-In)
- Examples: ChatGPT Agent, Claude, Gemini, Grok
- Best for: Quick tasks, research, content creation
- User type: Everyone, especially PMs
Category 2: No-Code Builders
- Examples: Zapier Central, n8n, Make
- Best for: Workflow automation without coding
- User type: PMs, operations teams
Category 3: Developer-First Platforms
- Examples: LangChain, CrewAI, AutoGen, Swarm
- Best for: Custom agent features in products
- User type: Engineering teams
Category 4: Specialized Agent Apps
- Examples: Cursor (coding), Perplexity (research), Notion AI (writing)
- Best for: Specific job-to-be-done with deep specialization
- User type: Domain-specific professionals