Agent Skills: Async Jobs

Async job processing patterns for background tasks, Celery workflows, task scheduling, retry strategies, and distributed task execution. Use when implementing background job processing, task queues, or scheduled task systems.

workflow-automationID: yonatangross/orchestkit/async-jobs

Install this agent skill to your local

pnpm dlx add-skill https://github.com/yonatangross/orchestkit/tree/HEAD/src/skills/async-jobs

Skill Files

Browse the full folder contents for async-jobs.

Download Skill

Loading file tree…

src/skills/async-jobs/SKILL.md

Skill Metadata

Name
async-jobs
Description
Async job processing patterns for background tasks, Celery workflows, task scheduling, retry strategies, and distributed task execution. Use when implementing background job processing, task queues, or scheduled task systems.

Async Jobs

Patterns for background task processing with Celery, ARQ, and Redis. Covers task queues, canvas workflows, scheduling, retry strategies, rate limiting, and production monitoring. Each category has individual rule files in references/ loaded on-demand.

Quick Reference

| Category | Rules | Impact | When to Use | |----------|-------|--------|-------------| | Configuration | celery-config | HIGH | Celery app setup, broker, serialization, worker tuning | | Task Routing | task-routing | HIGH | Priority queues, multi-queue workers, dynamic routing | | Canvas Workflows | canvas-workflows | HIGH | Chain, group, chord, nested workflows | | Retry Strategies | retry-strategies | HIGH | Exponential backoff, idempotency, dead letter queues | | Scheduling | scheduled-tasks | MEDIUM | Celery Beat, crontab, database-backed schedules | | Monitoring | monitoring-health | MEDIUM | Flower, custom events, health checks, metrics | | Result Backends | result-backends | MEDIUM | Redis results, custom states, progress tracking | | ARQ Patterns | arq-patterns | MEDIUM | Async Redis Queue for FastAPI, lightweight jobs | | Temporal Workflows | temporal-workflows | HIGH | Durable workflow definitions, sagas, signals, queries | | Temporal Activities | temporal-activities | HIGH | Activity patterns, workers, heartbeats, testing |

Total: 10 rules across 9 categories

Quick Start

@app.task(bind=True, max_retries=3, default_retry_delay=60)
def process_payment(self, order_id: str):
    try:
        return gateway.charge(order_id)
    except TransientError as exc:
        raise self.retry(exc=exc, countdown=2 ** self.request.retries * 60)

Load more examples: Read("${CLAUDE_SKILL_DIR}/references/quick-start-examples.md") for Celery retry task and ARQ/FastAPI integration patterns.

Configuration

Production Celery app configuration with secure defaults and worker tuning.

Key Patterns

  • JSON serialization with task_serializer="json" for safety
  • Late acknowledgment with task_acks_late=True to prevent task loss on crash
  • Time limits with both task_time_limit (hard) and task_soft_time_limit (soft)
  • Fair distribution with worker_prefetch_multiplier=1
  • Reject on lost with task_reject_on_worker_lost=True

Key Decisions

| Decision | Recommendation | |----------|----------------| | Serializer | JSON (never pickle) | | Ack mode | Late ack (task_acks_late=True) | | Prefetch | 1 for fair, 4-8 for throughput | | Time limit | soft < hard (e.g., 540/600) | | Timezone | UTC always |

Task Routing

Priority queue configuration with multi-queue workers and dynamic routing.

Key Patterns

  • Named queues for critical/high/default/low/bulk separation
  • Redis priority with queue_order_strategy: "priority" and 0-9 levels
  • Task router classes for dynamic routing based on task attributes
  • Per-queue workers with tuned concurrency and prefetch settings
  • Content-based routing for dynamic workflow dispatch

Key Decisions

| Decision | Recommendation | |----------|----------------| | Queue count | 3-5 (critical/high/default/low/bulk) | | Priority levels | 0-9 with Redis x-max-priority | | Worker assignment | Dedicated workers per queue | | Prefetch | 1 for critical, 4-8 for bulk | | Routing | Router class for 5+ routing rules |

Canvas Workflows

Celery canvas primitives for sequential, parallel, and fan-in/fan-out workflows.

Key Patterns

  • Chain for sequential ETL pipelines with result passing
  • Group for parallel execution of independent tasks
  • Chord for fan-out/fan-in with aggregation callback
  • Immutable signatures (si()) for steps that ignore input
  • Nested workflows combining groups inside chains
  • Link error callbacks for workflow-level error handling

Key Decisions

| Decision | Recommendation | |----------|----------------| | Sequential | Chain with s() | | Parallel | Group for independent tasks | | Fan-in | Chord (all must succeed for callback) | | Ignore input | Use si() immutable signature | | Error in chain | Reject stops chain, retry continues | | Partial failures | Return error dict in chord tasks |

Retry Strategies

Retry patterns with exponential backoff, idempotency, and dead letter queues.

Key Patterns

  • Exponential backoff with retry_backoff=True and retry_backoff_max
  • Jitter with retry_jitter=True to prevent thundering herd
  • Idempotency keys in Redis to prevent duplicate processing
  • Dead letter queues for failed tasks requiring manual review
  • Task locking to prevent concurrent execution of singleton tasks
  • Base task classes with shared retry configuration

Key Decisions

| Decision | Recommendation | |----------|----------------| | Retry delay | Exponential backoff with jitter | | Max retries | 3-5 for transient, 0 for permanent | | Idempotency | Redis key with TTL | | Failed tasks | DLQ for manual review | | Singleton | Redis lock with TTL |

Scheduling

Celery Beat periodic task configuration with crontab, database-backed schedules, and overlap prevention.

Key Patterns

  • Crontab for time-based schedules (daily, weekly, monthly)
  • Interval for fixed-frequency tasks (every N seconds)
  • Database scheduler with django-celery-beat for dynamic schedules
  • Schedule locks to prevent overlapping long-running scheduled tasks
  • Adaptive polling with self-rescheduling tasks

Key Decisions

| Decision | Recommendation | |----------|----------------| | Schedule type | Crontab for time-based, interval for frequency | | Dynamic | Database scheduler (django-celery-beat) | | Overlap | Redis lock with timeout | | Beat process | Separate process (not embedded) | | Timezone | UTC always |

Monitoring

Production monitoring with Flower, custom signals, health checks, and Prometheus metrics.

Key Patterns

  • Flower dashboard for real-time task monitoring
  • Celery signals (task_prerun, task_postrun, task_failure) for metrics
  • Health check endpoint verifying broker connection and active workers
  • Queue depth monitoring for autoscaling decisions
  • Beat monitoring for scheduled task dispatch tracking

Key Decisions

| Decision | Recommendation | |----------|----------------| | Dashboard | Flower with persistent storage | | Metrics | Prometheus via celery signals | | Health | Broker + worker + queue depth | | Alerting | Signal on task_failure | | Autoscale | Queue depth > threshold |

Result Backends

Task result storage, custom states, and progress tracking patterns.

Key Patterns

  • Redis backend for task status and small results
  • Custom task states (VALIDATING, PROCESSING, UPLOADING) for progress
  • update_state() for real-time progress reporting
  • S3/database for large result storage (never Redis)
  • AsyncResult for querying task state and progress

Key Decisions

| Decision | Recommendation | |----------|----------------| | Status storage | Redis result backend | | Large results | S3 or database (never Redis) | | Progress | Custom states with update_state() | | Result query | AsyncResult with state checks |

ARQ Patterns

Lightweight async Redis Queue for FastAPI and simple background tasks.

Key Patterns

  • Native async/await with arq for FastAPI integration
  • Worker lifecycle with startup/shutdown hooks for resource management
  • Job enqueue from FastAPI routes with enqueue_job()
  • Job status tracking with Job.status() and Job.result()
  • Delayed tasks with _delay=timedelta() for deferred execution

Key Decisions

| Decision | Recommendation | |----------|----------------| | Simple async | ARQ (native async) | | Complex workflows | Celery (chains, chords) | | In-process quick | FastAPI BackgroundTasks | | LLM workflows | LangGraph (not Celery) |

Tool Selection

Load: Read("${CLAUDE_SKILL_DIR}/references/quick-start-examples.md") for the full tool comparison table (ARQ, Celery, RQ, Dramatiq, FastAPI BackgroundTasks).

Anti-Patterns (FORBIDDEN)

Load details: Read("${CLAUDE_SKILL_DIR}/references/anti-patterns.md") for full list.

Key rules: never run long tasks in request handlers, never block on results inside tasks, never store large results in Redis, always use idempotency for retried tasks.

Temporal Workflows

Durable execution engine for reliable distributed applications with Temporal.io.

Key Patterns

  • Workflow definitions with @workflow.defn and deterministic code
  • Saga pattern with compensation for multi-step transactions
  • Signals and queries for external interaction with running workflows
  • Timers with workflow.wait_condition() for human-in-the-loop
  • Parallel activities via asyncio.gather inside workflows

Key Decisions

| Decision | Recommendation | |----------|----------------| | Workflow ID | Business-meaningful, idempotent | | Determinism | Use workflow.random(), workflow.now() | | I/O | Always via activities, never directly |

Temporal Activities

Activity and worker patterns for Temporal.io I/O operations.

Key Patterns

  • Activity definitions with @activity.defn for all I/O
  • Heartbeating for long-running activities (> 60s)
  • Error classification with ApplicationError(non_retryable=True) for business errors
  • Worker configuration with dedicated task queues
  • Testing with WorkflowEnvironment.start_local()

Key Decisions

| Decision | Recommendation | |----------|----------------| | Activity timeout | start_to_close for most cases | | Error handling | Non-retryable for business errors | | Testing | WorkflowEnvironment for integration tests |

Related Skills

  • ork:python-backend - FastAPI, asyncio, SQLAlchemy patterns
  • ork:langgraph - LangGraph workflow patterns (use for LLM workflows, not Celery)
  • ork:distributed-systems - Resilience patterns, circuit breakers
  • ork:monitoring-observability - Metrics and alerting

Capability Details

Load details: Read("${CLAUDE_SKILL_DIR}/references/capability-details.md") for full keyword index and problem-solution mapping across all 8 capabilities.