Data Skills Discovery
Provides automatic access to comprehensive data skills.
When This Skill Activates
This skill auto-activates when you're working with:
- ETL
- data pipelines
- batch processing
- stream processing
- data validation
- orchestration
- Airflow
- timely dataflow
- differential dataflow
- streaming aggregations
- windowing
- real-time analytics
Available Skills
Quick Reference
The Data category contains 9 skills:
- batch-processing - Orchestrating complex data pipelines with dependencies
- data-validation - Validating data schema before processing
- dataflow-coordination - Coordination patterns for distributed dataflow systems
- differential-dataflow - Differential computation for incremental updates and efficient joins
- etl-patterns - Designing data extraction from multiple sources
- pipeline-orchestration - Coordinating complex multi-step data workflows
- stream-processing - Processing real-time event streams (Kafka, Flink)
- streaming-aggregations - Windowing, sessionization, time-series aggregation
- timely-dataflow - Low-latency streaming computation with progress tracking
Load Full Category Details
For complete descriptions and workflows:
Read <cc-polymath-root>/skills/data/INDEX.md
This loads the full Data category index with:
- Detailed skill descriptions
- Usage triggers for each skill
- Common workflow combinations
- Cross-references to related skills
Load Specific Skills
Load individual skills as needed:
Traditional ETL/Batch
Read <cc-polymath-root>/skills/data/batch-processing.md Read <cc-polymath-root>/skills/data/data-validation.md Read <cc-polymath-root>/skills/data/etl-patterns.md Read <cc-polymath-root>/skills/data/pipeline-orchestration.md
Stream Processing
Read <cc-polymath-root>/skills/data/stream-processing.md Read <cc-polymath-root>/skills/data/streaming-aggregations.md
Advanced Dataflow Systems
Read <cc-polymath-root>/skills/data/timely-dataflow.md Read <cc-polymath-root>/skills/data/differential-dataflow.md Read <cc-polymath-root>/skills/data/dataflow-coordination.md
Common Workflow Combinations
Real-Time Analytics Pipeline
Load these skills together:
Read <cc-polymath-root>/skills/data/stream-processing.md # Kafka setup Read <cc-polymath-root>/skills/data/streaming-aggregations.md # Windowing patterns Read <cc-polymath-root>/skills/data/dataflow-coordination.md # Coordination
Incremental Computation System
Load these skills together:
Read <cc-polymath-root>/skills/data/timely-dataflow.md # Foundation Read <cc-polymath-root>/skills/data/differential-dataflow.md # Incremental updates Read <cc-polymath-root>/skills/data/dataflow-coordination.md # Distributed coordination
Hybrid Batch + Stream
Load these skills together:
Read <cc-polymath-root>/skills/data/batch-processing.md # Batch jobs Read <cc-polymath-root>/skills/data/stream-processing.md # Stream processing Read <cc-polymath-root>/skills/data/pipeline-orchestration.md # Overall coordination
Progressive Loading
This gateway skill enables progressive loading:
- Level 1: Gateway loads automatically (you're here now)
- Level 2: Load category INDEX.md for full overview
- Level 3: Load specific skills as needed
Usage Instructions
- Auto-activation: This skill loads automatically when Claude Code detects data work
- Browse skills: Run
Read <cc-polymath-root>/skills/data/INDEX.mdfor full category overview - Load specific skills: Use bash commands above to load individual skills
Next Steps: Run Read <cc-polymath-root>/skills/data/INDEX.md to see full category details.