Agent Skills: Knowledge Extractor Skill

Extract tribal knowledge from code, documentation, and commit history to preserve institutional memory

Uncategorized
ID: a5c-ai/babysitter/knowledge-extractor

Install this agent skill locally:

pnpm dlx add-skill https://github.com/a5c-ai/babysitter/tree/HEAD/plugins/babysitter/skills/babysit/process/specializations/code-migration-modernization/skills/knowledge-extractor

plugins/babysitter/skills/babysit/process/specializations/code-migration-modernization/skills/knowledge-extractor/SKILL.md

Skill Metadata

Name: knowledge-extractor
Description: Extract tribal knowledge from code, documentation, and commit history to preserve institutional memory

Knowledge Extractor Skill

Extracts tribal knowledge from code comments, commit messages, documentation, and other sources to preserve institutional memory during migration.

Purpose

Enable knowledge preservation through:

  • Comment analysis and extraction
  • Commit message mining
  • Documentation parsing
  • Pattern recognition
  • Business rule discovery

Capabilities

1. Comment Analysis

  • Extract TODO/FIXME comments
  • Parse documentation comments
  • Identify explanatory notes
  • Find warning comments
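
As an illustration of comment analysis, the sketch below scans source text for marker comments and records them with the `type`, `file`, `line`, and `content` fields from the output schema. The marker list, the regular expression, and the names `Comment` and `extract_comments` are assumptions made for this example; the skill itself does not specify how comments are detected.

```python
import re
from dataclasses import dataclass

# Hypothetical marker-to-type mapping; the type names follow the output schema.
MARKER_TYPES = {"TODO": "todo", "FIXME": "fixme", "NOTE": "note", "WARNING": "warning"}

# Matches "#" or "//" comments whose text starts with one of the markers above.
COMMENT_RE = re.compile(r"(?:#|//)\s*(TODO|FIXME|NOTE|WARNING)\b[:\s]*(.*)")


@dataclass
class Comment:
    type: str
    file: str
    line: int
    content: str


def extract_comments(source: str, file: str) -> list[Comment]:
    """Return one Comment per line that carries a recognized marker."""
    found = []
    for lineno, text in enumerate(source.splitlines(), start=1):
        match = COMMENT_RE.search(text)
        if match:
            marker, content = match.groups()
            found.append(Comment(MARKER_TYPES[marker], file, lineno, content.strip()))
    return found
```

A line-based regex like this will also match markers inside string literals; a language-aware tokenizer would be more precise, at the cost of per-language support.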

2. Commit Message Mining

  • Extract rationale from commits
  • Identify bug fix context
  • Find feature explanations
  • Track decision history
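
One possible way to mine commit messages is to have `git log` emit machine-readable records and then parse them. The sketch below assumes a local clone and uses the ASCII unit and record separators as delimiters; the `isBugFix` field and its keyword heuristic are hypothetical additions that are not part of the output schema.

```python
import re
import subprocess

FIELD_SEP, RECORD_SEP = "\x1f", "\x1e"
# git pretty-format: hash, author name, subject, body, then a record separator.
GIT_FORMAT = "%H%x1f%an%x1f%s%x1f%b%x1e"
# Assumed keyword heuristic for spotting bug-fix commits.
FIX_RE = re.compile(r"\b(fix(es|ed)?|bug|regression|hotfix)\b", re.IGNORECASE)


def parse_log(raw: str) -> list[dict]:
    """Turn raw `git log` output in GIT_FORMAT into commit dictionaries."""
    commits = []
    for record in raw.split(RECORD_SEP):
        # Strip newlines only: Python treats "\x1f" as whitespace, so a bare
        # strip() would eat the separator before an empty body.
        record = record.strip("\n")
        if not record:
            continue
        commit_hash, author, subject, body = record.split(FIELD_SEP, 3)
        commits.append({
            "hash": commit_hash,
            "message": subject,
            "author": author,
            "context": body.strip(),
            "isBugFix": bool(FIX_RE.search(subject)),  # hypothetical field
        })
    return commits


def read_log(repo: str, limit: int = 200) -> list[dict]:
    """Run git in `repo` and parse the most recent `limit` commits."""
    raw = subprocess.run(
        ["git", "-C", repo, "log", "-n", str(limit), f"--pretty=format:{GIT_FORMAT}"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_log(raw)
```

Keeping `parse_log` separate from the subprocess call lets the parsing be tested on canned input without a repository.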

3. Documentation Parsing

  • Parse markdown documentation
  • Extract from wikis
  • Process README files
  • Catalog API docs
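
For markdown sources, a simple starting point is to split a document into sections at its headings so each section can be cataloged on its own. The sketch below handles only `#`-style headings and skips fenced code blocks; the function name `split_sections` and the section dictionary shape are assumptions for this example.

```python
import re

# "#"-style headings, levels 1 to 6.
HEADING_RE = re.compile(r"^(#{1,6})\s+(.+?)\s*$")
# Code-fence marker, built this way to avoid a literal fence in this example.
FENCE = "`" * 3


def split_sections(markdown: str) -> list[dict]:
    """Split markdown text into {level, title, body} sections."""
    sections = []
    current = {"level": 0, "title": "", "body": []}
    in_fence = False
    for line in markdown.splitlines():
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
        # Lines inside a code fence are never treated as headings.
        match = None if in_fence else HEADING_RE.match(line)
        if match:
            sections.append(current)
            current = {"level": len(match.group(1)), "title": match.group(2), "body": []}
        else:
            current["body"].append(line)
    sections.append(current)
    # Drop the leading placeholder section if nothing preceded the first heading.
    return [
        {"level": s["level"], "title": s["title"], "body": "\n".join(s["body"]).strip()}
        for s in sections
        if s["title"] or any(text.strip() for text in s["body"])
    ]
```

Underlined (setext) headings and wiki-specific markup are not covered here; a full markdown parser would be the safer choice for mixed sources.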

4. Pattern Recognition

  • Identify coding patterns
  • Recognize idioms
  • Detect conventions
  • Map architectural patterns

5. Business Rule Extraction

  • Find business logic comments
  • Extract validation rules
  • Identify calculation explanations
  • Document edge cases

6. Glossary Generation

  • Build domain vocabulary
  • Define abbreviations
  • Map term usage
  • Create terminology guide
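
Abbreviation definitions can sometimes be recovered from prose that spells a term out and follows it with the short form in parentheses. The sketch below is one such heuristic, assuming English text and all-caps abbreviations of two to six letters; `build_glossary` is a hypothetical name, and the result maps abbreviation to expansion in the spirit of the schema's `glossary` object.

```python
import re

# Parenthesized all-caps abbreviations such as "(GL)".
ABBR_RE = re.compile(r"\(([A-Z]{2,6})\)")


def build_glossary(text: str) -> dict[str, str]:
    """Map abbreviations to the preceding words whose initials spell them."""
    glossary = {}
    for match in ABBR_RE.finditer(text):
        abbr = match.group(1)
        # Take as many preceding words as the abbreviation has letters.
        words = re.findall(r"[A-Za-z]+", text[:match.start()])[-len(abbr):]
        initials = "".join(word[0].upper() for word in words)
        if initials == abbr:
            # Keep the first definition seen for each abbreviation.
            glossary.setdefault(abbr, " ".join(words))
    return glossary
```

For example, `build_glossary("Post to the General Ledger (GL) first.")` yields a single entry mapping `GL` to `General Ledger`. Abbreviations that skip words such as "of" or "and" will not match this heuristic.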

Tool Integrations

| Tool | Purpose | Integration Method |
|------|---------|-------------------|
| Sourcegraph | Code search | API |
| GitHub API | Commit history | API |
| grep/ripgrep | Pattern search | CLI |
| Custom NLP | Text analysis | Library |
| Confluence API | Wiki extraction | API |

Output Schema

{
  "extractionId": "string",
  "timestamp": "ISO8601",
  "knowledge": {
    "comments": [
      {
        "type": "todo|fixme|note|warning|explanation",
        "file": "string",
        "line": "number",
        "content": "string",
        "context": "string"
      }
    ],
    "commits": [
      {
        "hash": "string",
        "message": "string",
        "author": "string",
        "context": "string",
        "relatedFiles": []
      }
    ],
    "documentation": [],
    "businessRules": [],
    "glossary": {}
  }
}
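
To make the schema concrete, the sketch below assembles a result document with the same top-level keys and serializes it to JSON. The use of a UUID for `extractionId` and a UTC timestamp are assumptions; the schema only states that these are a string and an ISO 8601 value. The helper name `new_extraction` is hypothetical.

```python
import json
import uuid
from datetime import datetime, timezone


def new_extraction(comments=None, commits=None) -> dict:
    """Build a result document shaped like the output schema."""
    return {
        "extractionId": str(uuid.uuid4()),  # assumed ID scheme
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "knowledge": {
            "comments": list(comments or []),
            "commits": list(commits or []),
            "documentation": [],
            "businessRules": [],
            "glossary": {},
        },
    }


result = new_extraction(
    comments=[{
        "type": "warning",
        "file": "billing/invoice.py",  # illustrative path
        "line": 42,
        "content": "Do not change rounding mode",
        "context": "",
    }]
)
print(json.dumps(result, indent=2))
```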

Integration with Migration Processes

  • legacy-codebase-assessment: Knowledge discovery
  • documentation-migration: Source material

Related Skills

  • legacy-code-interpreter: Code understanding
  • documentation-generator: Doc creation

Related Agents

  • legacy-system-archaeologist: Uses this skill for excavation
  • documentation-migration-agent: Uses this skill for doc creation