Agent Skills: PDF Processing

Extract text and tables from PDFs; use when PDFs, forms, or document extraction are mentioned.

Uncategorized
ID: BA-CalderonMorales/codex-cheat-sheet/pdf-processing

skills/pdf-processing/SKILL.md

Skill Metadata

Name
pdf-processing
Description
Extract text and tables from PDFs; use when PDFs, forms, or document extraction are mentioned.

PDF Processing

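  • Use pdfplumber to extract text from PDFs.
  • Install it with pip install pdfplumber.
  • Extract the text of each page and join the pages with newlines:
    import pdfplumber
    
    with pdfplumber.open("input.pdf") as pdf:
        # A page with no extractable text may yield None, so fall back to "".
        text = "\n".join(page.extract_text() or "" for page in pdf.pages)
    
  • pdfplumber only reads PDFs. For form filling, pair the extracted text with your form template or validation steps.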
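
The description also mentions tables, which the steps above do not cover. Below is a minimal sketch of table extraction, assuming pdfplumber's page.extract_tables(), which returns a list of tables per page, each table being a list of rows whose cells are strings or None. The helper names collect_tables and tables_from_file are hypothetical and not part of pdfplumber.

```python
from typing import Iterable, List


def collect_tables(pages: Iterable) -> List[dict]:
    """Collect every table found on the given pages.

    `pages` is any iterable of objects exposing extract_tables(),
    such as pdfplumber's pdf.pages. Hypothetical helper for illustration.
    """
    found = []
    for page_number, page in enumerate(pages, start=1):
        for table in page.extract_tables():
            # Empty cells come back as None; normalize them to "".
            rows = [[cell or "" for cell in row] for row in table]
            found.append({"page": page_number, "rows": rows})
    return found


def tables_from_file(path: str) -> List[dict]:
    """Open a PDF with pdfplumber and collect its tables."""
    import pdfplumber  # imported here so collect_tables works without it

    with pdfplumber.open(path) as pdf:
        return collect_tables(pdf.pages)
```

Calling tables_from_file("input.pdf") would return one entry per detected table, tagged with its 1-based page number. Table detection depends on the PDF's layout, so treat the rows as a starting point for validation rather than a guaranteed result.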