Agent-Skills.md

Agent Skills: PDF Processing Skill

Extract text and tables from PDF files, fill forms, merge documents. Use when working with PDF files or when the user mentions PDFs, forms, or document extraction.

UncategorizedID: aiskillstore/marketplace/pdf-processing

Author

aiskillstore

https://github.com/aiskillstore View all skills

Repository

aiskillstore/marketplace

aiskillstore

23014

Install this agent skill to your local

pnpm dlx add-skill https://github.com/aiskillstore/marketplace/tree/HEAD/skills/0xkynz/pdf-processing

Skill Files

Browse the full folder contents for pdf-processing.

Loading file tree…

skills/0xkynz/pdf-processing/SKILL.md

Skill Metadata

Name: pdf-processing
Description: Extract text and tables from PDF files, fill forms, merge documents. Use when working with PDF files or when the user mentions PDFs, forms, or document extraction.

PDF Processing Skill

This skill provides capabilities for working with PDF documents.

Quick Start

Use pdfplumber to extract text from PDFs:

import pdfplumber

with pdfplumber.open("document.pdf") as pdf:
    text = pdf.pages[0].extract_text()

Capabilities

Text Extraction

Extract text from single or multiple pages
Preserve layout and formatting
Handle multi-column documents

Table Extraction

Identify and extract tables
Convert to structured data (CSV, JSON)
Handle complex table layouts

Form Operations

Fill PDF forms programmatically
Extract form field values
Create fillable forms

Document Operations

Merge multiple PDFs
Split PDFs by page
Rotate pages
Add watermarks

Best Practices

Always check if the PDF is encrypted before processing
Handle OCR cases for scanned documents
Validate extracted data for accuracy
Use appropriate libraries (pdfplumber for extraction, PyPDF2 for manipulation)