Back to tags
Tag

Agent Skills with tag: ocr

4 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

markitdown

Convert various file formats (PDF, Office documents, images, audio, web content, structured data) to Markdown optimized for LLM processing. Use when converting documents to markdown, extracting text from PDFs/Office files, transcribing audio, performing OCR on images, extracting YouTube transcripts, or processing batches of files. Supports 20+ formats including DOCX, XLSX, PPTX, PDF, HTML, EPUB, CSV, JSON, images with OCR, and audio with transcription.

markdownpdf-processingocraudio-transcription
ovachiever
ovachiever
81

ocr-document-processor

Extract text from images and scanned PDFs using OCR. Supports 100+ languages, table detection, structured output (markdown/JSON), and batch processing.

ocrtext-extractionbatch-processingmultilingual
dkyazzentwatwa
dkyazzentwatwa
3

sear

Semantic search and RAG for documents. Use when user needs to index PDF/DOCX/text files, perform semantic search, extract relevant content from document corpuses, or build RAG applications. Supports multi-corpus search, GPU acceleration, line-level citations, and document conversion with OCR.

document-processingsemantic-searchretrieval-augmented-generationocr
Guard8-ai
Guard8-ai
6

markdown-converter

Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.

file-conversiondocument-processingmarkdownpdf-processing
steipete
steipete
91180