Back to tags
Tag

Agent Skills with tag: content-extraction

18 skills match this tag. Use tags to discover related Agent Skills and explore similar workflows.

article-extractor

Extract clean article content from URLs and save as markdown. Triggers when user provides a webpage URL and wants to download it, extract content, get a clean version without ads, capture an article for offline reading, save an article, grab content from a page, archive a webpage, clip an article, or read something later. Handles blog posts, news articles, tutorials, documentation pages, and similar web content. Supports Wayback Machine for dead links or paywalled content. This skill handles the entire workflow - do NOT use web_fetch or other tools first, just call the extraction script directly with the URL.

markdownweb-scrapingoffline-accesscontent-extraction
jrajasekera
jrajasekera
0

tapestry

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

content-extractionaction-planningunified-processingmultimedia-handling
prof-ramos
prof-ramos
0

article-extractor

Extract clean article content from URLs (blog posts, articles, tutorials) and save as readable text. Use when user wants to download, extract, or save an article/blog post from a URL without ads, navigation, or clutter.

articlescontent-extractiontext-extractionweb-scraping
prof-ramos
prof-ramos
0

article-extractor

Extract clean article content from URLs (blog posts, articles, tutorials) and save as readable text. Use when user wants to download, extract, or save an article/blog post from a URL without ads, navigation, or clutter.

articlescontent-extractiontext-extractionurl-processing
ovachiever
ovachiever
81

make-distilled

Transform raw captured content into distilled knowledge by extracting summary, key points, principles, patterns, entities, and quotes, storing the result in the distilled/ directory.

content-extractionsummarizationinformation-organizationknowledge-capture
dudarev
dudarev
0

get-youtube-transcript-raw

Capture a YouTube video transcript as raw material using `ytt`, storing it in the raw/ directory with minimal metadata for later distillation.

youtubetranscriptyttcontent-extraction
dudarev
dudarev
0

get-web-page-raw

Capture a web page as raw material (extracted text/Markdown) with metadata, storing it in the raw/ directory for later distillation.

web-scrapingmarkdowncontent-extractionautomation
dudarev
dudarev
0

browser

Web scraping using shot-scraper. Read web pages, extract content, interact with websites.

web-scrapingbrowser-automationcontent-extractionshot-scraper
ciallo-agent
ciallo-agent
1

document-summary

Summarize documents, extract key points, and generate structured outlines

summarizationcontent-extractionoutline-generationdocument-processing
tatat
tatat
1

website-crawler

Crawl and ingest websites into whorl. Use when scraping a personal site, blog, or extracting web content for the knowledge base.

web-crawlingcontent-extractionknowledge-basescraping
Uzay-G
Uzay-G
1

web-reader

Implement web page content extraction capabilities using the z-ai-web-dev-sdk. Use this skill when the user needs to scrape web pages, extract article content, retrieve page metadata, or build applications that process web content. Supports automatic content extraction with title, HTML, and publication time retrieval.

web-scrapingcontent-extractionmetadata-retrievalhtml
UholySmokes
UholySmokes
1

fetch-web

Fetch web pages, scrape content, fill forms, take screenshots.

web-scrapingform-fillingscreenshot-captureweb-automation
michalvavra
michalvavra
1

web-scraping

Web scraping with anti-bot bypass, content extraction, and poison pill detection. Use when extracting content from websites, handling paywalls, implementing scraping cascades, or processing social media. Covers trafilatura, Playwright with stealth mode, yt-dlp, and instaloader patterns.

web-scrapingheadless-browserpaywall-bypasscontent-extraction
jamditis
jamditis
1

article-extractor

Extract clean article content from URLs using reader. Use when user wants to download/extract/save an article from a URL.

content-extractionweb-scrapinghttp
srid
srid
7

tapestry

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

content-extractiondocument-processingsummarizationtask-planning
gupsammy
gupsammy
102

skill-creator

Use when the user has a document (PDF, markdown, book notes, research paper, methodology guide) containing theoretical knowledge or frameworks and wants to convert it into an actionable, reusable skill. Invoke when the user mentions "create a skill from this document", "turn this into a skill", "extract a skill from this file", or when analyzing documents with methodologies, frameworks, processes, or systematic approaches that could be made actionable for future use.

skill-creationdocument-processingpdf-processingmarkdown
lyndonkl
lyndonkl
82

webinar-to-content-multiplier

Convert webinar recordings into blog posts, social snippets, email series. Extract key quotes, statistics, and soundbites.

content-writingcontent-extractionsocial-mediaemail-marketing
OneWave-AI
OneWave-AI
237

web-fetch

Fetch and extract clean content from URLs using Jina Reader API. Use when users need to read webpage content, extract article text, or fetch URL content for analysis. Triggers on "fetch this page", "read this URL", "extract content from", "get the content of", "what does this page say".

web-scrapingcontent-extractiontext-extractionapi
vaayne
vaayne
20