Agent-Skills.md

Agent Skills: Web Scraping

Extract structured data from web pages using CSS selectors and XPath

Uncategorized
ID: tatat/agents-playground/web-scraping

Author

tatat

https://github.com/tatat

Repository

tatat/agents-playground

Install this agent skill locally

pnpm dlx add-skill https://github.com/tatat/agents-playground/tree/HEAD/skills/web-scraping

Skill Files

skills/web-scraping/SKILL.md

Skill Metadata

Name: web-scraping
Description: Extract structured data from web pages using CSS selectors and XPath

Web Scraping

Extract structured data from web pages.

Capabilities

Fetch HTML content from URLs
Parse and extract specific elements (tables, lists, text)
Handle pagination
Output in JSON or CSV format
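
The capabilities above could be sketched roughly as follows, using only the Python standard library. This is an illustration under stated assumptions, not the skill's actual implementation: the function names (fetch_html, paginate, find_next_link, to_json, to_csv) are hypothetical, and the rel="next" link convention is an assumed way of discovering the next page.

```python
import csv
import io
import json
import re
from typing import Callable, Dict, Iterator, List, Optional, Tuple
from urllib.request import urlopen


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Download a page and decode it, falling back to UTF-8 if no charset is declared."""
    with urlopen(url, timeout=timeout) as resp:
        charset = resp.headers.get_content_charset() or "utf-8"
        return resp.read().decode(charset, errors="replace")


def find_next_link(html: str) -> Optional[str]:
    """Assumed convention: the next page is linked as <a rel="next" href="...">."""
    match = re.search(r'<a[^>]*rel="next"[^>]*href="([^"]+)"', html)
    return match.group(1) if match else None


def paginate(
    start_url: str,
    fetch: Callable[[str], str],
    find_next: Callable[[str], Optional[str]],
    max_pages: int = 10,
) -> Iterator[Tuple[str, str]]:
    """Yield (url, html) pairs, following next links until none remain.

    Stops on a repeated URL or after max_pages pages to avoid endless loops.
    """
    url: Optional[str] = start_url
    visited = set()
    while url is not None and url not in visited and len(visited) < max_pages:
        visited.add(url)
        html = fetch(url)
        yield url, html
        url = find_next(html)


def to_json(rows: List[Dict[str, str]]) -> str:
    """Serialize extracted rows as a JSON array."""
    return json.dumps(rows, indent=2, ensure_ascii=False)


def to_csv(rows: List[Dict[str, str]], fields: List[str]) -> str:
    """Serialize extracted rows as CSV with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()
```

Passing the fetch function as a parameter keeps the pagination loop testable offline; in real use, fetch_html would be passed in together with a delay between requests.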

Supported Selectors

CSS selectors: .class, #id, tag
XPath expressions
Text patterns (regex)
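
As a rough illustration of how the three selector styles relate, the sketch below runs XPath queries and a regex over a small, well-formed snippet. It assumes Python's standard xml.etree.ElementTree, which supports only a limited XPath subset and requires well-formed markup; real-world HTML and true CSS selectors generally need a third-party parser. The snippet and the CSS equivalents in the comments are illustrative assumptions.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical, well-formed markup used only for this illustration.
SNIPPET = """
<body>
  <div id="catalog">
    <ul>
      <li class="item"><span class="name">Widget</span> <span class="price">$9.99</span></li>
      <li class="item"><span class="name">Gadget</span> <span class="price">$24.50</span></li>
    </ul>
  </div>
</body>
""".strip()

root = ET.fromstring(SNIPPET)

# CSS "li" (tag selector) corresponds roughly to this XPath:
by_tag = root.findall(".//li")

# CSS ".price" corresponds roughly to this XPath. Note the XPath form
# matches the whole class attribute exactly, not one token among several.
by_class = root.findall(".//*[@class='price']")

# CSS "#catalog" corresponds roughly to this XPath:
by_id = root.find(".//*[@id='catalog']")

# Text pattern (regex): pull the numeric amount out of each price string.
PRICE_RE = re.compile(r"\$(\d+(?:\.\d{2})?)")
prices = [float(PRICE_RE.search(el.text).group(1)) for el in by_class]

print(len(by_tag), by_id.tag, prices)
```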

Rate Limiting

Always respect robots.txt and wait between requests. The default delay is 1 second.
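
One possible way to apply both rules is sketched below with the standard urllib.robotparser module. The PoliteSession class and its parameters are hypothetical names introduced for this example; the clock and sleep arguments are injectable only so the behavior can be checked without actually waiting.

```python
import time
from typing import Callable, Iterable, Optional
from urllib.robotparser import RobotFileParser

DEFAULT_DELAY_SECONDS = 1.0  # matches the default delay stated above


class PoliteSession:
    """Hypothetical helper: checks robots.txt rules and spaces out requests."""

    def __init__(
        self,
        robots_lines: Iterable[str],
        user_agent: str = "*",
        delay: float = DEFAULT_DELAY_SECONDS,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self._robots = RobotFileParser()
        self._robots.parse(robots_lines)  # lines of an already-downloaded robots.txt
        self._user_agent = user_agent
        self._delay = delay
        self._clock = clock
        self._sleep = sleep
        self._last_request: Optional[float] = None

    def allowed(self, url: str) -> bool:
        """Return True if robots.txt permits this user agent to fetch the URL."""
        return self._robots.can_fetch(self._user_agent, url)

    def wait(self) -> None:
        """Block until at least `delay` seconds have passed since the last request."""
        if self._last_request is not None:
            remaining = self._delay - (self._clock() - self._last_request)
            if remaining > 0:
                self._sleep(remaining)
        self._last_request = self._clock()
```

A caller would check allowed(url) first, then call wait() immediately before each fetch, so the first request goes out at once and later ones are spaced by the delay.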

Example

Scrape product names and prices from example.com/products
Output as JSON with fields: name, price, url
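
A minimal sketch of what this request might produce is shown below, using Python's standard html.parser. The markup of example.com/products is not given in the source, so the structure assumed here (a flat div with class "product" containing an a.name link and a span.price) is purely hypothetical, as are the ProductParser and parse_products names.

```python
import json
from html.parser import HTMLParser
from typing import Dict, List, Optional
from urllib.parse import urljoin


class ProductParser(HTMLParser):
    """Collect products from assumed markup of the form:

    <div class="product">
      <a class="name" href="/p/1">Widget</a><span class="price">$9.99</span>
    </div>

    Assumes product divs are flat (no nested <div> inside them).
    """

    def __init__(self, base_url: str) -> None:
        super().__init__()
        self.base_url = base_url
        self.products: List[Dict[str, str]] = []
        self._current: Optional[Dict[str, str]] = None
        self._field: Optional[str] = None

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        classes = (attr.get("class") or "").split()
        if tag == "div" and "product" in classes:
            self._current = {"name": "", "price": "", "url": ""}
        elif self._current is not None and tag == "a" and "name" in classes:
            # Resolve relative links against the page URL.
            self._current["url"] = urljoin(self.base_url, attr.get("href") or "")
            self._field = "name"
        elif self._current is not None and tag == "span" and "price" in classes:
            self._field = "price"

    def handle_data(self, data):
        if self._current is not None and self._field is not None:
            self._current[self._field] += data.strip()

    def handle_endtag(self, tag):
        if tag in ("a", "span"):
            self._field = None
        elif tag == "div" and self._current is not None:
            self.products.append(self._current)
            self._current = None


def parse_products(html: str, base_url: str) -> List[Dict[str, str]]:
    """Return a list of {"name", "price", "url"} dicts found in the HTML."""
    parser = ProductParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.products


# Hypothetical page content standing in for a fetched response.
SAMPLE_HTML = (
    '<div class="product"><a class="name" href="/p/1">Widget</a>'
    '<span class="price">$9.99</span></div>'
    '<div class="product"><a class="name" href="/p/2">Gadget</a>'
    '<span class="price">$24.50</span></div>'
)

products = parse_products(SAMPLE_HTML, "https://example.com/products")
print(json.dumps(products, indent=2))
```

Prices are kept as strings here so the currency symbol survives; converting them to numbers would be a separate, deliberate step.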