Defuddle
Use the Defuddle CLI to extract clean, readable content from web pages. Prefer it over WebFetch for standard web pages: it removes navigation, ads, and other clutter, which reduces token usage.
If not installed: npm install -g defuddle
Usage
Always use -m for markdown output:
defuddle parse <url> -m
Save to file:
defuddle parse <url> -m -o content.md
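The install check and the two invocations above can be combined in a small wrapper. This is only a sketch: `fetch_clean` is a hypothetical helper name, not part of Defuddle, and it uses only the `parse`, `-m`, and `-o` options already shown here.

```shell
# Hypothetical helper (not part of Defuddle): fetch_clean URL [OUTFILE]
# Prints markdown to stdout, or writes it to OUTFILE when one is given.
fetch_clean() {
  url=$1
  out=${2:-}

  # Bail out with an install hint if the CLI is not on PATH.
  if ! command -v defuddle >/dev/null 2>&1; then
    echo "defuddle not found; install with: npm install -g defuddle" >&2
    return 127
  fi

  if [ -n "$out" ]; then
    defuddle parse "$url" -m -o "$out"
  else
    defuddle parse "$url" -m
  fi
}

# Example call (assumed URL):
#   fetch_clean https://example.com content.md
```

Quoting `"$url"` keeps URLs with `&` or `?` in the query string from being split or interpreted by the shell.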
Extract specific metadata:
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
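When several properties are needed, the three commands above can be run in a loop. A minimal sketch, assuming each `-p` call prints just the property value; `print_metadata` is a hypothetical helper name, not a Defuddle command.

```shell
# Hypothetical helper (not part of Defuddle): print_metadata URL
# Runs one `defuddle parse -p` call per property and labels each result.
print_metadata() {
  url=$1
  for prop in title description domain; do
    printf '%s: %s\n' "$prop" "$(defuddle parse "$url" -p "$prop")"
  done
}

# Example call (assumed URL):
#   print_metadata https://example.com
```

Each property triggers a separate fetch of the page, so for many properties the `--json` output may be the cheaper option.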
Output formats
| Flag | Format |
| ----------- | -------------------------------- |
| -m | Markdown (recommended) |
| --json | JSON with both HTML and markdown |
| (none) | HTML |
| -p <name> | Specific metadata property |