ClaudeContent Extraction

defuddle

Extract clean markdown content from web pages using Defuddle CLI, removing clutter and navigation to save tokens. Use instead of WebFetch when the user provides a URL to read or analyze, for online documentation, articles, blog posts, or any standard web page.

31,392 stars

bysickn33

Complexity: medium

View on GitHub Installation ↓

About this skill

The 'defuddle' skill leverages the Defuddle CLI to transform any standard web page (such as articles, blog posts, online documentation, or news pages) into clean, readable markdown content. It intelligently strips away common web page clutter, including navigation menus, advertisements, sidebars, footers, and other extraneous elements, to isolate the primary textual content. This process significantly reduces the amount of data an AI agent needs to process, leading to improved token efficiency, faster processing, and more focused analysis or summarization. Integrated within the 'antigravity-awesome-skills' collection, 'defuddle' is designed to be a superior alternative to generic web fetching methods when the goal is to analyze or extract concise information from human-readable web content without being overwhelmed by noise.

Best use case

Efficiently process, summarize, or analyze web page content by stripping away visual clutter and extraneous information, ensuring AI agents focus only on the core text. This is particularly useful for tasks requiring concise information extraction from online articles, documentation, or blog posts, where token usage and content purity are critical considerations.

The agent will receive a clean, well-structured markdown representation of the primary content from the provided URL, free of typical web page clutter. This processed content will be significantly shorter than raw HTML, enabling more efficient analysis, summarization, or other text-based tasks by the AI agent, with lower token consumption and a clearer focus on relevant information.

Practical example

Example input

Please summarize the main points of this article: `https://www.theverge.com/23971408/ai-agents-google-openai-microsoft-anthropic-future`

Example output

The article discusses the emerging trend of 'AI agents' developed by tech giants like Google, OpenAI, Microsoft, and Anthropic. These agents are designed to perform complex, multi-step tasks autonomously by chaining together various tools and making decisions, rather than just responding to single prompts. Key points include their potential to revolutionize productivity, the challenges of ensuring reliability and safety, and the vision for a future where AI handles more intricate workflows, freeing up human time for higher-level tasks. The author also touches on the ethical implications and the race among companies to define this new frontier of AI.

When to use this skill

When an AI agent needs to read, summarize, or analyze the primary content of a standard web page provided by a user's URL.
To maximize token efficiency and reduce processing costs by eliminating irrelevant content like ads, navigation, and sidebars before feeding text to the agent.
For extracting clean, structured text from online articles, blog posts, documentation, news pages, or any content-heavy URL.
As a preferred alternative to generic 'WebFetch' or raw page-scraping methods when the goal is content understanding rather than raw HTML inspection.

When not to use this skill

When the task requires interacting with interactive web elements, forms, JavaScript-driven components, or dynamically loaded content.
If the AI agent needs to analyze the full, raw HTML structure, specific metadata, styling, or non-textual elements (images, videos) of a web page beyond their textual context.
For extracting data from highly dynamic web applications or Single Page Applications (SPAs) that heavily rely on client-side rendering beyond initial content load, which Defuddle might not fully process.
When the user's intent is to view the entire visual layout or specific non-content elements of a page, rather than just the core text.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/defuddle/SKILL.md --create-dirs "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/main/plugins/antigravity-awesome-skills-claude/skills/defuddle/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/defuddle/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How defuddle Compares

Feature / Agent	defuddle	Standard Approach
Platform Support	Claude	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	medium	N/A

Frequently Asked Questions

What does this skill do?

Which AI agents support this skill?

This skill is designed for Claude.

How difficult is it to install?

The installation complexity is rated as medium. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

Best AI Skills for Claude

Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.

ChatGPT vs Claude for Agent Skills

Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.

AI Agent for SaaS Idea Validation

Use AI agent skills for SaaS idea validation, market research, customer discovery, competitor analysis, and documenting startup hypotheses.

SKILL.md Source

# Defuddle

Use Defuddle CLI to extract clean readable content from web pages. Prefer over WebFetch for standard web pages — it removes navigation, ads, and clutter, reducing token usage.

## When to Use

- Use when the user provides a normal webpage URL to read, summarize, or analyze.
- Prefer it over noisy page-fetch approaches when token efficiency matters.
- Use for docs, articles, blog posts, and similar public web content.

If not installed: `npm install -g defuddle`

## Usage

Always use `--md` for markdown output:

```bash
defuddle parse <url> --md
```

Save to file:

```bash
defuddle parse <url> --md -o content.md
```

Extract specific metadata:

```bash
defuddle parse <url> -p title
defuddle parse <url> -p description
defuddle parse <url> -p domain
```

## Output formats

| Flag | Format |
|------|--------|
| `--md` | Markdown (default choice) |
| `--json` | JSON with both HTML and markdown |
| (none) | HTML |
| `-p <name>` | Specific metadata property |

Related Skills

nft-standards

31392

from sickn33/antigravity-awesome-skills

Master ERC-721 and ERC-1155 NFT standards, metadata best practices, and advanced NFT features.

Web3 & BlockchainClaude

nextjs-app-router-patterns

31392

from sickn33/antigravity-awesome-skills

Comprehensive patterns for Next.js 14+ App Router architecture, Server Components, and modern full-stack React development.

Web FrameworksClaude

new-rails-project

31392

from sickn33/antigravity-awesome-skills

Create a new Rails project

Code GenerationClaude

networkx

31392

from sickn33/antigravity-awesome-skills

NetworkX is a Python package for creating, manipulating, and analyzing complex networks and graphs.

Network AnalysisClaude

network-engineer

31392

from sickn33/antigravity-awesome-skills

Expert network engineer specializing in modern cloud networking, security architectures, and performance optimization.

Network EngineeringClaude

nestjs-expert

31392

from sickn33/antigravity-awesome-skills

You are an expert in Nest.js with deep knowledge of enterprise-grade Node.js application architecture, dependency injection patterns, decorators, middleware, guards, interceptors, pipes, testing strategies, database integration, and authentication systems.

Frameworks & LibrariesClaude

nerdzao-elite

31392

from sickn33/antigravity-awesome-skills

Senior Elite Software Engineer (15+) and Senior Product Designer. Full workflow with planning, architecture, TDD, clean code, and pixel-perfect UX validation.

Software DevelopmentClaude

nerdzao-elite-gemini-high

31392

from sickn33/antigravity-awesome-skills

Modo Elite Coder + UX Pixel-Perfect otimizado especificamente para Gemini 3.1 Pro High. Workflow completo com foco em qualidade máxima e eficiência de tokens.

Software DevelopmentClaudeGemini

native-data-fetching

31392

from sickn33/antigravity-awesome-skills

Use when implementing or debugging ANY network request, API call, or data fetching. Covers fetch API, React Query, SWR, error handling, caching, offline support, and Expo Router data loaders (useLoaderData).

API IntegrationClaude

n8n-workflow-patterns

31392

from sickn33/antigravity-awesome-skills

Proven architectural patterns for building n8n workflows.

Workflow AutomationClaude

n8n-validation-expert

31392

from sickn33/antigravity-awesome-skills

Expert guide for interpreting and fixing n8n validation errors.

Workflow AutomationClaude

n8n-node-configuration

31392

from sickn33/antigravity-awesome-skills

Operation-aware node configuration guidance. Use when configuring nodes, understanding property dependencies, determining required fields, choosing between get_node detail levels, or learning common configuration patterns by node type.

Workflow AutomationClaude