jina-reader

Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.

7 stars

byDemerzels-lab

View on GitHub Installation ↓

Best use case

jina-reader is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.

Teams using jina-reader should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/jina-reader/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/ericsantos/jina-reader/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/jina-reader/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How jina-reader Compares

Feature / Agent	jina-reader	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Jina Reader

Extract clean web content via Jina AI — without exposing your server IP.

## Read a URL

```bash
{baseDir}/scripts/reader.sh "https://example.com/article"
```

## Search the web (top 5 results with full content)

```bash
{baseDir}/scripts/reader.sh --mode search "latest AI news 2025"
```

## Fact-check a statement

```bash
{baseDir}/scripts/reader.sh --mode ground "OpenAI was founded in 2015"
```

## Options

| Flag | Description | Default |
|------|-------------|---------|
| `--mode` | `read`, `search`, `ground` | `read` |
| `--selector` | CSS selector to extract specific region | — |
| `--wait` | CSS selector to wait for before extraction | — |
| `--remove` | CSS selectors to remove (comma-separated) | — |
| `--proxy` | Country code for geo-proxy (`br`, `us`, etc.) | — |
| `--nocache` | Force fresh content (skip cache) | off |
| `--format` | `markdown`, `html`, `text`, `screenshot` | `markdown` |
| `--json` | Raw JSON output | off |

## Examples

```bash
# Extract article content
{baseDir}/scripts/reader.sh "https://blog.example.com/post"

# Extract specific section via CSS selector
{baseDir}/scripts/reader.sh --selector "article.main" "https://example.com"

# Remove nav and ads before extraction
{baseDir}/scripts/reader.sh --remove "nav,footer,.ads" "https://example.com"

# Search with JSON output
{baseDir}/scripts/reader.sh --mode search --json "AI enterprise trends"

# Read via Brazil proxy
{baseDir}/scripts/reader.sh --proxy br "https://example.com.br"

# Fact-check a claim
{baseDir}/scripts/reader.sh --mode ground "Tesla is the most valuable car company"
```

## API Key

```bash
export JINA_API_KEY="jina_..."
```

Free tier: 10M tokens (no signup needed). Get key at https://jina.ai/reader/

## Pricing

- **Read:** ~$0.005/page (standard) | 3x for ReaderLM-v2
- **Search:** 10K tokens fixed + variable per result
- **Ground:** ~300K tokens/request (~30s latency)

## Why Jina Reader?

- **IP protection** — requests route through Jina's infra, not your server
- **Clean markdown** — readability extraction + optional ReaderLM-v2
- **Dynamic content** — headless Chrome renders JavaScript
- **Structured extraction** — JSON schema support for data extraction

Related Skills

book-reader

from Demerzels-lab/elsamultiskillagent

Read books (epub, pdf, txt) from various sources with progress tracking.

iyeque-pdf-reader

from Demerzels-lab/elsamultiskillagent

Extract text, search inside PDFs, and produce summaries.

twitter-reader

from Demerzels-lab/elsamultiskillagent

A comprehensive skill for reading and extracting data from X (formerly Twitter) tweets using multiple reliable data.

rss-reader

from Demerzels-lab/elsamultiskillagent

Monitor RSS and Atom feeds for content research.

safe-file-reader

from Demerzels-lab/elsamultiskillagent

Read files from documents directory safely

castreader

from Demerzels-lab/elsamultiskillagent

Read any web page aloud with natural AI voices. The only skill that extracts content from URLs (including Kindle, WeRead, Notion, Google Docs) and converts to MP3 audio with paragraph-level highlighting.

sergei-mikhailov-tg-channel-reader

from Demerzels-lab/elsamultiskillagent

Read and summarize posts from Telegram channels via MTProto (Pyrogram or Telethon)

rss-ai-reader

from Demerzels-lab/elsamultiskillagent

📰 RSS AI 阅读器 — 自动抓取订阅、LLM生成摘要、多渠道推送！支持 Claude/OpenAI 生成中文摘要，推送到飞书/Telegram/Email。触发条件: 用户要求订阅RSS、监控博客、抓取新闻、生成摘要、设置定时抓取、 "帮我订阅"、"监控这个网站"、"每天推送新闻"、RSS/Atom feed 相关。

4chan-reader

from Demerzels-lab/elsamultiskillagent

Browse 4chan boards and extract thread discussions into structured text files. Use when you need to fetch catalog information or specific thread content (including post text and file metadata) from 4chan boards like /a/, /vg/, /v/, etc.

jina-ai

from Demerzels-lab/elsamultiskillagent

Web reading and searching via Jina AI APIs.

paylock

from Demerzels-lab/elsamultiskillagent

Non-custodial SOL escrow for AI agent deals.

agent-reputation

from Demerzels-lab/elsamultiskillagent

summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.