jina-reader
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.
Best use case
jina-reader is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.
Teams using jina-reader should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/jina-reader/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How jina-reader Compares
| Feature / Agent | jina-reader | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Web content extraction via Jina AI Reader API. Three modes: read (URL to markdown), search (web search + full content), ground (fact-checking). Extracts clean content without exposing server IP.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Jina Reader
Extract clean web content via Jina AI — without exposing your server IP.
## Read a URL
```bash
{baseDir}/scripts/reader.sh "https://example.com/article"
```
## Search the web (top 5 results with full content)
```bash
{baseDir}/scripts/reader.sh --mode search "latest AI news 2025"
```
## Fact-check a statement
```bash
{baseDir}/scripts/reader.sh --mode ground "OpenAI was founded in 2015"
```
## Options
| Flag | Description | Default |
|------|-------------|---------|
| `--mode` | `read`, `search`, `ground` | `read` |
| `--selector` | CSS selector to extract specific region | — |
| `--wait` | CSS selector to wait for before extraction | — |
| `--remove` | CSS selectors to remove (comma-separated) | — |
| `--proxy` | Country code for geo-proxy (`br`, `us`, etc.) | — |
| `--nocache` | Force fresh content (skip cache) | off |
| `--format` | `markdown`, `html`, `text`, `screenshot` | `markdown` |
| `--json` | Raw JSON output | off |
## Examples
```bash
# Extract article content
{baseDir}/scripts/reader.sh "https://blog.example.com/post"
# Extract specific section via CSS selector
{baseDir}/scripts/reader.sh --selector "article.main" "https://example.com"
# Remove nav and ads before extraction
{baseDir}/scripts/reader.sh --remove "nav,footer,.ads" "https://example.com"
# Search with JSON output
{baseDir}/scripts/reader.sh --mode search --json "AI enterprise trends"
# Read via Brazil proxy
{baseDir}/scripts/reader.sh --proxy br "https://example.com.br"
# Fact-check a claim
{baseDir}/scripts/reader.sh --mode ground "Tesla is the most valuable car company"
```
## API Key
```bash
export JINA_API_KEY="jina_..."
```
Free tier: 10M tokens (no signup needed). Get key at https://jina.ai/reader/
## Pricing
- **Read:** ~$0.005/page (standard) | 3x for ReaderLM-v2
- **Search:** 10K tokens fixed + variable per result
- **Ground:** ~300K tokens/request (~30s latency)
## Why Jina Reader?
- **IP protection** — requests route through Jina's infra, not your server
- **Clean markdown** — readability extraction + optional ReaderLM-v2
- **Dynamic content** — headless Chrome renders JavaScript
- **Structured extraction** — JSON schema support for data extractionRelated Skills
book-reader
Read books (epub, pdf, txt) from various sources with progress tracking.
iyeque-pdf-reader
Extract text, search inside PDFs, and produce summaries.
twitter-reader
A comprehensive skill for reading and extracting data from X (formerly Twitter) tweets using multiple reliable data.
rss-reader
Monitor RSS and Atom feeds for content research.
safe-file-reader
Read files from documents directory safely
castreader
Read any web page aloud with natural AI voices. The only skill that extracts content from URLs (including Kindle, WeRead, Notion, Google Docs) and converts to MP3 audio with paragraph-level highlighting.
sergei-mikhailov-tg-channel-reader
Read and summarize posts from Telegram channels via MTProto (Pyrogram or Telethon)
rss-ai-reader
📰 RSS AI 阅读器 — 自动抓取订阅、LLM生成摘要、多渠道推送! 支持 Claude/OpenAI 生成中文摘要,推送到飞书/Telegram/Email。 触发条件: 用户要求订阅RSS、监控博客、抓取新闻、生成摘要、设置定时抓取、 "帮我订阅"、"监控这个网站"、"每天推送新闻"、RSS/Atom feed 相关。
4chan-reader
Browse 4chan boards and extract thread discussions into structured text files. Use when you need to fetch catalog information or specific thread content (including post text and file metadata) from 4chan boards like /a/, /vg/, /v/, etc.
jina-ai
Web reading and searching via Jina AI APIs.
paylock
Non-custodial SOL escrow for AI agent deals.
agent-reputation
summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.