pdf-parser
使用 MinerU API 将 PDF 解析为 Markdown,支持公式、表格、OCR。提供本地文件和在线 URL 两种解析方式。触发条件:(1) 用户说"解析 PDF [路径]",(2) 用户说"将 PDF 转为 Markdown",(3) 在 paper-workflow 中自动调用。使用场景:学术论文解析、文档提取、知识库构建。
Best use case
pdf-parser is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
使用 MinerU API 将 PDF 解析为 Markdown,支持公式、表格、OCR。提供本地文件和在线 URL 两种解析方式。触发条件:(1) 用户说"解析 PDF [路径]",(2) 用户说"将 PDF 转为 Markdown",(3) 在 paper-workflow 中自动调用。使用场景:学术论文解析、文档提取、知识库构建。
Teams using pdf-parser should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/mineru-pdf-parser/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How pdf-parser Compares
| Feature / Agent | pdf-parser | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
使用 MinerU API 将 PDF 解析为 Markdown,支持公式、表格、OCR。提供本地文件和在线 URL 两种解析方式。触发条件:(1) 用户说"解析 PDF [路径]",(2) 用户说"将 PDF 转为 Markdown",(3) 在 paper-workflow 中自动调用。使用场景:学术论文解析、文档提取、知识库构建。
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
ChatGPT vs Claude for Agent Skills
Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
SKILL.md Source
# PDF Parser Skill 基于 [MinerU](https://github.com/opendatalab/MinerU) 提供 PDF 解析能力。 ## 功能 - **PDF 解析**: 将 PDF 转换为 Markdown 格式 - **公式识别**: 支持 LaTeX 公式提取 - **表格识别**: 自动识别并转换表格结构 - **OCR**: 支持图片型 PDF 文字识别 - **多语言**: 支持中文、英文,日文、韩文等 ## ⚠️ 安装前必读 **使用本技能即表示:** 1. 你愿意提供你的 MinerU API Token (`MINERU_TOKEN`) 2. Token 会被发送给 https://mineru.net/ 3. 确认 MinerU 服务可信,接受其隐私政策 4. 已在本地源码中确认无额外意外行为 ## 前提条件 ### 1. 安装依赖 ```bash pip install requests ``` ### 2. 获取 MinerU Token 访问 <https://mineru.net/> 注册并获取 API Token。 ### 3. 设置环境变量 **Windows (PowerShell):** ```powershell $env:MINERU_TOKEN = "your-token-here" ``` **macOS / Linux:** ```bash export MINERU_TOKEN=your-token-here ``` ## 支持的引擎 | 引擎 | 说明 | |------|------| | vlm | VLM 引擎(默认) | | pipeline | 管道引擎 | | MinerU-HTML | HTML 输出 | ## 快速开始 ```bash # 解析 PDF (默认 vlm 引擎) python scripts/mineru_api.py -f <pdf路径> --wait # 指定引擎 python scripts/mineru_api.py -f <pdf路径> --engine pipeline --wait ``` ## 选项 | 参数 | 说明 | 默认值 | |------|------|--------| | -f, --files | 本地 PDF 文件 | - | | --engine | 解析引擎 | vlm | | --lang | 语言 (ch/en/ja/ko) | ch | | --wait | 等待解析完成 | 否 | ## 环境变量 | 变量 | 必填 | 说明 | |------|------|------| | MINERU_TOKEN | 是 | MinerU API Token | ## 输出 解析结果保存在 `~/.openclaw/MinerU_Results/` 目录下。 ## 工作流 1. 设置 `MINERU_TOKEN` 环境变量 2. 执行解析命令 3. 等待解析完成 4. 读取 full.md 分析内容 5. 根据内容重命名目录
Related Skills
content-parser
Extract and parse content from URLs. Triggers on: user provides a URL to extract content from, another skill needs to parse source material, "parse this URL", "extract content", "解析链接", "提取内容".
resume-parser
智能简历解析系统,支持PDF/Word/图片格式简历的结构化信息提取、岗位匹配度分析、优化建议生成。完全本地运行,无需外部API。使用场景:(1) 解析上传的简历文件提取核心信息,(2) 输入岗位JD计算简历匹配度,(3) 生成简历优化建议,(4) 导出结构化简历数据。
multimodal-parser
Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing
document-parser
高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。
Name: unidoc_parser
Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.
Name: u2-doc-parser
Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.
clinicaltrials-gov-parser
Monitor and summarize competitor clinical trial status changes from ClinicalTrials.gov. Trigger: When user asks to track clinical trials, monitor trial status changes, get updates on specific trials, or analyze competitor trial activities. Use cases: Pharma competitive intelligence, trial monitoring, status tracking, recruitment updates, completion alerts.
---
name: article-factory-wechat
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
tavily-search
Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.
baidu-search
Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.