Name: unidoc_parser

Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

3,891 stars

Best use case

Name: unidoc_parser is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

Teams using Name: unidoc_parser should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/unidoc-parser/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/aaiccee/unidoc-parser/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/unidoc-parser/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Name: unidoc_parser Compares

Feature / AgentName: unidoc_parserStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

Name: unidoc_parser
Description: Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling.

UniDoc Document Parser
======================

Overview
--------
Parse documents using UniDoc API for conversion to Markdown or JSON format. Supports both synchronous and asynchronous parsing with automatic status polling. Ideal for converting various document formats (PDF, DOC, DOCX, images) through a cloud-based API service.

Prereqs / when to read references
---------------------------------
If you encounter API errors, network issues, or need to understand the API endpoints, read:
* `references/unidoc-notes.md`

Quick start (single document)
-----------------------------
```bash
# Run from the skill directory
python scripts/unidoc_parse.py /path/to/file.pdf \
  --format md \
  --output ./unidoc-output \
  --mode sync
```

Options
-------
* `--format md|json` (default: `md`)
  - Output format: Markdown or JSON
* `--mode sync|async` (default: `sync`)
  - Synchronous mode: waits for conversion to complete
  - Asynchronous mode: polls status until completion
* `--func METHOD` (default: `unisound`)
  - Conversion method/algorithm to use
* `--output DIR` (default: `./unidoc-output`)
  - Output directory for converted files
- 
* `--uid UUID` (optional)
  - Custom user ID (auto-generated if not provided)

Output conventions
------------------
* Creates `./unidoc-output/<document_name>/` by default
* Markdown output: `output.md`
* JSON output: `output.json`
* Output filename preserves original document name

Notes
-----
* Requires network connectivity to UniDoc API (http://unidoc.uat.hivoice.cn)
* Supports multiple file formats: PDF, DOC, DOCX, PNG, JPG, etc.
* Async mode polls every 1 second until completion
* Max file size and rate limits depend on API service configuration
* For large files or batch processing, prefer async mode

Related Skills

content-parser

3891
from openclaw/skills

Extract and parse content from URLs. Triggers on: user provides a URL to extract content from, another skill needs to parse source material, "parse this URL", "extract content", "解析链接", "提取内容".

Data & Research

name: welight-wechat-layout-publish

3891
from openclaw/skills

description: Welight standalone skill for turning an article into WeChat Official Accounts compatible Markdown/HTML, presenting built-in theme choices, and publishing to WeChat as a draft or formal post when publishing prerequisites are already configured.

Content & Documentation

name: web3-weekly-report

3891
from openclaw/skills

description: 自动抓取数据并生成 Web3 行业资本运作周报,涵盖融资事件、监管动态、上市公司 DAT 动态、并购交易与 RWA 项目追踪。当用户提到"写周报"、"生成周报"、"整理本周融资"、"Web3 周报"、"资本运作周报"、"采编周报",或请求整理加密行业本周动态时,立即激活此 skill。即使用户只说"帮我写本周的",只要上下文涉及 Web3、加密、融资、RWA、DAT,也应激活。

name: notion-pro

3891
from openclaw/skills

description: Complete Notion API skill with Python CLI tool — auto-pagination, recursive blocks, 429 retry, and agent operation strategies.

resume-parser

3891
from openclaw/skills

智能简历解析系统,支持PDF/Word/图片格式简历的结构化信息提取、岗位匹配度分析、优化建议生成。完全本地运行,无需外部API。使用场景:(1) 解析上传的简历文件提取核心信息,(2) 输入岗位JD计算简历匹配度,(3) 生成简历优化建议,(4) 导出结构化简历数据。

multimodal-parser

3891
from openclaw/skills

Unified multi-modal content parser for images, PDF, DOCX, audio, auto OCR/transcription, output structured text for LLM processing

audio-rename

3891
from openclaw/skills

Rename audio files with Chinese/special characters to simple English names for mlx-stt compatibility.

name: wechat_messaging

3891
from openclaw/skills

description: 通过微信向好友发送消息。流程:查询好友 -> 确认目标 -> 发送内容。

name: wechat_operate

3891
from openclaw/skills

description: 通过微信进行社交管理与消息发送。流程:查询目标(好友/群聊/成员) -> 确认目标 -> 发送内容(文本/图片/文件)。

document-parser

3891
from openclaw/skills

高精度文档解析技能,从 PDF、图片、Word 文档中提取结构化数据。

name: Snipara MCP - Smart Documentation Search

3891
from openclaw/skills

description: Find answers in your codebase 10x faster with semantic search. Query multiple repos at once. AI remembers your preferences across sessions.

pdf-parser

3891
from openclaw/skills

使用 MinerU API 将 PDF 解析为 Markdown,支持公式、表格、OCR。提供本地文件和在线 URL 两种解析方式。触发条件:(1) 用户说"解析 PDF [路径]",(2) 用户说"将 PDF 转为 Markdown",(3) 在 paper-workflow 中自动调用。使用场景:学术论文解析、文档提取、知识库构建。