skywork-document

Generate professional documents in multiple formats (docx, pdf, html, md) from scratch or based on user files. Supports web search for up-to-date content. Use when the expected output is longer than a short answer and benefits from structure and formatting. Do NOT use for short plain-text answers, code files, or casual Q&A.

242 stars

byaiskillstore

View on GitHub Installation ↓

Best use case

skywork-document is best used when you need a repeatable AI agent workflow instead of a one-off prompt. It is especially useful for teams working in multi. Generate professional documents in multiple formats (docx, pdf, html, md) from scratch or based on user files. Supports web search for up-to-date content. Use when the expected output is longer than a short answer and benefits from structure and formatting. Do NOT use for short plain-text answers, code files, or casual Q&A.

Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.

Practical example

Example input

Use the "skywork-document" skill to help with this workflow task. Context: Generate professional documents in multiple formats (docx, pdf, html, md) from scratch or based on user files. Supports web search for up-to-date content. Use when the expected output is longer than a short answer and benefits from structure and formatting. Do NOT use for short plain-text answers, code files, or casual Q&A.

Example output

A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.

When to use this skill

Use this skill when you want a reusable workflow rather than writing the same prompt again and again.

When not to use this skill

Do not use this when you only need a one-off answer and do not need a reusable workflow.
Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/skywork-document/SKILL.md --create-dirs "https://raw.githubusercontent.com/aiskillstore/marketplace/main/skills/skyworkai/skywork-document/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/skywork-document/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How skywork-document Compares

Feature / Agent	skywork-document	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Doc — Professional Document Generator

Generate professional, beautifully formatted documents by calling the Skywork Office Doc API.

---

## Authentication (Required First)

Before using this skill, authentication must be completed. Run the auth script first:

```bash
# Authenticate: checks env token / cached token / browser login
python3 <skill-dir>/scripts/skywork_auth.py || exit 1
```

**Token priority**:
1. Environment variable `SKYBOT_TOKEN` → if set, use directly
2. Cached token file `~/.skywork_token` → validate via API, if valid, use it
3. No valid token → opens browser for login, polls until complete, saves token

**IMPORTANT - Login URL handling**: If script output contains a line starting with `[LOGIN_URL]`, you **MUST** immediately send that URL to the user in a clickable message (e.g. "Please open this link to log in: <url>"). The user may be in an environment where the browser cannot open automatically, so always surface the login URL.

---


## Workflow

### Step 0: Intent Recognition (CRITICAL - Do This First)

**Before calling any script, analyze the user's request and determine**:

1. **Does the user provide reference files, or imply that certain files are needed to proceed with the writing task?**
   - Look for file paths, attachments, or mentions like "based on this PDF", "use the uploaded document". If you gathered info beforehand (e.g., web search, other tools) that would help the writing task, save it to disk as files and pass them as reference files in Step 1.
   - If YES: find/extract file paths → proceed to Step 1
   - If NO: skip to Step 2

2. **What language should the output be in?**
   - Analyze the user's request language or explicit requirement. If unspecified, infer from the user's language or the language used in uploaded files.
   - Set `--language` parameter: `English`, `中文简体`, etc.
   - Default: `English`  

3. **What format does the user want?**
   - Look for keywords: "Word document" → `docx`, "PDF" → `pdf`, "HTML" → `html`, "Markdown" → `md`
   - Default if not specified: `docx`
   - **Supported formats**: `docx`, `pdf`, `html`, `md`

4. **How to write the content prompt?**
   - The `--content` parameter is like a **rewrite query**
   - Synthesize user's requirements (possibly from multiple conversation turns)
   - Be specific: describe structure, sections, tone, key points. Avoid being overly verbose or straying far from the user's original requirements; stay close to their intent to ensure accuracy.


### Step 1: Parse Reference Files (If User Provides Files)

**IMPORTANT**: 
- `parse_file.py` processes **one file at a time**. For multiple files, call it multiple times. 
- Quote any file path that contains spaces so arguments are passed correctly. 
- Parse all reference material the user needs for the writing task as files. If a file was already parsed earlier in the session, skip re-parsing and reuse its `file_id`.

**Single file**:
```bash
python3 <skill-dir>/scripts/parse_file.py /path/to/reference.pdf
```

**Multiple files** (call the script once for each file; you can run these in parallel to speed things up):
```bash
# Parse file 1
python3 <skill-dir>/scripts/parse_file.py /path/to/file1.pdf

# Parse file 2
python3 <skill-dir>/scripts/parse_file.py /path/to/file2.xlsx

# Parse file 3
python3 <skill-dir>/scripts/parse_file.py "/path/to/file3 with blank in it.docx"
```

**Each script call outputs**:
```
[parse] File: reference.pdf (2,458,123 bytes)
...
[success] File parsed!
  File ID:    2032146192467681280
  ...
PARSED_FILE: {"file_id":"2032146192467681280","filename":"reference.pdf","url":""}
```

**Extract all `PARSED_FILE` outputs** and collect them into a JSON array:
```json
[
  {"file_id":"2032146192467681280","filename":"file1.pdf","url":""},
  {"file_id":"2032146192467681281","filename":"file2.xlsx","url":""},
  {"file_id":"2032146192467681282","filename":"file3.docx","url":""}
]
```

This array will be passed to `create_doc.py` via the `--files` parameter below.

### Step 2: Create Document

**Without reference files**:
```bash
python3 <skill-dir>/scripts/create_doc.py \
  --title "Document_Title" \
  --content "Detailed content prompt based on user requirements..." \
  --language English \
  --format docx
```

**With reference files** (use the collected file_ids from Step 1):
```bash
python3 <skill-dir>/scripts/create_doc.py \
  --title "Analysis_Report" \
  --content "Based on the uploaded reference files, create a comprehensive analysis report..." \
  --files '[{"file_id":"id1","filename":"file1.pdf","url":""},{"file_id":"id2","filename":"file2.xlsx","url":""}]' \
  --language English \
  --format docx
```

> The `title` field should not contain spaces.

**Output**:
```
[doc] Creating document: "Analysis Report"
...
[success] Document created!
  File ID:   abc-123
  Path:      /output/doc/some_file.html
  URL:       https://...
  Time:      15.2s
```

### Step 3: Deliver Result

After `create_doc.py` finishes, parse the final JSON output. It contains two ways for the user to access the document — **always provide both**:

- **`file_url`** — the remote download link (cloud URL). Include it as a clickable hyperlink so the user can open it in a browser or share it.
- **`file_path`** — the absolute local path where the file was automatically downloaded on their machine. Mention this path explicitly so the user can find the file right away without manual downloading.

Example reply (adapt wording to user's language):

> The document is ready!
> - **Download link**: [巴西电网行业及充电桩市场调研报告.docx](https://...)
> - **Local file**: `/Users/alice/Downloads/巴西电网行业及充电桩市场调研报告.docx`

If `file_path` is empty (download failed), still provide `file_url` and inform the user they can download manually.

---

## Script Parameters

### parse_file.py
- `file` - Path to the reference file (required)
- `--json` - Output full result as JSON (optional)

**Key Output**: `PARSED_FILE: <json>` — extract this for Step 2

### create_doc.py
- `--title` - Document title (required)
- `--content` - **Content prompt** describing what to write (required)
  - This is like a rewrite query — synthesize user's requirements
  - Be specific about structure, sections, tone, key points
- `--files` - JSON array of file objects from parse_file.py (optional)
  - Format: `[{"file_id":"xxx","filename":"yyy","url":""}]`
- `--language` - Output language (optional, default: `English`)
  - Examples: `English`, `中文简体`, `中文繁體`, `日本語`, `한국어`, `Français`, `Deutsch`, `Español`, ...
- `--format` - Output format (optional, default: `docx`)
  - **Supported**: `docx`, `pdf`, `html`, `md`

---

## Important Notes

1. **Intent Recognition First** - Always analyze the user's request before calling scripts.
2. **Web Search Built-In** - The Doc API automatically performs web searches on demand to gather relevant content for document creation. Whether you pre-search for materials externally or not is entirely optional—either approach works fine.
3. **File ID is the Bridge** - `parse_file.py` outputs `file_id` → pass to `create_doc.py` via `--files`.
4. **Server Fetches Content** - No need to paste `parsed_content` manually; the server retrieves it using `file_id`.
5. **Content is Rewrite Query** - Synthesize the user's requirements into a clear, detailed prompt. Even when the user's instructions are long or complex, capture every requirement—don't omit anything.
6. **Generation Takes Time** - Document generation typically takes 5-10 minutes, sometimes longer for complex documents.
7. **Scripts Wait Automatically** - `create_doc.py` uses SSE (Server-Sent Events) to maintain a long connection and receives real-time progress updates. The script will automatically wait up to 3~10 minutes for completion. **No manual polling needed** - just wait for the script to finish and it will output the result.
8. **Progress Display** - The script shows a real-time progress bar during generation. The AI agent should relay this to the user to set expectations.
9. **Final Document Delivery** - **CRITICAL**: Upon successful execution of `create_doc.py`, the output JSON contains both `file_url` (remote download link) and `file_path` (local path where the file was automatically saved). **You MUST proactively return both to the user**: the clickable `file_url` so they can share or open it online, and the `file_path` so they can locate it immediately on their machine. If `file_path` is empty, notify the user and provide `file_url` for manual download.

---

## Error Handling

| Error | Solution |
|-------|----------|
| `NO_TOKEN` / `INVALID_TOKEN` | Run auth workflow |
| `Cannot reach server` | Check network connection |
| `JSON parse error` | Use double quotes in --files JSON |
| **Insufficient benefit** | Script or log may show e.g. `Insufficient benefit. Please upgrade your account at {url}` — see below |

### How to reply when benefit is insufficient

When you detect the above, **reply in the user's current language** — do not echo the English message. Use this pattern:

- Convey: "Sorry, document generation failed. This skill requires upgrading your Skywork membership to use." then a single call-to-action link.
- **Format**: One short sentence in the user's language + a link like `[Upgrade now →](url)` or the equivalent in their language.
- **URL**: Extract the upgrade URL from the log/script output (e.g. the `at https://...` part).

## Technical Notes
- Generation takes 5-10 minutes, set sufficient timeout. Because `create_doc.py` may run for a long time. As SSE events arrive, display each stage to the user. This keeps them informed during the generation.

Related Skills

skywork-search

242

from aiskillstore/marketplace

Search the web for real-time information using the Skywork web search API. Use this skill whenever the user needs up-to-date information from the internet — for example, researching a topic, looking up recent events, finding facts or statistics, gathering material for a document or presentation, or answering questions that require current data. Also trigger when the user says things like "search for", "look up", "find information about", "what's the latest on", or any request that implies needing information beyond your training data.

skywork-ppt

242

from aiskillstore/marketplace

Generate PPTs from topics or templates, edit existing presentations via natural language, or perform local file operations (delete/reorder/merge slides). Trigger on requests to create, modify, or manipulate .pptx files. Built on Nano Banana 2. Requires Python 3.8+.

skywork-music-maker

242

from aiskillstore/marketplace

Create professional music with Mureka AI API — songs, instrumentals, and lyrics from natural language descriptions in any language. Use when users want to generate a song, create a beat or instrumental, write lyrics, clone vocals, upload reference tracks, or do anything related to AI music creation, even casual requests like "make me a chill lo-fi beat".

skywork-excel

242

from aiskillstore/marketplace

Excel generator with AI-powered data analysis, charts, formulas, and web search. Create spreadsheets, analyze CSV/Excel/PDF files, generate HTML reports, and get real-time data.

skywork-design

242

from aiskillstore/marketplace

Generate or edit images via backend Skywork Image API. Use for any image creation, poster design, logo design, visual asset generation, or image modification request. Supports text-to-image and image-to-image editing with aspect ratio and resolution control.

documentation-templates

242

from aiskillstore/marketplace

Documentation templates and structure guidelines. README, API docs, code comments, and AI-friendly documentation.

documentation-generation-doc-generate

242

from aiskillstore/marketplace

You are a documentation expert specializing in creating comprehensive, maintainable documentation from code. Generate API docs, architecture diagrams, user guides, and technical references using AI-powered analysis and industry best practices.

code-documentation-doc-generate

242

from aiskillstore/marketplace

code-documentation-code-explain

242

from aiskillstore/marketplace

You are a code education expert specializing in explaining complex code through clear narratives, visual diagrams, and step-by-step breakdowns. Transform difficult concepts into understandable explanations.

azure-search-documents-ts

242

from aiskillstore/marketplace

Build search applications using Azure AI Search SDK for JavaScript (@azure/search-documents). Use when creating/managing indexes, implementing vector/hybrid search, semantic ranking, or building agentic retrieval with knowledge bases.

azure-search-documents-py

242

from aiskillstore/marketplace

Azure AI Search SDK for Python. Use for vector search, hybrid search, semantic ranking, indexing, and skillsets. Triggers: "azure-search-documents", "SearchClient", "SearchIndexClient", "vector search", "hybrid search", "semantic search".

azure-search-documents-dotnet

242

from aiskillstore/marketplace

Azure AI Search SDK for .NET (Azure.Search.Documents). Use for building search applications with full-text, vector, semantic, and hybrid search. Covers SearchClient (queries, document CRUD), SearchIndexClient (index management), and SearchIndexerClient (indexers, skillsets). Triggers: "Azure Search .NET", "SearchClient", "SearchIndexClient", "vector search C#", "semantic search .NET", "hybrid search", "Azure.Search.Documents".