screenshot-tool

网页截图 + 文档截图工具。支持网页全页截图、PPT/Word/Excel/PDF 转高清图片。保留原始样式,300 DPI 高清输出。

3,807 stars

Best use case

screenshot-tool is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

网页截图 + 文档截图工具。支持网页全页截图、PPT/Word/Excel/PDF 转高清图片。保留原始样式,300 DPI 高清输出。

Teams using screenshot-tool should expect a more consistent output, faster repeated execution, less prompt rewriting, better workflow continuity with your supporting tools.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.
  • You already have the supporting tools or dependencies needed by this skill.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/screenshot-tool/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/834948655/screenshot-tool/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/screenshot-tool/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How screenshot-tool Compares

Feature / Agentscreenshot-toolStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

网页截图 + 文档截图工具。支持网页全页截图、PPT/Word/Excel/PDF 转高清图片。保留原始样式,300 DPI 高清输出。

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Screenshot Tool - 网页 & 文档截图工具

支持网页截图和文档转高清图片,保留原始样式。

## 功能

| 功能 | 说明 |
|------|------|
| **网页截图** | 使用 headless 浏览器截图,支持整页、单页 |
| **文档转图** | PPT/Word/Excel/PDF 转 300 DPI 高清图片 |
| **高清输出** | 4000×2250 像素,适合打印和展示 |

## 依赖安装

### 必需依赖

| 依赖 | 用途 | 安装命令 |
|------|------|---------|
| **agent-browser** | 网页截图 | `npm install -g agent-browser && agent-browser install` |
| **LibreOffice** | 文档转 PDF | `sudo apt-get install -y libreoffice-impress libreoffice-writer libreoffice-calc` |
| **poppler-utils** | PDF 处理 | `sudo apt-get install -y poppler-utils` |
| **Python 库** | PDF 转图片 | `pip3 install pdf2image pillow` |

### 安装步骤

```bash
# 1. 安装 agent-browser(网页截图必需)
npm install -g agent-browser
agent-browser install
agent-browser install --with-deps  # 如需要系统依赖

# 2. 安装 LibreOffice(文档转换必需)
sudo apt-get install -y libreoffice-impress libreoffice-writer libreoffice-calc

# 3. 安装 poppler-utils(PDF 处理必需)
sudo apt-get install -y poppler-utils

# 4. 安装 Python 依赖
pip3 install pdf2image pillow

# 5. 安装中文字体(可选,用于中文文档)
sudo apt-get install -y fonts-wqy-zenhei fonts-wqy-microhei fonts-noto-cjk
```

### 验证安装

```bash
# 验证 agent-browser
agent-browser --version

# 验证 LibreOffice
libreoffice --version

# 验证 poppler
which pdftoppm pdfinfo
```

## 使用方法

### 1. 网页截图

```bash
# 截图单个网页
python3 skills/screenshot-tool/scripts/web_screenshot.py --url "https://example.com" --output page.png

# 截图并滚动(长页面)
python3 skills/screenshot-tool/scripts/web_screenshot.py --url "https://example.com" --full-page --output page.png
```

### 2. 文档转图片

```bash
# PPT/Word/Excel/PDF 转图片
python3 skills/screenshot-tool/scripts/doc_screenshot.py --input file.pptx --output-dir ./images

# 指定 DPI(默认 300)
python3 skills/screenshot-tool/scripts/doc_screenshot.py --input file.pdf --dpi 200 --output-dir ./images
```

### 3. 使用 agent-browser 截图

```bash
# 打开网页
agent-browser open "https://example.com" --timeout 60000

# 截图
agent-browser screenshot output.png --full

# 关闭浏览器
agent-browser close
```

## 支持的格式

### 文档格式
| 格式 | 扩展名 | 状态 |
|------|--------|------|
| PowerPoint | .pptx, .ppt | ✅ 支持 |
| Word | .docx, .doc | ✅ 支持 |
| Excel | .xlsx, .xls | ✅ 支持 |
| PDF | .pdf | ✅ 支持 |
| OpenDocument | .odp, .odt, .ods | ✅ 支持 |

### 网页截图
| 方式 | 说明 | 依赖 |
|------|------|------|
| agent-browser | 使用 headless Chrome | **agent-browser** |
| OpenClaw browser | 内置浏览器工具 | OpenClaw 内置 |

## 输出规格

| 参数 | 默认值 | 说明 |
|------|--------|------|
| DPI | 300 | 分辨率 |
| 格式 | PNG | 图片格式 |
| 尺寸 | 4000×2250 | 16:9 比例 |

## 示例

### 示例1:网页截图
```bash
# 截图京东首页
python3 skills/screenshot-tool/scripts/web_screenshot.py \
  --url "https://www.jd.com" \
  --output jd_homepage.png \
  --wait 5
```

### 示例2:PPT 转图片
```bash
# 转换整个 PPT
python3 skills/screenshot-tool/scripts/doc_screenshot.py \
  --input presentation.pptx \
  --output-dir ./slides \
  --dpi 300
```

### 示例3:PDF 转图片
```bash
# 转换 PDF 前5页
python3 skills/screenshot-tool/scripts/doc_screenshot.py \
  --input document.pdf \
  --output-dir ./pages \
  --first-page 1 \
  --last-page 5
```

## 流程说明

### 文档转图片流程
```
PPT/Word/Excel → LibreOffice → PDF → pdf2image → PNG (300 DPI)
                                         ↑
                                    依赖: poppler-utils
```

### 网页截图流程
```
URL → agent-browser (headless Chrome) → Screenshot → PNG
              ↑
        依赖: agent-browser CLI
```

## 故障排除

### LibreOffice 转换失败
```bash
# 检查 LibreOffice 安装
libreoffice --version

# 手动转换测试
libreoffice --headless --convert-to pdf file.pptx
```

### pdf2image 错误
```bash
# 检查 poppler 安装
which pdftoppm pdfinfo

# 重新安装
sudo apt-get install -y poppler-utils
```

### 中文字体显示问题
```bash
# 安装中文字体
sudo apt-get install -y fonts-wqy-zenhei fonts-wqy-microhei fonts-noto-cjk
```

## 文件结构

```
skills/screenshot-tool/
├── SKILL.md              # 本文件
├── scripts/
│   ├── web_screenshot.py    # 网页截图脚本
│   └── doc_screenshot.py    # 文档转图片脚本
└── README.md             # 详细说明
```

## License

MIT

Related Skills

AI Coding Toolkit — Master Every AI Coding Assistant

3807
from openclaw/skills

> The complete methodology for 10X productivity with AI-assisted development. Covers Cursor, Windsurf, Cline, Aider, Claude Code, GitHub Copilot, and more — tool-agnostic principles that work everywhere.

china-tools-sourcing

3807
from openclaw/skills

Comprehensive tools industry sourcing guide for international buyers – provides detailed information about China's hand tools, power tools, garden tools, measuring tools, and industrial tool manufacturing clusters, supply chain structure, regional specializations, and industry trends (2026 updated).

github-tools

3807
from openclaw/skills

Interact with GitHub using the `gh` CLI. Use `gh issue`, `gh pr`, `gh run`, and `gh api` for issues, PRs, CI runs, and advanced queries.

DevOps & Infrastructure

web-screenshot

3807
from openclaw/skills

Capture screenshots of web pages running on local or remote servers using Puppeteer in headless Chromium. Use when user asks to screenshot web pages, capture web UI, take website screenshots, or document web application interfaces. Supports login-required SPAs (Vue/React/Angular) by performing form-based authentication before navigating. Generates screenshots and an optional result.json with per-page descriptions.

chrome-devtools-mcp-manager

3807
from openclaw/skills

Manage chrome-devtools-mcp service and OpenClaw's built-in Chrome browser for MCP-based browser automation. Use when user needs to use chrome-devtools-mcp functionality, ensure the browser is ready for MCP operations, or manage the browser/MCP lifecycle.

pydantic-ai-tool-system

3807
from openclaw/skills

Register and implement PydanticAI tools with proper context handling, type annotations, and docstrings. Use when adding tool capabilities to agents, implementing function calling, or creating agent actions.

🔮 Divination — Oracle Toolkit for AI Agents

3807
from openclaw/skills

*"At every crossroads lies a message. Chance is the messenger. You are the reader."*

prompt-engineer-toolkit

3807
from openclaw/skills

Analyzes and rewrites prompts for better AI output, creates reusable prompt templates for marketing use cases (ad copy, email campaigns, social media), and structures end-to-end AI content workflows. Use when the user wants to improve prompts for AI-assisted marketing, build prompt templates, or optimize AI content workflows. Also use when the user mentions 'prompt engineering,' 'improve my prompts,' 'AI writing quality,' 'prompt templates,' or 'AI content workflow.'

product-manager-toolkit

3807
from openclaw/skills

Comprehensive toolkit for product managers including RICE prioritization, customer interview analysis, PRD templates, discovery frameworks, and go-to-market strategies. Use for feature prioritization, user research synthesis, requirement documentation, and product strategy development.

free-tool-strategy

3807
from openclaw/skills

When the user wants to plan, evaluate, or build a free tool for marketing purposes — lead generation, SEO value, or brand awareness. Also use when the user mentions "engineering as marketing," "free tool," "marketing tool," "calculator," "generator," "interactive tool," "lead gen tool," "build a tool for leads," or "free resource." This skill bridges engineering and marketing — useful for founders and technical marketers.

jules-tools-skill

3807
from openclaw/skills

Interface with Google's Jules Tools CLI to manage AI coding sessions.

WhatsApp Business Suite — AI Leads, Channels, Campaigns & 32 MCP Tools

3807
from openclaw/skills

Automate WhatsApp at scale — mine leads from groups with AI, broadcast to channel followers, bulk message with ban-safe delays, schedule campaigns, auto-reply in your voice, collect reviews, and track delivery. 90+ REST endpoints, 32 MCP tools for Claude & GPT, Python SDK. No Meta Business API required. Free tier available.