Best use case
longform-visual-notes is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
MediaClaw内容生产的核心配图skill。当有长篇文章需要转化为可传播的视觉笔记时调用。
Teams using longform-visual-notes should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/longform-visual-notes/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How longform-visual-notes Compares
| Feature / Agent | longform-visual-notes | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
MediaClaw内容生产的核心配图skill。当有长篇文章需要转化为可传播的视觉笔记时调用。
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# 长文知识提取转图
> 将深度长文降维为3-5张高真实感视觉笔记图
## 定位
MediaClaw内容生产的核心配图skill。当有长篇文章需要转化为可传播的视觉笔记时调用。
**与其他skill的关系**:
- `content-ops-toolkit` → 素材配图阶段调用本skill
- `daily-gzh-content` → 公众号文章配图
- `daily-xhs-content` → 小红书笔记配图
- `daily-douyin-content` → 抖音图文配图
- `article-material-collect` → 素材收集后可补充本skill生成图
## 默认配置
| 配置项 | 值 |
|--------|-----|
| 生成模型 | qingyun-api / gemini-3-pro-image |
| 图片数量 | 3-5张 |
| 图片文字 | 中文为主 |
| 输出比例 | 16:9(公众号/抖音)或 3:4(小红书) |
| 作者标注 | 有作者则在图底部标明 |
| 禁止项 | 不得显示"由xx生成该图片"等水印 |
## Workflow
### Phase 1: 知识拆解与分镜规划
阅读输入文章,输出中文摘要和分镜方案:
```
📊 知识拆解与分镜规划
- 图1:黄金标题与核心定调 — 概念图/手写大纲 — [核心文字概览]
- 图2:核心竞争力矩阵 — 手写对比表/数据图表 — [核心文字概览]
- 图3:深度分析 — 思维导图/双边对比图 — [核心文字概览]
- 图4:总结与展望 — 白板架构图/手写红框总结 — [核心文字概览]
```
**分镜策略**:
| 文章类型 | 推荐分镜 | 视觉形式 |
|---------|---------|---------|
| 测评/对比 | 4-5张 | 封面→对比矩阵→细节分析→终端截图→总结 |
| 行业分析 | 3-4张 | 概念图→数据图→趋势图→总结 |
| 教程/指南 | 4-5张 | 封面→流程图→代码图→架构图→总结 |
| 产品评测 | 3-4张 | 封面→参数对比→使用场景→总结 |
### Phase 2: 生成图像提示词
为每张图生成纯英文Prompt,**严格遵循4模块结构**:
#### 模块结构
```
[模块1: 主提示词 Main Description]
描述整体环境、视角、材质、光线。
Example: A photorealistic, ultra-clear, high-resolution close-up photograph of a hand-written note on a piece of textured paper...
[模块2: 内容和排版 Content and Layout (Verbatim)]
极度详细地规定每一个文本的位置、颜色、排版层级。
使用粗体和引号圈定必须生成的文字。
1. Top: Main title "[提取的中文标题]"
2. Chart Structure: Three columns "[列名1]" | "[列名2]"...
[模块3: 上下文细节 Context & Environment Details]
背景环境、增加真实感的细节。
Example: The paper note is on a smartphone screen. Visible at the top is the phone status bar... natural imperfections...
[模块4: 质量和风格关键词 Quality & Style Keywords]
Example: Hand-written, detailed texture, legible, varied ink colors, accurate content reproduction, photorealistic, 8k...
```
### Phase 3: 调用模型生成图片
**调用方式**:通过 qingyun-api 使用 gemini-3-pro-image 模型。
```bash
# 调用 qingyun-api 生成图片
node ~/clawd/skills/qingyun-api/scripts/img-gen.js \
--prompt "<英文提示词>" \
--model "gemini-3-pro-image" \
--output "<输出路径>.png" \
--size "1536x1024"
```
如果 qingyun-api 脚本不可用,备选方案:
```bash
# 备选: 使用 OpenClaw image_generate
image_generate(
prompt="<英文提示词>",
model="qingyun/gemini-3-pro-image",
size="1536x1024",
filename="<输出名>.png"
)
```
最终兜底:
```bash
# 最终兜底: image_generate 默认模型
image_generate(
prompt="<英文提示词>",
size="1536x1024",
filename="<输出名>.png"
)
```
**生成优先级链**:
```
1. qingyun-api gemini-3-pro-image(首选,文字生成质量最高)
2. image_generate model=qingyun/gemini-3-pro-image
3. image_generate 默认模型(兜底)
```
### Phase 4: 保存与整理
```
{OUTPUT_DIR}/素材/
├── visual-note-01-封面.png
├── visual-note-02-对比矩阵.png
├── visual-note-03-深度分析.png
├── visual-note-04-总结.png
└── prompts/
├── 01-cover.md
├── 02-comparison.md
├── 03-analysis.md
└── 04-summary.md
```
**每个prompt文件保存完整的4模块英文提示词**,便于复用和调整。
## 视觉风格规范
### 可选视觉载体
| 风格 | 描述 | 适用场景 |
|------|------|---------|
| **Hand-written Note** | 手机/平板/纸张上的高密度手写图表 | 知识笔记、测评总结 |
| **Mind Map** | 手绘感或极简现代风白板思维导图 | 概念梳理、逻辑拆解 |
| **Architecture Diagram** | 专业批注的系统演算图 | 技术架构、流程分析 |
| **Comparison Chart** | 高对比度重点标红的参数对比表 | 产品对比、功能矩阵 |
### 强制规则
1. **中文文字** — 图片中必须包含中文文字内容,清晰可读
2. **作者标注** — 如原文有作者,图底部标明作者信息
3. **禁止生成水印** — 不得显示"由xx生成该图片"、"AI generated"等
4. **真实感** — 模拟真实场景(纸张纹理、手机屏幕、白板等)
5. **高信息密度** — 每张图包含足够多的有效信息,不是空洞装饰
### 排版规范
```
┌─────────────────────────────────┐
│ [主标题 - 大号粗体] │ ← 顶部标题区
│ [副标题 - 中号] │
├─────────────────────────────────┤
│ │
│ [核心内容区域] │ ← 主体内容区
│ - 文字说明、数据、图表 │ 占图面70-80%
│ - 对比表格、流程箭头 │
│ - 关键数据用颜色/加粗突出 │
│ │
├─────────────────────────────────┤
│ [作者: xxx | 来源: xxx] │ ← 底部信息区
└─────────────────────────────────┘
```
## 提示词样板
### 样板1: 手写笔记风
```
[Main Description]
A photorealistic, ultra-clear, high-resolution close-up photograph of a hand-written technical note on a piece of cream-colored textured paper. The paper is slightly tilted on a dark wooden desk. Natural lighting from the left creates subtle shadows. A black gel pen and a red highlighter are visible at the bottom corner.
[Content and Layout]
1. Top center, written in bold black marker: "Claude Code vs Codex — 终极对决"
2. Below the title, a horizontal red line separator
3. Left column header: "Claude Code" in blue ink, with 4 bullet points:
- "✅ 原生终端体验"
- "✅ 200k上下文窗口"
- "✅ 实时代码执行"
- "✅ Anthropic生态"
4. Right column header: "Codex" in green ink, with 4 bullet points:
- "✅ OpenAI模型驱动"
- "✅ 多模型切换"
- "✅ 云端沙箱"
- "✅ 插件市场"
5. Bottom right corner, small text: "作者: Daniel Li"
6. Key comparisons circled in red highlighter
[Context & Environment]
The paper has slight coffee stain in the corner. A phone edge is visible at the top of the frame. Natural handwriting imperfections visible. Slight paper grain texture.
[Quality & Style Keywords]
Hand-written, detailed texture, legible Chinese characters, varied ink colors (black, blue, red), accurate content reproduction, photorealistic, 8k resolution, warm ambient lighting, depth of field, professional knowledge note aesthetic.
```
### 样板2: 思维导图风
```
[Main Description]
A clean, modern mind map on a large whiteboard in a tech startup office. The mind map is drawn with colorful dry-erase markers. Clean white background with subtle grid pattern visible.
[Content and Layout]
1. Center node (large red circle): "AI编程工具"
2. Branch 1 (blue, upper left): "Claude Code" → sub-nodes: "终端原生", "200k上下文", "Anthropic"
3. Branch 2 (green, upper right): "Codex" → sub-nodes: "多模型", "云端", "OpenAI"
4. Branch 3 (orange, bottom left): "共同点" → sub-nodes: "代码补全", "项目理解", "Git集成"
5. Branch 4 (purple, bottom right): "选择建议" → sub-nodes: "个人→Claude", "团队→Codex"
[Context & Environment]
Whiteboard has slight smudge marks. A coffee cup shadow on the left edge. Office background slightly blurred.
[Quality & Style Keywords]
Clean mind map, colorful markers, whiteboard texture, professional tech aesthetic, modern office, legible text, clear hierarchy, 8k resolution.
```
## 调用入口
### 独立调用
```
请将以下文章转化为视觉笔记图:
- 文章路径: /path/to/article.md
- 图片数量: 3-5
- 目标平台: gzh/xhs/douyin
- 输出目录: /path/to/output/素材/
- 作者: Daniel Li(如有)
```
### 从content-ops-toolkit调用
在素材配图阶段,优先使用本skill:
```
素材配图策略(优先级):
1. longform-visual-notes — 长文核心知识转图(首选)
2. content-cover-gen — 封面图生成
3. image_generate — 兜底AI生图
```
### 从daily系列调用
```
# daily-gzh-content 配图阶段
优先调用 longform-visual-notes 生成文章配套视觉笔记图
# daily-xhs-content 配图阶段
优先调用 longform-visual-notes,图片比例改为 3:4
# daily-douyin-content 配图阶段
优先调用 longform-visual-notes,图片比例改为 16:9
```
## 依赖
| 依赖 | 说明 | 必需 |
|------|------|------|
| qingyun-api | gemini-3-pro-image 生图接口 | ✅(首选) |
| image_generate | OpenClaw原生生图 | ✅(兜底) |
| 内容文章 | 输入的长文 | ✅ |
## 质量标准
- [ ] 每张图包含清晰可读的中文文字
- [ ] 图片信息密度高(非空洞装饰)
- [ ] 有作者标注(如原文有作者)
- [ ] 无"AI生成"等水印
- [ ] 3-5张图覆盖文章核心内容
- [ ] prompt文件已保存(可复用)
- [ ] 图片保存到指定素材目录
## 铁律
1. **中文文字优先** — 图片必须含中文,清晰可读
2. **禁止AI水印** — 不得出现"由xx生成"等文字
3. **作者必须标注** — 原文有作者则图底部标明
4. **高信息密度** — 每张图都是知识载体,不是装饰
5. **4模块Prompt** — 生成提示词严格按 Main/Content/Context/Quality 结构
6. **优先qingyun** — gemini-3-pro-image文字生成质量最高,优先使用
7. **保存Prompt** — 每张图的提示词保存为.md文件,便于复用
## 更新日志
- **v1.0.0** (2026-04-16): 初始版本
- 4模块Prompt结构
- 4种视觉风格(手写笔记/思维导图/架构图/对比矩阵)
- qingyun-api gemini-3-pro-image 优先
- daily系列skills集成指南Related Skills
meeting-notes
>
wemp-operator
> 微信公众号全功能运营——草稿/发布/评论/用户/素材/群发/统计/菜单/二维码 API 封装
zsxq-smart-publish
Publish and manage content on 知识星球 (zsxq.com). Supports talk posts, Q&A, long articles, file sharing, digest/bookmark, homework tasks, and tag management. Use when publishing content to 知识星球, creating/editing posts, uploading files/images/audio, managing digests, batch publishing, or formatting content for 知识星球.
zoom-automation
Automate Zoom meeting creation, management, recordings, webinars, and participant tracking via Rube MCP (Composio). Always search tools first for current schemas.
zoho-crm-automation
Automate Zoho CRM tasks via Rube MCP (Composio): create/update records, search contacts, manage leads, and convert leads. Always search tools first for current schemas.
ziliu-publisher
字流(Ziliu) - AI驱动的多平台内容分发工具。用于一次创作、智能适配排版、一键分发到16+平台(公众号/知乎/小红书/B站/抖音/微博/X等)。当用户需要多平台发布、内容排版、格式适配时使用。触发词:字流、ziliu、多平台发布、一键分发、内容分发、排版发布。
zhihu-post-skill
> 知乎文章发布——知乎平台内容创作与发布自动化
zendesk-automation
Automate Zendesk tasks via Rube MCP (Composio): tickets, users, organizations, replies. Always search tools first for current schemas.
youtube-knowledge-extractor
This skill performs deep analysis of YouTube videos through **both information channels** Multimodal YouTube video analysis through both audio (transcript) and visual (frame extraction + image analysis) channels. Especially powerful for HowTo videos, tutorials, demos, and explainer videos where what is SHOWN (screenshots, UI demos, diagrams, code, physical actions) is just as important as what is SAID. Use this skill whenever a user wants to analyze, summarize, or create step-by-step guides from YouTube videos, or when they share a YouTube URL and want to understand what happens in the video. Triggers on requests like "Analyze this YouTube video", "Create a step-by-step guide from this video", "What does this video show?", "Summarize this tutorial", or any YouTube URL shared with analysis intent.
youtube-factory
Generate complete YouTube videos from a single prompt - script, voiceover, stock footage, captions, thumbnail. Self-contained, no external modules. 100% free tools.
youtube-automation
Automate YouTube tasks via Rube MCP (Composio): upload videos, manage playlists, search content, get analytics, and handle comments. Always search tools first for current schemas.
xlsx
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, .tsv, etc) for: (1) Creating new spreadsheets with formulas and formatting, (2) Reading or analyzing data, (3) Modify existing spreadsheets while preserving formulas, (4) Data analysis and visualization in spreadsheets, or (5) Recalculating formulas