multimedia-to-obsidian

将任意多媒体文档导入 Obsidian 知识库。支持 PPT、PDF、DOCX、图片等格式，自动提取每一页/每一张图片，使用多模态模型理解内容，生成文字描述后存入 OB。适用于：(1) 整理培训课件 (2) 迁移笔记到 OB (3) 将图片资料转为结构化知识。

3,891 stars

byopenclaw

View on GitHub Installation ↓

Best use case

multimedia-to-obsidian is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using multimedia-to-obsidian should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/multimedia-to-obsidian/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/aidescend/multimedia-to-obsidian/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/multimedia-to-obsidian/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How multimedia-to-obsidian Compares

Feature / Agent	multimedia-to-obsidian	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agents for Marketing

Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.

AI Agents for Startups

Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.

AI Agents for Coding

Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.

SKILL.md Source

# 多媒体导入 Obsidian

将 PPT、PDF、DOCX、图片等多媒体文档导入 Obsidian，每页/每张图片自动生成文字描述。

## 功能特点

- 支持多种格式：PPT、PDF、DOCX、图片（PNG、JPG 等）
- 自动提取媒体文件
- 调用多模态模型理解内容
- 自动分类到 OB 仓库

## 使用方法

### 前提条件

1. 安装依赖：
```bash
brew install pandoc poppler
pip install python-pptx pillow
```

2. 设置环境变量（至少一个）：
```bash
# MiniMax
export MINIMAX_API_KEY="your-key"
export MINIMAX_API_HOST="https://api.minimaxi.com"

# OpenAI
export OPENAI_API_KEY="your-key"

# Anthropic
export ANTHROPIC_API_KEY="your-key"
```

### 调用脚本

```bash
python3 <skill>/scripts/media_to_obsidian.py <源文件/目录> <输出OB目录> [options]
```

**参数：**
- 源文件/目录：单个文件或包含多媒体文件的目录
- 输出目录：Obsidian 仓库路径
- 选项：
  - `--format ppt|pdf|docx|image|all` : 指定格式，默认 all
  - `--model minimax|openai|anthropic` : 指定模型，默认 minimax
  - `--category 分类名` : 指定输出分类目录

### 示例

```bash
# 导入 PPT 课件
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/培训课件.pptx \
  /path/to/Obsidian \
  --format ppt --category 培训

# 导入图片到 OB
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/images/ \
  /path/to/Obsidian \
  --format image --category 素材

# 批量导入目录下的所有文档
python3 ~/.openclaw/workspace/skills/multimedia-to-obsidian/scripts/media_to_obsidian.py \
  /path/to/documents/ \
  /path/to/Obsidian
```

## 输出结构

```
Obsidian仓库/
├── 培训/
│   └── 培训课件.md (含每页图片理解)
├── 素材/
│   └── 图片1.md
│   └── 图片2.md
└── 导入/
    └── 文档名.md
```

## 工作流程

1. **提取媒体**：从 PPT/PDF/DOCX 提取每一页为图片
2. **理解内容**：调用多模态模型理解每张图片
3. **生成描述**：将图片描述写入 Markdown
4. **分类存储**：按指定分类存入 OB

Related Skills

obsidian

3891

from openclaw/skills

Work with Obsidian vaults (plain Markdown notes) and automate via obsidian-cli. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.

Workflow & Productivity

Obsidian CLI 探索记录

3891

from openclaw/skills

Skill for the official Obsidian CLI (v1.12+). Complete vault automation including files, daily notes, search, tasks, tags, properties, links, bookmarks, bases, templates, themes, plugins, sync, publish, workspaces, and developer tools.

Workflow & Productivity

obsidian-notes

3891

from openclaw/skills

Work with Obsidian vaults (plain Markdown notes) and automate via obsidian-cli.

Workflow & Productivity

obsidian-viz

3891

from openclaw/skills

Generate Obsidian-compatible visualization files (Excalidraw / Mermaid / Canvas). Supports text descriptions and image inputs, outputs editable diagrams in Obsidian or standard formats.

Content & Documentation

conversation-recap-to-obsidian

3891

from openclaw/skills

Build high-value Obsidian daily and weekly review notes from conversation or existing markdown notes. Use this skill whenever the user asks to summarize the current chat into Obsidian, append a structured session recap, regenerate a daily summary from a full daily note, create or refresh a weekly report, merge same-topic work across multiple days, group work by project/task instead of by date, or extract structured review notes with conclusions, key points, tags, and wikilinks.

obsidian-daily

3891

from openclaw/skills

Manage Obsidian Daily Notes via obsidian-cli. Create and open daily notes, append entries (journals, logs, tasks, links), read past notes by date, and search vault content. Handles relative dates like "yesterday", "last Friday", "3 days ago". Requires obsidian-cli installed via Homebrew (Mac/Linux) or Scoop (Windows).

obsidian-canvas-creator

3891

from openclaw/skills

Create Obsidian Canvas files from text content, supporting both MindMap and freeform layouts. Use this skill when users want to visualize content as an interactive canvas, create mind maps, or organize information spatially in Obsidian format.

obsidian-fns

3891

from openclaw/skills

Control a remote Obsidian vault through Fast Note Sync. Use when reading, searching, writing, or appending notes in Obsidian from OpenClaw, especially for remote vault workflows that do not have direct filesystem access.

obsidian-organizer

3891

from openclaw/skills

Organize and standardize Obsidian vaults for reliability and long-term maintainability. Use when designing or cleaning vault folder structure, enforcing file naming conventions, migrating messy vaults, reducing duplicate/ambiguous notes, or creating repeatable audit-and-fix workflows for Obsidian notes.

obsidian-sync

3891

from openclaw/skills

Sync files between Clawdbot workspace and Obsidian. Run the sync server to enable two-way file synchronization with the OpenClaw Obsidian plugin.

ai-research-to-obsidian

3891

from openclaw/skills

用 AI 工具（豆包/Kimi/ChatGPT）搜索问题并将结果整理保存为 Obsidian 文档。触发场景：(1) 用户要求用 AI 搜索问题 (2) 用户要求浏览器搜索并保存到 Obsidian (3) 用户说"帮我查一下"并提到保存到笔记/文档/Obsidian

obsidian-cli-official

3891

from openclaw/skills

Official Obsidian CLI (v1.12+). Complete command-line interface for Obsidian notes, tasks, search, tags, properties, links, and more.