video-transcriber

视频转写工作流，支持B站和YouTube视频。自动判断有字幕/无字幕，有字幕则获取字幕，无字幕则下载音频+whisper转写。触发场景：(1) 用户要求总结视频内容 (2) 用户要求获取视频字幕 (3) 用户要求转写视频 (4) 处理B站/YouTube视频

33 stars

byaAAaqwq

View on GitHub Installation ↓

Best use case

video-transcriber is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using video-transcriber should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/video-transcriber/SKILL.md --create-dirs "https://raw.githubusercontent.com/aAAaqwq/AGI-Super-Team/main/skills/video-transcriber/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/video-transcriber/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How video-transcriber Compares

Feature / Agent	video-transcriber	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# 视频转写工作流

## 决策流程

```
1. 尝试获取字幕 → bilibili-youtube-watcher
      ↓
   有字幕？ ──是──→ 检查语言 ──需翻译？──是──→ LLM翻译
      ↓否                       ↓否
2. 下载音频 → yt-dlp      直接使用
      ↓
3. 判断语言 → 中文？用--language Chinese : 用--language en
      ↓
4. 转写 → whisper.cpp tiny模型
```

## 工具

### bilibili-youtube-watcher（有字幕时）
```bash
# B站视频字幕
python3 ~/.openclaw/extensions/bilibili-youtube-watcher/scripts/get_transcript.py "URL" --lang zh-CN

# YouTube字幕（尝试en/zh-CN）
python3 ~/.openclaw/extensions/bilibili-youtube-watcher/scripts/get_transcript.py "URL" --lang en
```

### yt-dlp + whisper.cpp（无字幕时）

```bash
# 1. 获取视频时长
yt-dlp --dump-json --no-check-certificate "URL" | jq -r '.duration, .title'

# 2. 下载音频
yt-dlp -f bestaudio --no-check-certificate "URL" -o audio.m4a

# 3. 转码
ffmpeg -i audio.m4a -ar 16000 -ac 1 audio.wav

# 4. 转写
# 中文视频
whisper-cpp -m ggml-tiny.bin -f audio.wav -osrt --language Chinese
# 英文视频
whisper-cpp -m ggml-tiny.bin -f audio.wav -osrt --language English
```

## 注意事项

- exec默认timeout=1800秒（30分钟），足够处理40分钟视频
- 翻译：用LLM直接处理字幕内容
- 字幕库：my.feishu.cn/docx/I9P3dGDZioSJZlxfHwwclXcAnDe

Related Skills

videocut:自进化

from aAAaqwq/AGI-Super-Team

字幕生成与烧录。火山引擎转录→词典纠错→审核→烧录。触发词：加字幕、生成字幕、字幕

video-subtitles

from aAAaqwq/AGI-Super-Team

Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.

video-script

from aAAaqwq/AGI-Super-Team

Create video scripts and publishing materials for YouTubers/UP主. Use when user wants to prepare a video, write a script (口播稿), generate video title, description, tags, or chapter timestamps. Triggers on "写视频脚本", "视频口播稿", "video script", "prepare video", "视频发布素材", or mentions creating content for YouTube/Bilibili.

video-merge-send

from aAAaqwq/AGI-Super-Team

合并多个分段视频为一个完整视频，并发送到飞书。使用ffmpeg拼接视频片段，支持淡入淡出转场。触发场景：用户需要合并视频片段、拼接分镜视频、视频合并后发飞书、把多个短视频合成一个、分段视频合并发送。配合 jimeng-storyboard skill 使用，完成数字人视频全流程。

video-marketing

from aAAaqwq/AGI-Super-Team

When the user wants to plan video marketing, create video scripts, or optimize for short-form or long-form video. Also use when the user mentions "video marketing," "video script," "short-form video," "long-form video," "TikTok script," "Reels script," "YouTube script," "video hook," or "video content strategy."

video-lyrics-subtitle

from aAAaqwq/AGI-Super-Team

Video lyrics subtitle generator — create synchronized subtitle files for music videos

video-generation

from aAAaqwq/AGI-Super-Team

Use this skill when the user requests to generate, create, or imagine videos. Supports structured prompts and reference image for guided generation.

video-content-analyzer

from aAAaqwq/AGI-Super-Team

下载视频并用AI分析内容 - 支持B站/抖音/YouTube等平台，提取语音内容并分析视频结构

video-caption-burner

from aAAaqwq/AGI-Super-Team

Burn existing subtitles or captions directly into video exports so they remain visible across platforms and players. Use when a team already has captions and needs a platform-ready, subtitle-baked video for social, ads, storefronts, or review links.

showcase-video-builder

from aAAaqwq/AGI-Super-Team

Build polished showcase and demo videos from screenshots, avatars, and text overlays using ffmpeg. Use when creating demo reels, hackathon presentations, product walkthroughs, or social media video content from static assets. Requires ffmpeg.

relay-video-gen

from aAAaqwq/AGI-Super-Team

Multi-provider video generation with async polling and automatic model fallback

lyrics-video-sync

from aAAaqwq/AGI-Super-Team

歌词-视频精准匹配引擎。将MP3中的歌词片段按时间轴精准匹配到对应视频clip，支持歌词提取、时间轴对齐、情绪映射、字幕烧录、音频分段混合。触发词：歌词匹配、lyrics sync、歌词卡点、音频对齐、字幕视频、lyrics video、歌词视频、音乐视频字幕