YouTube Transcript Extraction

Extract high-quality transcripts from YouTube videos using multiple methods.

3,891 stars

Best use case

YouTube Transcript Extraction is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using YouTube Transcript Extraction should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/yt-transcript/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/alti-systems/yt-transcript/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/yt-transcript/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How YouTube Transcript Extraction Compares

| Feature / Agent | YouTube Transcript Extraction | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Extract high-quality transcripts from YouTube videos using multiple methods.

Where can I find the source code?

The source code is in the openclaw/skills repository on GitHub, under skills/alti-systems/yt-transcript.

SKILL.md Source

# YouTube Transcript Extraction

Extract high-quality transcripts from YouTube videos using multiple methods.

## Commands

```bash
# Extract transcript from YouTube URL or video ID
/root/clawd/yt-transcript https://youtu.be/VIDEO_ID
/root/clawd/yt-transcript VIDEO_ID
```
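
Because the command accepts either a full URL or a bare video ID, some normalization step presumably runs before a transcript is requested. The sketch below shows one way that could look. `extract_video_id` is a hypothetical helper, not part of the skill, and the 11-character ID pattern is an assumption based on how YouTube IDs currently look.

```bash
#!/usr/bin/env bash
# Hypothetical helper: reduce a YouTube URL or bare ID to the video ID.
# Assumption: IDs are 11 characters drawn from A-Z, a-z, 0-9, "_" and "-".
extract_video_id() {
  local input="$1"
  local id_re='^[A-Za-z0-9_-]{11}$'
  local url_re='(youtu\.be/|[?&]v=|/shorts/|/embed/)([A-Za-z0-9_-]{11})'

  if [[ $input =~ $id_re ]]; then
    # Already a bare ID: pass it through unchanged.
    printf '%s\n' "$input"
  elif [[ $input =~ $url_re ]]; then
    # Second capture group holds the ID that follows the URL marker.
    printf '%s\n' "${BASH_REMATCH[2]}"
  else
    echo "error: could not find a video ID in: $input" >&2
    return 1
  fi
}
```

With this helper, `extract_video_id "https://youtu.be/dQw4w9WgXcQ"` and `extract_video_id "dQw4w9WgXcQ"` would both print the same ID, and unrecognized input returns a non-zero status.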

## Features

- **Dual fallback system**: Tries Supadata API first, falls back to yt-dlp
- **Auto-generated transcripts**: Works even if no manual captions exist
- **Clean output**: Returns plain text transcript ready for analysis
- **Fast**: API method completes in seconds
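
The dual fallback listed above can be sketched as a loop over fetcher functions that stops at the first one returning non-empty output. All names here (`get_transcript`, `fetch_via_supadata`, `fetch_via_ytdlp`) are hypothetical, and both fetchers are placeholders: the skill's real Supadata request and yt-dlp invocation are not shown in this document.

```bash
#!/usr/bin/env bash
# Placeholder for the primary method. A real version would call the
# Supadata API; this one only checks that a key is configured.
fetch_via_supadata() {
  [[ -n ${SUPADATA_API_KEY:-} ]] || return 1
  echo "transcript for $1 (from API placeholder)"
}

# Placeholder for the fallback method. A real version would run yt-dlp.
fetch_via_ytdlp() {
  echo "transcript for $1 (from yt-dlp placeholder)"
}

# Try each fetcher in order; print the first non-empty result.
# Usage: get_transcript VIDEO_ID fetcher1 [fetcher2 ...]
get_transcript() {
  local video_id="$1" fetcher output
  shift
  for fetcher in "$@"; do
    if output="$("$fetcher" "$video_id")" && [[ -n $output ]]; then
      printf '%s\n' "$output"
      return 0
    fi
    echo "warn: $fetcher failed for $video_id, trying next method" >&2
  done
  echo "error: all methods failed for $video_id" >&2
  return 1
}
```

Calling `get_transcript VIDEO_ID fetch_via_supadata fetch_via_ytdlp` tries the API first and falls through to yt-dlp only on failure. Passing the fetchers as arguments keeps the ordering explicit and makes each method easy to stub out.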

## Use Cases

- Summarize long videos without watching
- Extract key quotes and insights
- Content research and analysis
- Create written summaries for videos
- Extract educational content

## Technical Details

- **Primary**: Supadata API (fast, clean formatting)
- **Fallback**: yt-dlp CLI tool (comprehensive, handles edge cases)
- **Output**: Plain text transcript with timestamps removed
- **API key**: Stored in `.env` as `SUPADATA_API_KEY`
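
One way to produce the "timestamps removed" output on the yt-dlp path is to filter the downloaded WebVTT subtitle file. The sketch below assumes VTT input on stdin. `vtt_to_text` is a hypothetical helper, and the skill's actual cleanup may differ.

```bash
#!/usr/bin/env bash
# Hypothetical helper: convert WebVTT subtitles on stdin to plain text.
# Does not handle NOTE or STYLE blocks or numeric cue identifiers.
vtt_to_text() {
  sed -E \
    -e '/^WEBVTT/d' \
    -e '/^(Kind|Language):/d' \
    -e '/-->/d' \
    -e 's/<[^>]*>//g' \
    -e '/^[[:space:]]*$/d' \
  | awk '$0 != prev { print; prev = $0 }'
}
```

The `sed` stage drops the file header, cue timing lines, and inline tags such as `<c>`, then discards blank lines. The `awk` stage collapses consecutive duplicate lines, which auto-generated captions tend to produce. As a usage example, subtitles fetched with `yt-dlp --skip-download --write-auto-subs --sub-langs en --sub-format vtt URL` could be cleaned with `vtt_to_text < FILE.vtt`, where `FILE.vtt` stands for whatever subtitle file yt-dlp wrote.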

## Example Workflow

Rob sends YouTube link → Alto pulls transcript → summarizes key points → Rob decides if worth watching

Saves 10-30 minutes per video!

Related Skills

openclaw-youtube

3891
from openclaw/skills

YouTube SERP Scout for agents. Search top-ranking videos, channels, and trends for content research and competitor tracking.

Content & Documentation

youtube-search

YouTube Search API via AIsa unified endpoint. Search YouTube videos, channels, and playlists with a single AIsa API key — no Google API key or OAuth required. Use this skill when users want to search YouTube content. For other AIsa capabilities (LLM, financial data, Twitter, web search), see the aisa-core skill.

Data & Research

youtube-archiver

Archive YouTube playlists into markdown notes with metadata, transcripts, AI summaries, and tags. Use when a user asks to import/sync YouTube playlists, archive Watch Later or Liked videos, enrich YouTube notes, batch process video notes, or automate recurring YouTube-to-markdown sync jobs with cron.

youtube-digest

Understand, summarize, translate, and extract key points from YouTube videos. Use when a user provides a YouTube URL and wants: (1) a Chinese summary, (2) a transcript or subtitle extraction, (3) translation of spoken content, (4) timestamps / chapter notes, (5) visual understanding via key frames, or (6) question answering about a video. Prefer this skill for transcript-first workflows.

youtube-content-manager

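YouTube content management backend supporting AI topic generation, script writing, title optimization, SEO description generation, thumbnail copy suggestions, publishing record management, and data analytics. Integrates the SkillPay payment interface and charges 0.001 USDT per call.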

youtube-content-manager-pro

All-in-one YouTube content management tool with AI-generated topics, scripts, titles, SEO descriptions, tags, and thumbnails, plus analytics. $0.005 USDT per use.

youtube-audio-download

Download YouTube video audio and convert to MP3. Supports age-restricted videos with cookies.

banner-youtube-translate-workflow

Complete workflow: download YouTube audio, launch Doubao, play audio, capture translation. Activates when user needs full video translation.

YouTube Channel Scraper

A browser-based YouTube channel discovery and scraping tool.

youtube-anycaption-summarizer

Turn YouTube videos into dependable markdown transcripts and polished summaries — even when caption coverage is messy. This skill works with manual closed captions (CC), auto-generated subtitles, or no usable subtitles at all by using subtitle-first extraction with local Whisper fallback. Supports private/restricted videos via cookies, batch processing, transcript cleanup, language backfill, source-language or user-selected summary language, and end-to-end completion reporting. Ideal for YouTube research, technical walkthroughs, founder content, tutorials, private/internal uploads, and batch video summarization workflows.

adr-decision-extraction

Extract architectural decisions from conversations. Identifies problem-solution pairs, trade-off discussions, and explicit choices. Use when analyzing session transcripts for ADR generation.

Audio Transcription Skill

Auto-transcribe voice messages using faster-whisper (local, no API key needed).