transcriber

Fetch and store transcripts from YouTube videos for deep content analysis.

7 stars

Best use case

transcriber is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using transcriber should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/transcriber/SKILL.md --create-dirs "https://raw.githubusercontent.com/codata/croissant-toolkit/main/.gemini/skills/transcriber/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/transcriber/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How transcriber Compares

| Feature / Agent | transcriber | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Fetch and store transcripts from YouTube videos for deep content analysis.

Where can I find the source code?

The source code is on GitHub in the codata/croissant-toolkit repository, under .gemini/skills/transcriber/.

SKILL.md Source

# Transcriber Skill

The Transcriber skill allows the agent to download the text transcript (closed captions) of YouTube videos. This is essential for converting video tutorials and discussions into searchable and indexable text for the Croissant metadata engine.

Transcripts are stored locally in `./data/transcripts/` as plain text files named by their YouTube Video ID.
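
The naming rule above can be sketched in Python as follows. This is an illustrative sketch, not the skill's own code: the helper names `extract_video_id` and `transcript_path` are hypothetical, and the `.txt` suffix is an assumption, since the description only says the files are plain text named by video ID.

```python
from pathlib import Path
from urllib.parse import urlparse, parse_qs

# Directory named in the skill description.
TRANSCRIPT_DIR = Path("./data/transcripts")


def extract_video_id(url_or_id: str) -> str:
    """Return the video ID from a watch URL, a youtu.be link, or a bare ID."""
    parsed = urlparse(url_or_id)
    if not parsed.scheme:
        # No URL scheme: treat the input as a bare video ID.
        return url_or_id
    if (parsed.hostname or "") == "youtu.be":
        # Short links carry the ID as the path.
        return parsed.path.lstrip("/")
    query = parse_qs(parsed.query)
    if "v" in query:
        # Standard watch URLs carry the ID in the "v" query parameter.
        return query["v"][0]
    raise ValueError(f"could not find a video ID in {url_or_id!r}")


def transcript_path(url_or_id: str, suffix: str = ".txt") -> Path:
    """Build the expected transcript location (the suffix is an assumption)."""
    return TRANSCRIPT_DIR / f"{extract_video_id(url_or_id)}{suffix}"
```

For example, `transcript_path("https://www.youtube.com/watch?v=6cWcZ2G53gE")` points at a file named `6cWcZ2G53gE.txt` inside `./data/transcripts/`, which makes it easy to check whether a video has already been transcribed.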

## Tools

### 1. Transcribe YouTube Video
Fetches the transcript for a single video URL or ID. When run with no arguments, it batch-processes `youtube_search_results.json` instead.

**Usage:**
```bash
# Transcribe a specific video
python3 transcriber/scripts/transcribe.py <VIDEO_URL_OR_ID>

# Batch transcribe the results from the last youtuber search
python3 transcriber/scripts/transcribe.py
```
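
To run the single-video command over an explicit list of videos from Python, a thin wrapper might look like the sketch below. Only the script path and its one positional argument come from the usage above. The helper names `build_command` and `transcribe_all` are hypothetical, and the sketch assumes that the script exits with a non-zero status on failure and that it is run from the directory where `transcriber/scripts/transcribe.py` resolves.

```python
import subprocess
from typing import Iterable, List

# Script path taken from the usage section above.
SCRIPT = "transcriber/scripts/transcribe.py"


def build_command(video: str) -> List[str]:
    """Build the documented command line for one video URL or ID."""
    return ["python3", SCRIPT, video]


def transcribe_all(videos: Iterable[str]) -> List[str]:
    """Run the script once per video and return the inputs that failed.

    Assumption: a non-zero exit status means the transcript was not fetched.
    """
    failed = []
    for video in videos:
        result = subprocess.run(build_command(video))
        if result.returncode != 0:
            failed.append(video)
    return failed
```

Passing the arguments as a list avoids shell quoting problems with URLs that contain `?` or `&`.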

### 2. Fetch Video Metadata
Extracts high-fidelity metadata (Title, Description, Views, Upload Date) in Schema.org JSON-LD format.

**Usage:**
```bash
python3 transcriber/scripts/get_metadata.py <VIDEO_URL_OR_ID>
```

**Output:**
Metadata is saved to `./data/metadata/<VIDEO_ID>.json`.
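
Once a metadata file exists, it can be read back with the standard library. The sketch below assumes only what is stated above: one JSON file per video ID under `./data/metadata/`. The function name `load_metadata` is hypothetical, and the sketch deliberately makes no assumption about which keys the file contains.

```python
import json
from pathlib import Path

# Output directory named in the section above.
METADATA_DIR = Path("./data/metadata")


def load_metadata(video_id: str, base_dir: Path = METADATA_DIR) -> dict:
    """Load the saved JSON-LD metadata for one video ID."""
    path = base_dir / f"{video_id}.json"
    with path.open(encoding="utf-8") as fh:
        return json.load(fh)
```

Calling `sorted(load_metadata("<VIDEO_ID>"))` lists the top-level keys that were actually written, which is a quick way to inspect the output before relying on any particular field.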

**Example (transcribing a single video):**
`python3 transcriber/scripts/transcribe.py https://www.youtube.com/watch?v=6cWcZ2G53gE`

Related Skills

walker

7
from codata/croissant-toolkit

Deep crawl functionality that extracts and visits internal links from a webpage.

orchestrator_expert

7
from codata/croissant-toolkit

Orchestrator agent that has comprehensive knowledge and command over all available skills in this toolkit to create complex workflows.

neo4j_expert

7
from codata/croissant-toolkit

Store and query Croissant datasets in a Neo4j Graph Database for relational discovery and semantic search.

youtuber

7
from codata/croissant-toolkit

Search for videos on YouTube based on specific keywords. Get list of videos with title, description, and URL.

wizard

7
from codata/croissant-toolkit

The ultimate data integrator. Orchestrates transcription, translation, NLP analysis, and Croissant serialization into a single automated pipeline.

unf

7
from codata/croissant-toolkit

Universal Numeric Fingerprint (UNF) generator. For strings, it splits into words and sorts them alphabetically to provide order-invariant fingerprints. Supports dataframes and files too.

translator

7
from codata/croissant-toolkit

Recognize the language of input content or video scripts and translate them precisely into English using Gemini 3.

telegram_expert

7
from codata/croissant-toolkit

Send results and notifications to Telegram channels or users.

rohub

7
from codata/croissant-toolkit

Deposit research objects and add semantic annotations to the RO-Hub portal using the rohub library.

ro-crate-expert

7
from codata/croissant-toolkit

Specialized in creating RO-Crate packages from Dataverse metadata, with integrated ODRL-based DID (Decentralized Identifier) attribution and provenance via the ro-crate-py library.

📊 Presentation Expert Skill

7
from codata/croissant-toolkit

The **Presentation Expert** is responsible for transforming complex research data, metadata, and insights into high-impact presentation decks.

photograph

7
from codata/croissant-toolkit

Captures visual snapshots (screenshots) of web pages and records screen sessions (video).