podcast-ingest

Transcribe podcast episodes from RSS feeds using Gemini Flash

8 stars

Best use case

podcast-ingest is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using podcast-ingest should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/podcast-ingest/SKILL.md --create-dirs "https://raw.githubusercontent.com/cdeistopened/skill-stack/main/public/skills/podcast-ingest/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/podcast-ingest/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How podcast-ingest Compares

| Feature / Agent | podcast-ingest | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Transcribe podcast episodes from RSS feeds using Gemini Flash

Where can I find the source code?

The source code is on GitHub in the cdeistopened/skill-stack repository, under public/skills/podcast-ingest/.

SKILL.md Source

# Podcast Ingest

Fetches podcast RSS feeds, downloads audio, and produces polished markdown transcripts via Gemini Flash.

## When to Use

- User wants to transcribe a podcast episode or batch of episodes
- User provides an RSS feed URL, podcast name, or direct audio URL
- Building a markdown knowledge base from podcast content

## Commands

```bash
# List episodes from a feed
bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --list

# Transcribe the latest episode
bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --limit 1

# Transcribe latest 5 episodes
bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --limit 5

# Transcribe from direct audio URL
bun run podcast-ingest/scripts/ingest.ts --audio <audio-url> --title "Episode Title"
```
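To give a feel for what listing episodes from a feed involves, here is a minimal sketch that pulls each item's title and audio enclosure URL out of RSS XML. It is not the actual `ingest.ts` implementation: the names `Episode`, `decodeEntities`, and `listEpisodes` are hypothetical, and the regex-based parsing assumes a simple, well-formed RSS 2.0 feed with double-quoted attributes. A real tool would more likely use a proper XML parser.

```typescript
// Hypothetical sketch, NOT the actual ingest.ts code.
interface Episode {
  title: string;
  audioUrl: string;
}

// Decode the few XML entities that commonly appear in feed titles and URLs.
function decodeEntities(text: string): string {
  return text
    .replace(/&lt;/g, "<")
    .replace(/&gt;/g, ">")
    .replace(/&quot;/g, '"')
    .replace(/&apos;/g, "'")
    .replace(/&amp;/g, "&"); // last, so "&amp;lt;" becomes "&lt;" rather than "<"
}

// Return episodes in feed order, keeping at most `limit` of them
// (roughly what a --limit style option would do).
function listEpisodes(rssXml: string, limit: number = Infinity): Episode[] {
  const episodes: Episode[] = [];
  const itemPattern = /<item\b[^>]*>([\s\S]*?)<\/item>/g;
  let match: RegExpExecArray | null;
  while (episodes.length < limit && (match = itemPattern.exec(rssXml)) !== null) {
    const item = match[1];
    // The title may or may not be wrapped in a CDATA section.
    const title = /<title>(?:<!\[CDATA\[)?([\s\S]*?)(?:\]\]>)?<\/title>/.exec(item);
    const enclosure = /<enclosure\b[^>]*\burl="([^"]+)"/.exec(item);
    if (title === null || enclosure === null) {
      continue; // skip items that carry no audio enclosure
    }
    episodes.push({
      title: decodeEntities(title[1].trim()),
      audioUrl: decodeEntities(enclosure[1]),
    });
  }
  return episodes;
}
```

Most podcast feeds list the newest episode first, so under that assumption `listEpisodes(xml, 1)` would correspond to the latest episode.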

## Requirements

- `GEMINI_API_KEY` environment variable
- Bun, since the commands above run the script with `bun run`
- Transcripts are written to `output/podcasts/` by default

## Output Format

Each episode produces a markdown file with YAML frontmatter:

```yaml
---
title: "Episode Title"
podcast: "Podcast Name"
author: "Host Name"
date: "2026-02-18"
source: "https://..."
type: "podcast-transcript"
---
```

The frontmatter is followed by a Summary, Topics, a Speakers list, and the full transcript divided into sections.
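As an illustration of how the frontmatter block above could be rendered from episode metadata, here is a small sketch. The `EpisodeMeta` interface and the `yamlString` and `buildFrontmatter` helpers are hypothetical names, not taken from the skill's source. The sketch assumes YAML 1.2, where a JSON string literal is also a valid double-quoted scalar, so `JSON.stringify` handles the quoting and escaping.

```typescript
// Hypothetical sketch, NOT the skill's actual code.
interface EpisodeMeta {
  title: string;
  podcast: string;
  author: string;
  date: string; // ISO date such as "2026-02-18"
  source: string; // URL the audio came from
}

// Quote and escape a value as a YAML double-quoted scalar.
// Assumption: YAML 1.2, which accepts JSON string syntax.
function yamlString(value: string): string {
  return JSON.stringify(value);
}

// Render the frontmatter fields in the same order as the example above.
function buildFrontmatter(meta: EpisodeMeta): string {
  return [
    "---",
    `title: ${yamlString(meta.title)}`,
    `podcast: ${yamlString(meta.podcast)}`,
    `author: ${yamlString(meta.author)}`,
    `date: ${yamlString(meta.date)}`,
    `source: ${yamlString(meta.source)}`,
    'type: "podcast-transcript"',
    "---",
  ].join("\n");
}
```

Escaping matters here because episode titles often contain quotes or colons, which would otherwise produce invalid YAML.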

## Cost

~$0.005 per hour of audio (Gemini Flash). A typical 1-hour episode costs less than a penny.
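The arithmetic can be sketched for a batch of episodes. The rate below is the approximate figure quoted above, not an official price, and `estimateCostUsd` is a hypothetical helper rather than part of the skill.

```typescript
// Approximate rate quoted above; actual pricing may differ.
const USD_PER_AUDIO_HOUR = 0.005;

// Estimate the transcription cost for a batch, given episode lengths in minutes.
function estimateCostUsd(durationsMinutes: number[]): number {
  const totalMinutes = durationsMinutes.reduce((sum, minutes) => sum + minutes, 0);
  return (totalMinutes / 60) * USD_PER_AUDIO_HOUR;
}

// Five episodes totalling 300 minutes (5 hours): 5 * 0.005 = about 0.025 USD.
const batchCostUsd = estimateCostUsd([60, 45, 90, 30, 75]);
```

At this rate, even a 100-hour back catalogue would come to roughly $0.50.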
