podcast-ingest
Transcribe podcast episodes from RSS feeds using Gemini Flash
Best use case
podcast-ingest is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Transcribe podcast episodes from RSS feeds using Gemini Flash
Teams using podcast-ingest should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/podcast-ingest/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How podcast-ingest Compares
| Feature / Agent | podcast-ingest | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Transcribe podcast episodes from RSS feeds using Gemini Flash
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Podcast Ingest Fetches podcast RSS feeds, downloads audio, and produces polished markdown transcripts via Gemini Flash. ## When to Use - User wants to transcribe a podcast episode or batch of episodes - User provides an RSS feed URL, podcast name, or direct audio URL - Building a markdown knowledge base from podcast content ## Commands ```bash # List episodes from a feed bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --list # Transcribe the latest episode bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --limit 1 # Transcribe latest 5 episodes bun run podcast-ingest/scripts/ingest.ts --feed <rss-url> --limit 5 # Transcribe from direct audio URL bun run podcast-ingest/scripts/ingest.ts --audio <audio-url> --title "Episode Title" ``` ## Requirements - `GEMINI_API_KEY` environment variable - Output goes to `output/podcasts/` by default ## Output Format Each episode produces a markdown file with YAML frontmatter: ```yaml --- title: "Episode Title" podcast: "Podcast Name" author: "Host Name" date: "2026-02-18" source: "https://..." type: "podcast-transcript" --- ``` Followed by: Summary, Topics, Speakers list, and full sectioned transcript. ## Cost ~$0.005 per hour of audio (Gemini Flash). A typical 1-hour episode costs less than a penny.
Related Skills
youtube-ingest
Transcribe YouTube videos and playlists using Gemini Flash
podcast-production
Complete workflow for producing podcast episodes from raw transcript to publishable YouTube and social media assets. Four-checkpoint system for strategic decision-making plus final polished assets.
podcast-blog-post-creator
Transform podcast episodes into compelling, SEO-optimized blog posts in a conversational voice. Creates narrative-driven explainers (~1,000 words) that weave guest expertise with real-world storytelling.
x-viral-template-miner
When the user wants to find proven-to-travel post templates in their niche and adapt them to their own product. Also use when the user mentions "what's going viral in my space", "what are competitors posting", "copy a viral post", "trending on X", "post ideas", "template mining", or "what to post this week". This is trend hunting, not plagiarism — the output is a template the user fills with their own assets.
x-linkedin-content-relay
When the user has X (Twitter) content that performed well and wants to relay it to LinkedIn 1-2 weeks later with reframing. Also use when the user mentions "repost to LinkedIn", "LinkedIn version of my tweet", "X to LinkedIn", "delayed repost", "LinkedIn for non-tech audience", or "LinkedIn relay". Also use when the user's ICP is non-tech and X is secondary — LinkedIn is the primary channel and this skill produces the content.
x-launch-video-structure
When the user is planning, scripting, or editing a product launch video for X (Twitter) and needs the structure. Also use when the user mentions "launch video", "demo video", "product launch on X", "60 second demo", "how to structure a launch", or "my launch video isn't working". Produces a beat-by-beat timing sheet, not copy.
x-account-warmup
When a user wants to grow an X (Twitter) account from zero before a product launch, or asks how to get first followers, warm up the algorithm, hit ~500-1,000 followers, or prepare an account to make a launch video land. Also use when the user mentions "new X account", "warm up my Twitter", "first 1000 followers", "building in public strategy", "X growth", or "engagement before launch".
skill-stack-thumbnails
Generate blog post thumbnails for Skill Stack using the brand aesthetic. Follows an iterative workflow - brainstorm concepts, get approval, generate with Gemini API.
web-scrape
Scrape web pages to clean markdown with optional AI summaries
voice-tyler-cowen
Write in Tyler Cowen's style - matter-of-fact, understated, treats enormous ideas as obvious observations. Read the passages. Absorb the flatness. Channel the HOW, not the content.
voice-trung-phan
Generate tweets and threads in the style of Trung Phan. Not just voice — captures his humor mechanics, format taxonomy, topic selection filter, and structural patterns. Use for trend-reactive tweets, meme commentary, and business/culture threads.
voice-levine-berry
Write in a combined Matt Levine + Wendell Berry voice. Levine's dry logic-walking and parenthetical humor for the analytical sections. Berry's meditative patience for the human ones. Read the passages. Absorb the rhythm. Channel the HOW, not the content.