lecture-transcript-slide-matcher

Combines YouTube lecture transcripts with PDF slides to create an interactive HTML page. Matches each slide to corresponding transcript segments, organized by key concepts. Use when users want to create synchronized lecture notes from transcript text files and slide PDFs.

by diegosouzapw

Best use case

lecture-transcript-slide-matcher is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using lecture-transcript-slide-matcher should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/lecture-transcript-slide-matcher/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/development/lecture-transcript-slide-matcher/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/lecture-transcript-slide-matcher/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How lecture-transcript-slide-matcher Compares

Feature / Agent	lecture-transcript-slide-matcher	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

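What does this skill do?

It combines YouTube lecture transcripts with PDF slides into an interactive HTML page, matching each slide to its transcript segments and organizing them by key concept.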

Where can I find the source code?

The source code is on GitHub in the diegosouzapw/awesome-omni-skill repository, under skills/development/lecture-transcript-slide-matcher.

SKILL.md Source

# Lecture Transcript and Slide Matcher

Combines YouTube lecture transcripts (txt files) with corresponding PDF slides to create an interactive HTML page with synchronized content organized by key concepts.

## Overview

This skill processes lecture materials and generates an HTML page with:
1. Left-hand table of contents (TOC) with key concepts
2. Main content area with slides and transcript segments for each concept
3. Automatic transcript cleaning (removes fillers, formats paragraphs)
4. Visual separation between sections

## Workflow

The matching process involves these steps:

1. **Convert transcript** - Standardize timestamp format in transcript
2. **Analyze content** - Extract information from transcript and PDF
3. **Create mapping** - Match concepts to slides and transcript segments
4. **Generate HTML** - Produce the final interactive page

## Step 1: Convert Transcript

Run the conversion script to standardize the transcript timestamp format:

```bash
python scripts/convert_transcript.py <transcript_input.txt> <transcript_output.txt>
```

This script:
- Reads timestamps from separate lines
- Converts them to [MM:SS] or [H:MM:SS] format
- Attaches timestamps inline with text
- Outputs a new transcript text file
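
The conversion described above can be sketched roughly as follows. This is not the code in `scripts/convert_transcript.py`; it is a minimal illustration that assumes the raw transcript puts each timestamp alone on a line (for example `0:15`) followed by its text. The names `attach_timestamps` and `format_timestamp` are hypothetical.

```python
import re

# Assumed raw format: a timestamp alone on one line, followed by its text lines.
TIMESTAMP_LINE = re.compile(r"^(?:(\d+):)?(\d{1,2}):(\d{2})$")


def format_timestamp(hours, minutes, seconds):
    """Return [MM:SS], or [H:MM:SS] when an hour component is present."""
    if hours:
        return f"[{hours}:{minutes:02d}:{seconds:02d}]"
    return f"[{minutes:02d}:{seconds:02d}]"


def attach_timestamps(raw_text):
    """Join each standalone timestamp line with the text that follows it."""
    output = []
    pending = None  # timestamp still waiting for its text
    for line in raw_text.splitlines():
        line = line.strip()
        if not line:
            continue
        match = TIMESTAMP_LINE.match(line)
        if match:
            hours, minutes, seconds = match.groups()
            pending = format_timestamp(int(hours or 0), int(minutes), int(seconds))
        elif pending is not None:
            output.append(f"{pending} {line}")
            pending = None
        elif output:
            # Continuation text without a new timestamp extends the previous line.
            output[-1] += " " + line
        else:
            output.append(line)
    return "\n".join(output)


raw = "0:15\nWelcome to today's lecture.\n1:02:30\nFinal remarks."
print(attach_timestamps(raw))
```

A line of transcript text that happens to look like a timestamp would be misread by this sketch, so real transcripts may need a stricter rule.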

## Step 2: Analyze Content

Run the analysis script to understand the lecture materials:

```bash
python scripts/analyze_content.py <transcript.txt> <slides.pdf> [output_analysis.json]
```

This script:
- Parses all transcript segments with timestamps
- Extracts text previews from each PDF slide
- Creates a mapping template
- Outputs `content_analysis.json` (or the filename given as the third argument) with all information

**What to do:**
1. Run the analysis script
2. Review the output JSON file
3. Examine transcript segments and slide previews
4. Identify the key concepts in the lecture

## Step 3: Create Mapping

Create a `mapping.json` file that connects concepts to slides and transcript segments.

**Option A: Let Claude create the mapping**

After running the analysis script, ask Claude to create the mapping by providing:
- The `content_analysis.json` file
- The original transcript file (for full text)
- Instructions on how to identify key concepts

Claude will analyze the content and create a comprehensive mapping.

**Option B: Manual creation**

Use the template in `content_analysis.json` as a starting point. See `references/mapping_schema.md` for complete documentation.

### Mapping Structure

```json
[
  {
    "title": "Key concept or insight",
    "slide_indices": [0, 1, 2],
    "transcript_segments": [
      {
        "start_time": "MM:SS or HH:MM:SS",
        "end_time": "MM:SS or HH:MM:SS",
        "text": "Full transcript text from this time range"
      }
    ]
  }
]
```

**Key points:**
- Use 0-based indexing for slides (first slide = 0)
- Timestamps must match format in transcript: `[HH:MM:SS]` or `[MM:SS]`
- Include full transcript text, not summaries
- Each TOC item represents one coherent concept
- Multiple slides and transcript segments can map to one concept

See `references/mapping_schema.md` for detailed schema documentation and examples.
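
Before generating HTML, it can help to sanity-check a mapping against the key points above. The sketch below is an assumption about what such a check might look like, not part of the skill's scripts; `validate_mapping` and `TIME_PATTERN` are hypothetical names, and the slide count is passed in by the caller.

```python
import json
import re

# Accepts MM:SS, H:MM:SS, and HH:MM:SS without brackets, as in the mapping examples.
TIME_PATTERN = re.compile(r"^(?:\d{1,2}:)?\d{1,2}:\d{2}$")


def validate_mapping(mapping, slide_count):
    """Return a list of readable problems; an empty list means no issue was found."""
    problems = []
    for i, entry in enumerate(mapping):
        label = f"entry {i} ({entry.get('title', 'untitled')!r})"
        if not entry.get("title"):
            problems.append(f"{label}: missing title")
        for index in entry.get("slide_indices", []):
            # Slides are 0-based, so valid indices run from 0 to slide_count - 1.
            if not isinstance(index, int) or not 0 <= index < slide_count:
                problems.append(f"{label}: slide index {index} outside 0..{slide_count - 1}")
        for segment in entry.get("transcript_segments", []):
            for key in ("start_time", "end_time"):
                if not TIME_PATTERN.match(str(segment.get(key, ""))):
                    problems.append(f"{label}: bad {key} {segment.get(key)!r}")
            if not segment.get("text", "").strip():
                problems.append(f"{label}: empty transcript text")
    return problems


mapping = json.loads(
    '[{"title": "Backpropagation", "slide_indices": [15], '
    '"transcript_segments": [{"start_time": "20:00", "end_time": "22:30", "text": "..."}]}]'
)
print(validate_mapping(mapping, slide_count=10))
```

With a 10-slide deck, the example reports slide index 15 as out of range.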

## Step 4: Generate HTML

Run the generation script to create the final HTML page:

```bash
python scripts/match_lecture_content.py <transcript.txt> <slides.pdf> <mapping.json> [output.html]
```

The script:
- Parses the transcript and extracts all segments
- Converts PDF pages to images (embedded as base64)
- Reads the mapping JSON
- Generates an interactive HTML page with:
  - Left panel with TOC (clickable navigation)
  - Main area with sections for each concept
  - Slides displayed as images
  - Cleaned and formatted transcript segments
  - Visual separation between sections

**Output:** `lecture_output.html` (or specified filename)
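
To illustrate the idea of a single self-contained page, here is a minimal sketch of embedding slide images as base64 data URIs and building a TOC with anchor links. It uses only the standard library and assumes the slide images are already available as PNG bytes; `image_tag` and `build_page` are hypothetical names, and the real script's markup and styling will differ.

```python
import base64
import html


def image_tag(png_bytes, alt):
    """Embed PNG bytes as a base64 data URI so the page needs no external files."""
    encoded = base64.b64encode(png_bytes).decode("ascii")
    return f'<img src="data:image/png;base64,{encoded}" alt="{html.escape(alt)}">'


def build_page(sections):
    """sections: list of dicts with 'title', 'images' (list of PNG bytes), and 'text'."""
    toc, body = [], []
    for i, section in enumerate(sections):
        anchor = f"section-{i}"
        title = html.escape(section["title"])
        # Each TOC entry links to the matching section id in the main area.
        toc.append(f'<li><a href="#{anchor}">{title}</a></li>')
        slides = "".join(
            image_tag(png, f"Slide for {section['title']}") for png in section["images"]
        )
        body.append(
            f'<section id="{anchor}"><h2>{title}</h2>{slides}'
            f"<p>{html.escape(section['text'])}</p></section><hr>"
        )
    return (
        "<!DOCTYPE html><html><body>"
        f"<nav><ul>{''.join(toc)}</ul></nav>"
        f"<main>{''.join(body)}</main>"
        "</body></html>"
    )


page = build_page([{"title": "Gradient Descent Algorithm", "images": [], "text": "..."}])
print(page)
```

Embedding images inline keeps the output to one file at the cost of a larger HTML document.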

## Transcript Format Requirements

The transcript must use timestamp markers:

```
[00:15] Welcome to today's lecture on machine learning.
[00:45] We'll start by discussing supervised learning...
[02:30] Now let's look at an example with house prices...
```

Supported timestamp formats:
- `[HH:MM:SS]` - Hours, minutes, seconds
- `[MM:SS]` - Minutes, seconds
- `[H:MM:SS]` - Single-digit hours
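
As an illustration of the three supported formats, the following sketch parses timestamped lines with one regular expression and converts each timestamp to seconds. It is an assumption about how such parsing could work, not the skill's own parser; `SEGMENT` and `parse_transcript` are hypothetical names.

```python
import re

# Matches [MM:SS], [H:MM:SS], and [HH:MM:SS] at the start of a line.
SEGMENT = re.compile(r"^\[(?:(\d{1,2}):)?(\d{1,2}):(\d{2})\]\s*(.*)$")


def parse_transcript(text):
    """Return a list of (seconds, text) tuples, one per timestamped line."""
    segments = []
    for line in text.splitlines():
        match = SEGMENT.match(line.strip())
        if not match:
            continue  # skip lines without a leading timestamp marker
        hours, minutes, seconds, body = match.groups()
        total = int(hours or 0) * 3600 + int(minutes) * 60 + int(seconds)
        segments.append((total, body))
    return segments


sample = "[00:15] Welcome to today's lecture.\n[1:02:30] Final remarks."
print(parse_transcript(sample))
```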

## Automatic Transcript Cleaning

The generation script automatically:
- Removes filler words (um, uh, like, you know, etc.)
- Removes conversational artifacts ([inaudible], [laughter], etc.)
- Condenses multiple spaces
- Breaks text into readable paragraphs (50 words per paragraph)
- Displays only start and end timestamps for continuous segments
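
The cleaning rules above might be approximated as in the sketch below. The filler and artifact lists are assumptions, not the script's actual lists. In particular, this sketch deliberately leaves "like" alone because a plain pattern cannot tell the filler from the ordinary word. `clean_text` and `to_paragraphs` are hypothetical names.

```python
import re

# Assumed filler list; "like" is omitted because it is often a real word.
FILLERS = re.compile(r"\b(?:um+|uh+|you know)\b,?\s*", re.IGNORECASE)
# Assumed set of bracketed artifacts.
ARTIFACTS = re.compile(r"\[(?:inaudible|laughter|applause|music)\]", re.IGNORECASE)


def clean_text(text):
    """Drop bracketed artifacts and fillers, then condense whitespace."""
    text = ARTIFACTS.sub(" ", text)
    text = FILLERS.sub("", text)
    return re.sub(r"\s+", " ", text).strip()


def to_paragraphs(text, words_per_paragraph=50):
    """Split text into chunks of at most words_per_paragraph words."""
    words = text.split()
    return [
        " ".join(words[i:i + words_per_paragraph])
        for i in range(0, len(words), words_per_paragraph)
    ]


print(clean_text("Um, so [laughter] this is, uh, the  model."))
```

Splitting on a fixed word count can break a paragraph mid-sentence; a sentence-aware splitter would be a reasonable refinement.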

## HTML Output Features

### Table of Contents (Left Panel)
- Clickable items for navigation
- Highlights current section on scroll
- Fixed width, scrollable
- Responsive (collapses on mobile)

### Content Area
- One section per TOC item
- Section title as header
- Slides displayed as images
- Transcript segments below slides
- Time range badges for each segment
- Visual separators between sections
- Smooth scrolling

### Styling
- Clean, professional appearance
- Blue accent colors
- Readable typography
- Shadow effects for slides
- Highlighted transcript containers

## Best Practices

### Identifying Key Concepts

**Good concept granularity:**
- "Linear Regression: Mathematical Formulation"
- "Gradient Descent Algorithm"
- "Neural Networks: Forward Propagation"

**Too broad:**
- "Machine Learning Overview" (entire lecture)

**Too narrow:**
- "Definition of Theta" (single term)

### Creating Effective Mappings

1. **One concept per TOC item**: Each entry should represent one coherent idea
2. **Logical ordering**: Follow lecture sequence
3. **Complete coverage**: Include all major concepts
4. **Accurate alignment**: Ensure slides and transcript truly correspond
5. **Full transcript text**: Don't summarize; include everything from the time range
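
Two of these guidelines, logical ordering and complete coverage, lend themselves to a quick automated review. The sketch below is one possible check under the mapping structure shown earlier; `review_mapping` and `to_seconds` are hypothetical helpers, and an unmapped slide is only a hint, since some slides may legitimately be skipped.

```python
def to_seconds(stamp):
    """Convert 'MM:SS' or 'H:MM:SS' to seconds."""
    seconds = 0
    for part in stamp.split(":"):
        seconds = seconds * 60 + int(part)
    return seconds


def review_mapping(mapping, slide_count):
    """Report slides no concept uses and concepts that start out of lecture order."""
    used = {i for entry in mapping for i in entry["slide_indices"]}
    unmapped = [i for i in range(slide_count) if i not in used]

    out_of_order = []
    previous_start = -1
    for entry in mapping:
        segments = entry["transcript_segments"]
        if not segments:
            continue
        # A concept's position in the lecture is taken from its earliest segment.
        start = min(to_seconds(s["start_time"]) for s in segments)
        if start < previous_start:
            out_of_order.append(entry["title"])
        previous_start = max(previous_start, start)
    return unmapped, out_of_order


example = [
    {"title": "Gradient Descent Algorithm", "slide_indices": [0, 2],
     "transcript_segments": [{"start_time": "05:00", "end_time": "06:00", "text": "..."}]},
    {"title": "Q&A: Common Misconceptions", "slide_indices": [],
     "transcript_segments": [{"start_time": "01:00", "end_time": "02:00", "text": "..."}]},
]
print(review_mapping(example, slide_count=4))
```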

### Handling Edge Cases

**Concept spans non-contiguous slides:**
```json
{
  "title": "Example: Housing Price Prediction",
  "slide_indices": [5, 8, 12],
  "transcript_segments": [...]
}
```

**Multiple transcript segments per concept:**
```json
{
  "title": "Backpropagation",
  "slide_indices": [15],
  "transcript_segments": [
    {"start_time": "20:00", "end_time": "22:30", "text": "..."},
    {"start_time": "23:00", "end_time": "25:45", "text": "..."}
  ]
}
```

**No slides for a concept (discussion only):**
```json
{
  "title": "Q&A: Common Misconceptions",
  "slide_indices": [],
  "transcript_segments": [...]
}
```

## Dependencies

The scripts require PyMuPDF for PDF processing:

```bash
pip install pymupdf --break-system-packages
```

Claude handles installation automatically when needed.

## Example Usage

Complete workflow example:

```bash
# Step 1: Convert the raw transcript (raw_transcript.txt is a placeholder name)
python scripts/convert_transcript.py raw_transcript.txt lecture.txt

# Step 2: Analyze
python scripts/analyze_content.py lecture.txt slides.pdf analysis.json

# Step 3: Create mapping (manually or with Claude's help)
# Edit analysis.json or create new mapping.json

# Step 4: Generate HTML
python scripts/match_lecture_content.py lecture.txt slides.pdf mapping.json output.html
```

## Reference Files

- `references/mapping_schema.md` - Complete JSON schema documentation with examples
- `references/example_mapping.json` - Sample mapping for a machine learning lecture

## Troubleshooting

**"PyMuPDF not installed"**
Run: `pip install pymupdf --break-system-packages`

**Timestamps don't match**
Ensure timestamps in mapping.json exactly match those in the transcript file.
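
One way to locate mismatches is to compare every timestamp in the mapping against the markers found in the transcript. The sketch below assumes that both `start_time` and `end_time` must appear verbatim as bracketed markers, which may be stricter than what the script requires; `missing_timestamps` is a hypothetical helper.

```python
import re

# Captures the value inside [MM:SS], [H:MM:SS], or [HH:MM:SS] markers.
STAMP = re.compile(r"\[((?:\d{1,2}:)?\d{1,2}:\d{2})\]")


def missing_timestamps(mapping, transcript_text):
    """Return (title, key, value) for mapping timestamps absent from the transcript."""
    known = set(STAMP.findall(transcript_text))
    missing = []
    for entry in mapping:
        for segment in entry["transcript_segments"]:
            for key in ("start_time", "end_time"):
                stamp = segment[key].strip("[]")  # tolerate bracketed values
                if stamp not in known:
                    missing.append((entry["title"], key, segment[key]))
    return missing


transcript = "[00:15] Welcome\n[00:45] Next topic"
mapping = [{"title": "Intro", "slide_indices": [0],
            "transcript_segments": [{"start_time": "00:15", "end_time": "0:45", "text": "..."}]}]
print(missing_timestamps(mapping, transcript))
```

Here `0:45` is flagged because the transcript writes it as `00:45`; the comparison is a plain string match.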

**Slides not displaying**
Verify slide_indices are 0-based (first slide = 0, not 1).

**Text looks messy**
The cleaning is automatic. If issues persist, check for unusual formatting in the transcript.

**Missing concepts**
Review the analysis output to ensure all relevant transcript segments and slides are covered.
