video-caption-creation

Create optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and a platform-specific Triple Word Score optimization system.

8 stars

Best use case

video-caption-creation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using video-caption-creation should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/video-caption-creation/SKILL.md --create-dirs "https://raw.githubusercontent.com/cdeistopened/skill-stack/main/.claude/skills/video-caption-creation/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/video-caption-creation/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How video-caption-creation Compares

| Feature / Agent | video-caption-creation | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

It creates optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and a platform-specific Triple Word Score optimization system.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Video Caption Writer

## Purpose

This skill creates algorithm-optimized captions and on-screen text for short-form video content using the "Triple Word Score" system to maximize reach and engagement.

**Core Philosophy:** Content serves the topic first. When you serve the topic correctly, algorithms will serve your content to people interested in that topic.

## When to Use This Skill

Use this skill when you need to:
- Create on-screen text hooks for video clips (3-5 options per clip)
- Write captions for short-form video (TikTok, Instagram, YouTube Shorts, etc.)
- Optimize video for algorithm discovery (topic identification in first 10 seconds)
- Generate strategic hashtag sets (10-12, broad to specific)

**Do NOT use for:**
- Text-only social posts (use `social-content-creation` skill instead)
- Long-form YouTube videos (separate workflow)
- Podcast show notes (separate podcast workflow)

---

## The Triple Word Score System

Algorithms and humans need to immediately understand your content through FOUR signals:

### 1. Audio Transcript (MOST IMPORTANT)
- **What you say out loud** - algorithms auto-transcribe this
- Explicitly say topic words in first 10 seconds
- Repeat core terminology throughout
- Use natural speech patterns with topic-relevant vocabulary

### 2. On-Screen Text Reinforcement
- **Visual overlay** that reinforces spoken words
- Lead with topic words in first text overlay
- Use bold, clear terminology
- Don't compete with audio - reinforce it

### 3. Caption Copy Optimization
- **Post description** with topic-relevant keywords
- Open with topic-relevant phrases in first sentence
- Include natural keyword variations
- Provide context for algorithm understanding

### 4. Strategic Hashtag System
- **10-12 hashtags** (optimal range)
- Mix broad and specific (#Productivity + #TimeManagement)
- Include audience-specific (#Entrepreneurs)
- Span from general to niche

**When all four signals align, algorithms AND humans immediately recognize: "This is for me."**
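
The alignment idea above can be sketched as a small check. This is a minimal illustration, not part of the skill: the function names and the substring-matching approach are assumptions, and substring matching can produce false positives (for example "art" inside "smart").

```python
import re


def normalize(text):
    """Lowercase the text and replace anything that is not a letter, digit, or space."""
    return re.sub(r"[^a-z0-9 ]+", " ", text.lower())


def signal_alignment(topic_words, transcript, on_screen_text, caption, hashtags):
    """Return, for each of the four signals, the topic words found in it.

    Hypothetical helper: matching is a plain substring test with spaces
    removed, so "time management" also matches the hashtag #TimeManagement.
    """
    signals = {
        "audio": normalize(transcript),
        "on_screen": normalize(on_screen_text),
        "caption": normalize(caption),
        "hashtags": " ".join(tag.lstrip("#").lower() for tag in hashtags),
    }
    report = {}
    for name, text in signals.items():
        compact = text.replace(" ", "")
        report[name] = [
            word for word in topic_words
            if word.lower().replace(" ", "") in compact
        ]
    return report
```

A topic word that is missing from any one of the four lists points to a signal that is not yet reinforcing the others.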

---

## The 3-Step Workflow

### Step 1: Generate On-Screen Text Hooks

**Goal:** Create 3-5 hook options that stop the scroll in first 3 seconds

#### The McDonald's Test
> "Someone who works at McDonald's should be able to read and understand your headline instantly."

- Avoid fancy vocabulary ("optimization paradigm" vs "work smarter not harder")
- Choose accessible language over impressive terminology
- Wider net = better performance
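
One rough way to approximate the McDonald's Test is to flag long words in a hook. The sketch below is a crude heuristic under stated assumptions: the name `fancy_words`, the 10-letter threshold, and the `allow` list for topic words are all hypothetical, and word length is only a proxy for accessibility.

```python
import re


def fancy_words(hook, max_len=10, allow=()):
    """Return words longer than max_len letters, skipping allowed topic words.

    Assumption: long words are more likely to be jargon. Topic words you
    must say out loud (such as "productivity") can be passed in `allow`.
    """
    allowed = {word.lower() for word in allow}
    words = re.findall(r"[A-Za-z']+", hook)
    return [w for w in words if len(w) > max_len and w.lower() not in allowed]
```

For example, `fancy_words("Master the optimization paradigm")` flags "optimization", while "Work smarter not harder" passes with nothing flagged.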

#### Hook Categories

**1. Polarizing Statements**
- "Your [perceived weakness] is actually your superpower"
- "Stop chasing [common goal everyone pursues]"
- "[Respected institution] operates like [controversial comparison]"

**2. Counter-Intuitive Reveals**
- "My first attempt failed. Perfect."
- "The worst experiences are the best teachers"
- "Everyone has genius hiding somewhere"

**3. Direct Challenges**
- "Never give up on the outliers"
- "Don't judge people by their [surface trait]"
- "Make your audience love [topic] again"

**4. Curiosity Gaps**
- "Her boyfriend freaked out..." (for dramatic content)
- "The third tip will blow your mind"
- Use carefully - can create confusion if unclear

#### Generate 3-5 Options

For each video clip, create file: `[Clip Name]_Captions.md`

**Format:**

```markdown
# [Clip Name] - Caption Options

## Source Material
[Brief description of video content, key quote, or main message]

## On-Screen Text Hook Options (First 3 Seconds):

**Option 1:** Your biggest weakness is actually your superpower
- **Framework:** Polarizing Statement

**Option 2:** Stop chasing perfection. Try this instead.
- **Framework:** Direct Challenge

**Option 3:** The one thing experts get wrong about creativity
- **Framework:** Curiosity Gap

**Option 4:** Most advice teaches compliance. This teaches results.
- **Framework:** Contrast/Counter-Intuitive

**Option 5:** 3 reasons your "bad habit" might be genius
- **Framework:** Numbered List + Counter-Intuitive

**RECOMMENDED:** Option 1 (Polarizing) for Short-Form, Option 4 (Contrast) for LinkedIn
```
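
If the hook file is generated by a script, the format above could be rendered roughly as follows. This is a sketch under assumptions: `render_hook_options` and `captions_path` are hypothetical names, and hooks are passed as `(text, framework)` pairs.

```python
from pathlib import Path


def render_hook_options(clip_name, source, hooks, recommended):
    """Render 3-5 (text, framework) hook pairs in the caption-options format."""
    if not 3 <= len(hooks) <= 5:
        raise ValueError("expected 3-5 hook options per clip")
    lines = [
        f"# {clip_name} - Caption Options",
        "",
        "## Source Material",
        source,
        "",
        "## On-Screen Text Hook Options (First 3 Seconds):",
        "",
    ]
    for number, (text, framework) in enumerate(hooks, start=1):
        lines += [f"**Option {number}:** {text}", f"- **Framework:** {framework}", ""]
    lines.append(f"**RECOMMENDED:** {recommended}")
    return "\n".join(lines) + "\n"


def captions_path(clip_name, folder="."):
    """Build the `[Clip Name]_Captions.md` path inside the given folder."""
    return Path(folder) / f"{clip_name}_Captions.md"
```

Writing the file is then `captions_path(name).write_text(render_hook_options(...), encoding="utf-8")`.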

#### Quality Check for Hooks
- [ ] Would this make ME stop scrolling?
- [ ] Creates curiosity or emotion in 2 seconds?
- [ ] Can my grandmother understand instantly?
- [ ] Hints at payoff without giving it away?
- [ ] Passes McDonald's Test (accessible language)?

---

### Step 2: Create Platform-Specific Captions

**Goal:** Write optimized captions for each target platform

#### Short-Form Video Platforms (Identical Treatment)

**Applies to:** YouTube Shorts, Instagram Reels, TikTok, Facebook Reels

- **Character Limit:** 2,200 (50-150 characters is optimal)
- **Hashtags:** 3-5 (include platform tag: #Shorts for YouTube)
- **Tone:** Casual, energetic, hook in first few words
- **Example:** "Your biggest weakness might be your superpower 💡 #Productivity #Mindset #Growth"

**Key:** Hook immediately, use emojis, keep text scannable

---

#### LinkedIn (Separate Strategy)

**Platform-Specific Guidelines:**

- **Character Limit:** 3,000 (150-300 characters is optimal)
- **Hashtags:** 3-5
- **Tone:** Professional yet conversational, value-driven
- **Format:** Hook → insight → CTA (save, comment, share)
- **Example:** "Your so-called 'weakness' might be teaching you more than any course. Constraints build creativity and resourcefulness. #Productivity #Mindset #GrowthMindset"

**Note:** For LinkedIn posts, use the `social-content-creation` skill instead of this skill.
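
The limits listed for the two platform groups can be checked mechanically. The sketch below encodes only the numbers stated in this section. The names `PlatformRule`, `RULES`, and `check_caption` are hypothetical, and it assumes hashtags count toward caption length, which the guidelines do not specify.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class PlatformRule:
    char_limit: int  # hard character limit
    optimal: tuple   # (min, max) recommended caption length
    hashtags: tuple  # (min, max) hashtags in the caption


RULES = {
    "short_form": PlatformRule(2200, (50, 150), (3, 5)),
    "linkedin": PlatformRule(3000, (150, 300), (3, 5)),
}


def check_caption(caption, platform):
    """Return a list of issues; an empty list means none were found."""
    rule = RULES[platform]
    length = len(caption)  # assumption: hashtags count toward the length
    tags = re.findall(r"#\w+", caption)
    issues = []
    if length > rule.char_limit:
        issues.append(f"over the {rule.char_limit}-character limit")
    low, high = rule.optimal
    if not low <= length <= high:
        issues.append(f"length {length} is outside the optimal {low}-{high}")
    tag_min, tag_max = rule.hashtags
    if not tag_min <= len(tags) <= tag_max:
        issues.append(f"{len(tags)} hashtags, expected {tag_min}-{tag_max}")
    return issues
```

Running the same caption against both rule sets shows quickly why one caption rarely fits every platform.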

#### Platform-Specific Caption Template

```markdown
## Short-Form Video Captions

### YouTube Shorts / Instagram Reels / TikTok / Facebook Reels
Your biggest weakness is actually your superpower 💡

Traditional advice teaches compliance. This teaches results.

#Productivity #Mindset #Growth #Success

---

### LinkedIn (Use social-content-creation skill)
Your so-called "weakness" might be teaching you more than any course.

Constraints force creativity, problem-solving, and resourcefulness. Traditional advice teaches compliance. Real growth comes from embracing what makes you different.

#Productivity #Mindset #GrowthMindset
```

**Reference for detailed platform guidelines:** `references/platform-guidelines.md`

---

### Step 3: Apply Algorithm Optimization

**Goal:** Ensure content is discoverable and serves the right audience

#### Topic Identification Checklist

**First 10 Seconds (Critical):**
- [ ] Topic words spoken explicitly in audio
- [ ] On-screen text reinforces topic words
- [ ] Visual elements support topic (environment, materials, context)
- [ ] Energy/delivery matches topic expectations

**Throughout Video:**
- [ ] Core terminology repeated naturally
- [ ] Topic focus maintained (not scattered across multiple topics)
- [ ] Value delivered on topic promise

**Caption & Metadata:**
- [ ] Caption opens with topic-relevant phrase
- [ ] Natural keyword variations included
- [ ] Hashtags span broad to specific (10-12 total)
- [ ] Description provides context for algorithm
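
The "First 10 Seconds" audio item can be checked against a timestamped transcript. This is a minimal sketch: the `(start_seconds, text)` segment format and the function name are assumptions, not something the skill defines.

```python
def topic_words_in_opening(segments, topic_words, window=10.0):
    """Map each topic word to whether it is spoken before `window` seconds.

    `segments` is a list of (start_seconds, text) pairs; a segment counts
    as part of the opening if it starts before the window ends.
    """
    opening = " ".join(text for start, text in segments if start < window).lower()
    return {word: word.lower() in opening for word in topic_words}
```

Any word mapped to `False` is a candidate for the "Buried Lede" mistake described later in this step.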

#### Strategic Hashtag System

**Formula: Broad → Mid → Specific → Niche**

**Example for productivity content:**

**Broad (1-2):** #Productivity #Success

**Mid (2-3):** #TimeManagement #PersonalDevelopment #WorkSmarter

**Specific (3-4):** #DeepWork #FocusHacks #MorningRoutine #ProductivityTips

**Niche (2-3):** #SolopreneurLife #RemoteWorkTips #DigitalNomad

**Audience (1-2):** #Entrepreneurs #Freelancers

**Platform (1):** #Shorts (for YouTube), #Reels (for Instagram)

**Total: 10-12 hashtags**
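
The tier maximums above add up to 15 (2 + 3 + 4 + 3 + 2 + 1), so hitting the 10-12 total means trimming. One possible approach, sketched here as an assumption rather than the skill's own method, is to take each tier's minimum first and then top up tier by tier until the cap is reached. `TIERS` and `build_hashtag_set` are hypothetical names.

```python
# (tier, min, max) taken from the formula above
TIERS = [
    ("broad", 1, 2),
    ("mid", 2, 3),
    ("specific", 3, 4),
    ("niche", 2, 3),
    ("audience", 1, 2),
    ("platform", 1, 1),
]


def build_hashtag_set(candidates, total_max=12):
    """Pick hashtags tier by tier without exceeding total_max.

    `candidates` maps a tier name to an ordered list of tags, best first.
    """
    # First pass: take each tier's minimum.
    picked = {tier: list(candidates.get(tier, [])[:low]) for tier, low, _ in TIERS}
    count = sum(len(tags) for tags in picked.values())
    # Second pass: top up toward each tier's maximum until the cap is hit.
    for tier, low, high in TIERS:
        for tag in candidates.get(tier, [])[low:high]:
            if count >= total_max:
                break
            picked[tier].append(tag)
            count += 1
    return [tag for tier, _, _ in TIERS for tag in picked[tier]]
```

With the productivity example above as input, this keeps 12 tags and drops the last candidate from the specific, niche, and audience tiers. Topping up broad tiers first is a design choice; reversing the second pass would favor niche tags instead.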

#### Common Mistakes to Avoid

**❌ Misleading Signal Trap**
- Topic says "education" but content is really entertainment
- ✅ Fix: Topic signals match actual content value

**❌ Buried Lede**
- Topic words don't appear until middle of content
- ✅ Fix: Topic identification happens in first 10 seconds

**❌ Scatter Shot**
- Trying to cover multiple topics in one piece
- ✅ Fix: Laser focus on one clear topic per piece

**❌ Brand First Approach**
- Making content about your brand/company
- ✅ Fix: Making content about the topic, featuring your expertise

---

## Output Format

### Standard Output File: `[Clip Name]_Captions.md`

```markdown
# [Clip Name] - Captions & Optimization

## Video Details
**Duration:** [XX seconds]
**Core Message:** [1-2 sentence summary]
**Topic:** [Primary topic for algorithm]
**Target Audience:** [Who this serves]

---

## On-Screen Text Options (Choose 1)

**Option 1:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]

**Option 2:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]

**Option 3:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]

**RECOMMENDED:** [Which option and why]

---

## Platform-Specific Captions

### TikTok
[Caption text with emojis]

[Hashtags: 3-5]

---

### Instagram Reels
[Caption text with emojis]

[Hashtags: 5-10]

---

### YouTube Shorts
[Caption text]

[Hashtags: 3-5 including #Shorts]

---

### LinkedIn
[Caption text - more professional tone]

[Hashtags: 3-5]

---

### Facebook
[Caption text with emojis]

[Hashtags: 2-5]

---

## Algorithm Optimization

**Topic Keywords (spoken in first 10 seconds):** [List keywords]

**Secondary Keywords (throughout):** [List keywords]

**Hashtag Set (10-12 total):**
- Broad: [2 tags]
- Mid: [3 tags]
- Specific: [4 tags]
- Niche: [3 tags]

**First 10 Second Audio Script:**
"[Exact words that will be spoken - must include topic keywords]"

---

## Quality Checks
- [ ] McDonald's Test passed (accessible language)
- [ ] Topic clear in first 3 seconds
- [ ] On-screen text reinforces (not competes with) audio
- [ ] Platform character limits followed
- [ ] Hashtag counts appropriate per platform
- [ ] Triple Word Score complete (audio, on-screen, caption, hashtags)
```

---

## Bundled Resources

This skill references:

### Detailed Guidelines
- `references/platform-guidelines.md` - Platform-specific rules and best practices
- `references/hook-categories.md` - Detailed hook category examples and when to use each

### Style & Voice
- `voice-pirate-wires` skill - Authentic voice principles
- `anti-ai-writing` skill - Humanize captions that sound too polished

---

## Success Metrics

A successful video caption:

✅ **Stops the scroll** - Hook creates curiosity/emotion in first 3 seconds
✅ **Passes McDonald's Test** - Accessible to broad audience
✅ **Triple Word Score complete** - Audio, on-screen, caption, hashtags all aligned
✅ **Topic-first** - Algorithm knows what this is about immediately
✅ **Platform-optimized** - Character limits, hashtag counts, voice appropriate
✅ **Discoverable** - Strategic hashtags span broad to niche

---

## Related Skills

- `social-content-creation` - For text-only social posts (LinkedIn, X/Twitter, Facebook)
- `youtube-title-creator` - Pair captions with optimized video titles
- `hook-and-headline-writing` - Foundational hook strategies

---

## Common Pitfalls to Avoid

**Content Issues:**
❌ Jargon or fancy vocabulary in hooks
❌ Giving away the payoff in the hook
❌ On-screen text competes with (instead of reinforces) audio
❌ Generic hooks everyone uses

**Algorithm Issues:**
❌ Topic words not spoken in first 10 seconds
❌ Too many topics in one video
❌ Hashtags all broad or all niche (not spanning range)
❌ Misleading hooks that don't match content

**Platform Issues:**
❌ Same caption copy across all platforms
❌ Too many hashtags (LinkedIn, X) or too few (Instagram)
❌ Wrong tone for platform (too casual for LinkedIn, too formal for TikTok)
❌ External links where they hurt reach (Facebook main post)

---

## Version History

- **v1.0** (2025-10-27): Initial skill creation
  - Triple Word Score system
  - 4 hook categories
  - Platform-specific guidelines for 6 platforms
  - Algorithm optimization checklist

---

*For platform updates or new hook categories, add to references folder and note in version history*

Related Skills

x-launch-video-structure

8
from cdeistopened/skill-stack

When the user is planning, scripting, or editing a product launch video for X (Twitter) and needs the structure. Also use when the user mentions "launch video", "demo video", "product launch on X", "60 second demo", "how to structure a launch", or "my launch video isn't working". Produces a beat-by-beat timing sheet, not copy.

video-generator

8
from cdeistopened/skill-stack

Generate AI videos using Google VEO 3.1 or OpenAI Sora. Two providers for different strengths - VEO for native audio, Sora for visual quality and longer clips.

social-content-creation

8
from cdeistopened/skill-stack

Transform newsletters, articles, and podcasts into platform-optimized social posts using 180+ proven templates. This skill uses the framework fitting method to extract standalone concepts, match them to proven formats, generate volume, and multiply winners. Use for repurposing long-form content to LinkedIn, Twitter/X, Facebook, and Instagram.

remotion-video

8
from cdeistopened/skill-stack

Create programmatic videos using Remotion (React). This skill provides visual style guidelines, animation patterns, and workflow for creating explainer videos with a risograph aesthetic. Use when creating short-form video content for Skill Stack.

x-viral-template-miner

8
from cdeistopened/skill-stack

When the user wants to find proven-to-travel post templates in their niche and adapt them to their own product. Also use when the user mentions "what's going viral in my space", "what are competitors posting", "copy a viral post", "trending on X", "post ideas", "template mining", or "what to post this week". This is trend hunting, not plagiarism — the output is a template the user fills with their own assets.

x-linkedin-content-relay

8
from cdeistopened/skill-stack

When the user has X (Twitter) content that performed well and wants to relay it to LinkedIn 1-2 weeks later with reframing. Also use when the user mentions "repost to LinkedIn", "LinkedIn version of my tweet", "X to LinkedIn", "delayed repost", "LinkedIn for non-tech audience", or "LinkedIn relay". Also use when the user's ICP is non-tech and X is secondary — LinkedIn is the primary channel and this skill produces the content.

x-account-warmup

8
from cdeistopened/skill-stack

When a user wants to grow an X (Twitter) account from zero before a product launch, or asks how to get first followers, warm up the algorithm, hit ~500-1,000 followers, or prepare an account to make a launch video land. Also use when the user mentions "new X account", "warm up my Twitter", "first 1000 followers", "building in public strategy", "X growth", or "engagement before launch".

skill-stack-thumbnails

8
from cdeistopened/skill-stack

Generate blog post thumbnails for Skill Stack using the brand aesthetic. Follows an iterative workflow - brainstorm concepts, get approval, generate with Gemini API.

youtube-ingest

8
from cdeistopened/skill-stack

Transcribe YouTube videos and playlists using Gemini Flash

web-scrape

8
from cdeistopened/skill-stack

Scrape web pages to clean markdown with optional AI summaries

voice-tyler-cowen

8
from cdeistopened/skill-stack

Write in Tyler Cowen's style - matter-of-fact, understated, treats enormous ideas as obvious observations. Read the passages. Absorb the flatness. Channel the HOW, not the content.

voice-trung-phan

8
from cdeistopened/skill-stack

Generate tweets and threads in the style of Trung Phan. Not just voice — captures his humor mechanics, format taxonomy, topic selection filter, and structural patterns. Use for trend-reactive tweets, meme commentary, and business/culture threads.