video-caption-creation
Create optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and platform-specific Triple Word Score optimization system.
Best use case
video-caption-creation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Create optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and platform-specific Triple Word Score optimization system.
Teams using video-caption-creation should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/video-caption-creation/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How video-caption-creation Compares
| Feature / Agent | video-caption-creation | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Create optimized captions and hooks for short-form video (TikTok, Reels, Shorts) using proven hook categories and platform-specific Triple Word Score optimization system.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Video Caption Writer
## Purpose
This skill creates algorithm-optimized captions and on-screen text for short-form video content using the "Triple Word Score" system to maximize reach and engagement.
**Core Philosophy:** Content serves the topic first. When you serve the topic correctly, algorithms will serve your content to people interested in that topic.
## When to Use This Skill
Use this skill when you need to:
- Create on-screen text hooks for video clips (3-5 options per clip)
- Write captions for short-form video (TikTok, Instagram, YouTube Shorts, etc.)
- Optimize video for algorithm discovery (topic identification in first 10 seconds)
- Generate strategic hashtag sets (10-12, broad to specific)
**Do NOT use for:**
- Text-only social posts (use `social-content-creation` skill instead)
- Long-form YouTube videos (separate workflow)
- Podcast show notes (separate podcast workflow)
---
## The Triple Word Score System
Algorithms and humans need to immediately understand your content through FOUR signals:
### 1. Audio Transcript (MOST IMPORTANT)
- **What you say out loud** - algorithms auto-transcribe this
- Explicitly say topic words in first 10 seconds
- Repeat core terminology throughout
- Use natural speech patterns with topic-relevant vocabulary
### 2. On-Screen Text Reinforcement
- **Visual overlay** that reinforces spoken words
- Lead with topic words in first text overlay
- Use bold, clear terminology
- Don't compete with audio - reinforce it
### 3. Caption Copy Optimization
- **Post description** with topic-relevant keywords
- Open with topic-relevant phrases in first sentence
- Include natural keyword variations
- Provide context for algorithm understanding
### 4. Strategic Hashtag System
- **10-12 hashtags** (optimal range)
- Mix broad and specific (#Productivity + #TimeManagement)
- Include audience-specific (#Entrepreneurs)
- Span from general to niche
**When all four signals align, algorithms AND humans immediately recognize: "This is for me."**
---
## The 3-Step Workflow
### Step 1: Generate On-Screen Text Hooks
**Goal:** Create 3-5 hook options that stop the scroll in first 3 seconds
#### The McDonald's Test
> "Someone who works at McDonald's should be able to read and understand your headline instantly."
- Avoid fancy vocabulary ("optimization paradigm" vs "work smarter not harder")
- Choose accessible language over impressive terminology
- Wider net = better performance
#### Hook Categories
**1. Polarizing Statements**
- "Your [perceived weakness] is actually your superpower"
- "Stop chasing [common goal everyone pursues]"
- "[Respected institution] operates like [controversial comparison]"
**2. Counter-Intuitive Reveals**
- "My first attempt failed. Perfect."
- "The worst experiences are the best teachers"
- "Everyone has genius hiding somewhere"
**3. Direct Challenges**
- "Never give up on the outliers"
- "Don't judge people by their [surface trait]"
- "Make your audience love [topic] again"
**4. Curiosity Gaps**
- "Her boyfriend freaked out..." (for dramatic content)
- "The third tip will blow your mind"
- Use carefully - can create confusion if unclear
#### Generate 3-5 Options
For each video clip, create file: `[Clip Name]_Captions.md`
**Format:**
```markdown
# [Clip Name] - Caption Options
## Source Material
[Brief description of video content, key quote, or main message]
## On-Screen Text Hook Options (First 3 Seconds):
**Option 1:** Your biggest weakness is actually your superpower
- **Framework:** Polarizing Statement
**Option 2:** Stop chasing perfection. Try this instead.
- **Framework:** Direct Challenge
**Option 3:** The one thing experts get wrong about creativity
- **Framework:** Curiosity Gap
**Option 4:** Most advice teaches compliance. This teaches results.
- **Framework:** Contrast/Counter-Intuitive
**Option 5:** 3 reasons your "bad habit" might be genius
- **Framework:** Numbered List + Counter-Intuitive
**RECOMMENDED:** Option 1 (Polarizing) for Short-Form, Option 4 (Contrast) for LinkedIn
```
#### Quality Check for Hooks
- [ ] Would this make ME stop scrolling?
- [ ] Creates curiosity or emotion in 2 seconds?
- [ ] Can my grandmother understand instantly?
- [ ] Hints at payoff without giving it away?
- [ ] Passes McDonald's Test (accessible language)?
---
### Step 2: Create Platform-Specific Captions
**Goal:** Write optimized captions for each target platform
### Short-Form Video Platforms (Identical Treatment)
**Applies to:** YouTube Shorts, Instagram Reels, TikTok, Facebook Reels
- **Character Limit:** 2,200 (use 50-150 characters optimal)
- **Hashtags:** 3-5 (include platform tag: #Shorts for YouTube)
- **Tone:** Casual, energetic, hook in first few words
- **Example:** "Your biggest weakness might be your superpower 💡 #Productivity #Mindset #Growth"
**Key:** Hook immediately, use emojis, keep text scannable
---
### LinkedIn (Separate Strategy)
**Platform-Specific Guidelines:**
- **Character Limit:** 3,000 (use 150-300 characters optimal)
- **Hashtags:** 3-5
- **Tone:** Professional yet conversational, value-driven
- **Format:** Hook → insight → CTA (save, comment, share)
- **Example:** "Your so-called 'weakness' might be teaching you more than any course. Constraints build creativity and resourcefulness. #Productivity #Mindset #GrowthMindset"
**Note:** For LinkedIn posts, use the `social-content-creation` skill instead of this skill.
#### Platform-Specific Caption Template
```markdown
## Short-Form Video Captions
### YouTube Shorts / Instagram Reels / TikTok / Facebook Reels
Your biggest weakness is actually your superpower 💡
Traditional advice teaches compliance. This teaches results.
#Productivity #Mindset #Growth #Success
---
### LinkedIn (Use social-content-creation skill)
Your so-called "weakness" might be teaching you more than any course.
Constraints force creativity, problem-solving, and resourcefulness. Traditional advice teaches compliance. Real growth comes from embracing what makes you different.
#Productivity #Mindset #GrowthMindset
```
**Reference for detailed platform guidelines:** `references/platform-guidelines.md`
---
### Step 3: Apply Algorithm Optimization
**Goal:** Ensure content is discoverable and serves the right audience
#### Topic Identification Checklist
**First 10 Seconds (Critical):**
- [ ] Topic words spoken explicitly in audio
- [ ] On-screen text reinforces topic words
- [ ] Visual elements support topic (environment, materials, context)
- [ ] Energy/delivery matches topic expectations
**Throughout Video:**
- [ ] Core terminology repeated naturally
- [ ] Topic focus maintained (not scattered across multiple topics)
- [ ] Value delivered on topic promise
**Caption & Metadata:**
- [ ] Caption opens with topic-relevant phrase
- [ ] Natural keyword variations included
- [ ] Hashtags span broad to specific (10-12 total)
- [ ] Description provides context for algorithm
#### Strategic Hashtag System
**Formula: Broad → Mid → Specific → Niche**
**Example for productivity content:**
**Broad (1-2):** #Productivity #Success
**Mid (2-3):** #TimeManagement #PersonalDevelopment #WorkSmarter
**Specific (3-4):** #DeepWork #FocusHacks #MorningRoutine #ProductivityTips
**Niche (2-3):** #SolopreneurLife #RemoteWorkTips #DigitalNomad
**Audience (1-2):** #Entrepreneurs #Freelancers
**Platform (1):** #Shorts (for YouTube), #Reels (for Instagram)
**Total: 10-12 hashtags**
#### Common Mistakes to Avoid
**❌ Misleading Signal Trap**
- Topic says "education" but content is really entertainment
- ✅ Fix: Topic signals match actual content value
**❌ Buried Lede**
- Topic words don't appear until middle of content
- ✅ Fix: Topic identification happens in first 10 seconds
**❌ Scatter Shot**
- Trying to cover multiple topics in one piece
- ✅ Fix: Laser focus on one clear topic per piece
**❌ Brand First Approach**
- Making content about your brand/company
- ✅ Fix: Making content about the topic, featuring your expertise
---
## Output Format
### Standard Output File: `[Clip Name]_Captions.md`
```markdown
# [Clip Name] - Captions & Optimization
## Video Details
**Duration:** [XX seconds]
**Core Message:** [1-2 sentence summary]
**Topic:** [Primary topic for algorithm]
**Target Audience:** [Who this serves]
---
## On-Screen Text Options (Choose 1)
**Option 1:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
**Option 2:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
**Option 3:** [Text]
- Category: [Hook category]
- Platform fit: [Platforms]
**RECOMMENDED:** [Which option and why]
---
## Platform-Specific Captions
### TikTok
[Caption text with emojis]
[Hashtags: 3-5]
---
### Instagram Reels
[Caption text with emojis]
[Hashtags: 5-10]
---
### YouTube Shorts
[Caption text]
[Hashtags: 3-5 including #Shorts]
---
### LinkedIn
[Caption text - more professional tone]
[Hashtags: 3-5]
---
### Facebook
[Caption text with emojis]
[Hashtags: 2-5]
---
## Algorithm Optimization
**Topic Keywords (spoken in first 10 seconds):** [List keywords]
**Secondary Keywords (throughout):** [List keywords]
**Hashtag Set (10-12 total):**
- Broad: [2 tags]
- Mid: [3 tags]
- Specific: [4 tags]
- Niche: [3 tags]
**First 10 Second Audio Script:**
"[Exact words that will be spoken - must include topic keywords]"
---
## Quality Checks
- [ ] McDonald's Test passed (accessible language)
- [ ] Topic clear in first 3 seconds
- [ ] On-screen text reinforces (not competes with) audio
- [ ] Platform character limits followed
- [ ] Hashtag counts appropriate per platform
- [ ] Triple Word Score complete (audio, on-screen, caption, hashtags)
```
---
## Bundled Resources
This skill references:
### Detailed Guidelines
- `references/platform-guidelines.md` - Platform-specific rules and best practices
- `references/hook-categories.md` - Detailed hook category examples and when to use each
### Style & Voice
- `voice-pirate-wires` skill - Authentic voice principles
- `anti-ai-writing` skill - Humanize captions that sound too polished
---
## Success Metrics
A successful video caption:
✅ **Stops the scroll** - Hook creates curiosity/emotion in first 3 seconds
✅ **Passes McDonald's Test** - Accessible to broad audience
✅ **Triple Word Score complete** - Audio, on-screen, caption, hashtags all aligned
✅ **Topic-first** - Algorithm knows what this is about immediately
✅ **Platform-optimized** - Character limits, hashtag counts, voice appropriate
✅ **Discoverable** - Strategic hashtags span broad to niche
---
## Related Skills
- `social-content-creation` - For text-only social posts (LinkedIn, X/Twitter, Facebook)
- `youtube-title-creator` - Pair captions with optimized video titles
- `hook-and-headline-writing` - Foundational hook strategies
---
## Common Pitfalls to Avoid
**Content Issues:**
❌ Jargon or fancy vocabulary in hooks
❌ Giving away the payoff in the hook
❌ On-screen text competes with (instead of reinforces) audio
❌ Generic hooks everyone uses
**Algorithm Issues:**
❌ Topic words not spoken in first 10 seconds
❌ Too many topics in one video
❌ Hashtags all broad or all niche (not spanning range)
❌ Misleading hooks that don't match content
**Platform Issues:**
❌ Same caption copy across all platforms
❌ Too many hashtags (LinkedIn, X) or too few (Instagram)
❌ Wrong tone for platform (too casual for LinkedIn, too formal for TikTok)
❌ External links where they hurt reach (Facebook main post)
---
## Version History
- **v1.0** (2025-10-27): Initial skill creation
- Triple Word Score system
- 4 hook categories
- Platform-specific guidelines for 6 platforms
- Algorithm optimization checklist
---
*For platform updates or new hook categories, add to references folder and note in version history*Related Skills
x-launch-video-structure
When the user is planning, scripting, or editing a product launch video for X (Twitter) and needs the structure. Also use when the user mentions "launch video", "demo video", "product launch on X", "60 second demo", "how to structure a launch", or "my launch video isn't working". Produces a beat-by-beat timing sheet, not copy.
video-generator
Generate AI videos using Google VEO 3.1 or OpenAI Sora. Two providers for different strengths - VEO for native audio, Sora for visual quality and longer clips.
social-content-creation
Transform newsletters, articles, and podcasts into platform-optimized social posts using 180+ proven templates. This skill uses the framework fitting method to extract standalone concepts, match them to proven formats, generate volume, and multiply winners. Use for repurposing long-form content to LinkedIn, Twitter/X, Facebook, and Instagram.
remotion-video
Create programmatic videos using Remotion (React). This skill provides visual style guidelines, animation patterns, and workflow for creating explainer videos with a risograph aesthetic. Use when creating short-form video content for Skill Stack.
x-viral-template-miner
When the user wants to find proven-to-travel post templates in their niche and adapt them to their own product. Also use when the user mentions "what's going viral in my space", "what are competitors posting", "copy a viral post", "trending on X", "post ideas", "template mining", or "what to post this week". This is trend hunting, not plagiarism — the output is a template the user fills with their own assets.
x-linkedin-content-relay
When the user has X (Twitter) content that performed well and wants to relay it to LinkedIn 1-2 weeks later with reframing. Also use when the user mentions "repost to LinkedIn", "LinkedIn version of my tweet", "X to LinkedIn", "delayed repost", "LinkedIn for non-tech audience", or "LinkedIn relay". Also use when the user's ICP is non-tech and X is secondary — LinkedIn is the primary channel and this skill produces the content.
x-account-warmup
When a user wants to grow an X (Twitter) account from zero before a product launch, or asks how to get first followers, warm up the algorithm, hit ~500-1,000 followers, or prepare an account to make a launch video land. Also use when the user mentions "new X account", "warm up my Twitter", "first 1000 followers", "building in public strategy", "X growth", or "engagement before launch".
skill-stack-thumbnails
Generate blog post thumbnails for Skill Stack using the brand aesthetic. Follows an iterative workflow - brainstorm concepts, get approval, generate with Gemini API.
youtube-ingest
Transcribe YouTube videos and playlists using Gemini Flash
web-scrape
Scrape web pages to clean markdown with optional AI summaries
voice-tyler-cowen
Write in Tyler Cowen's style - matter-of-fact, understated, treats enormous ideas as obvious observations. Read the passages. Absorb the flatness. Channel the HOW, not the content.
voice-trung-phan
Generate tweets and threads in the style of Trung Phan. Not just voice — captures his humor mechanics, format taxonomy, topic selection filter, and structural patterns. Use for trend-reactive tweets, meme commentary, and business/culture threads.