video-to-claude

Analyze videos with Gemini and get build instructions for Claude. Supports YouTube URLs, GIFs, screen recordings, and local video files. Use when user shares a video/GIF and wants to understand how to build it, clone a UI, reverse engineer a product, or extract tutorial steps. Triggers on "watch this video", "clone this", "reverse engineer", "how do I build this", "analyze this demo".

16 stars

Best use case

video-to-claude is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Analyze videos with Gemini and get build instructions for Claude. Supports YouTube URLs, GIFs, screen recordings, and local video files. Use when user shares a video/GIF and wants to understand how to build it, clone a UI, reverse engineer a product, or extract tutorial steps. Triggers on "watch this video", "clone this", "reverse engineer", "how do I build this", "analyze this demo".

Teams using video-to-claude should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/video-to-claude/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/tools/video-to-claude/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/video-to-claude/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How video-to-claude Compares

Feature / Agentvideo-to-claudeStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Analyze videos with Gemini and get build instructions for Claude. Supports YouTube URLs, GIFs, screen recordings, and local video files. Use when user shares a video/GIF and wants to understand how to build it, clone a UI, reverse engineer a product, or extract tutorial steps. Triggers on "watch this video", "clone this", "reverse engineer", "how do I build this", "analyze this demo".

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# video-to-claude

Use Gemini's video understanding to extract build instructions from any video.

## Quick Start

```bash
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py <source> [flags]
```

## Flags

- `--micro` - Detailed micro-interaction specs (exact colors, timing, easing)
- `--pro` - Use Gemini Pro instead of Flash (4x cost, better quality)

## Examples

```bash
# YouTube tutorial
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py "https://youtube.com/watch?v=..."

# UI micro-interaction (detailed specs)
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py --micro button.gif

# Complex product demo (best quality)
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py --pro "https://youtube.com/watch?v=..."
```

## Supported Sources

- YouTube: `https://youtube.com/watch?v=...` (native, no download)
- Local files: `.mp4`, `.mov`, `.webm`, `.gif`
- Direct URLs: `video.twimg.com`, `cloudfront.net`, etc.
- Tweets: `https://x.com/user/status/...` (requires yt-dlp)

## Output

Returns JSON with:
- `product_name` - What was shown
- `description` - What it does
- `features` - List of capabilities
- `workflow` - Step-by-step flow
- `technical_notes` - Stack, APIs, libraries
- `skill_instructions` - How to build it

For `--micro` mode, also includes:
- Exact dimensions, colors (hex), typography
- Animation timing (ms), easing curves (cubic-bezier)
- State transitions and keyframes

## Environment

Requires `GEMINI_API_KEY` in environment.

## Cost

- Flash (default): ~$0.05/min of video
- Pro (`--pro`): ~$0.20/min of video

Related Skills

wan-t2v-video

16
from diegosouzapw/awesome-omni-skill

Build WAN 2.2 Text-to-Video workflows — dual hi-lo models, lightning LoRAs, VACE modules, and KSamplerAdvanced two-pass

video-testimonial

16
from diegosouzapw/awesome-omni-skill

Capture authentic customer testimonials through guided self-recording workflows, from outreach and briefing to recording and publishing. Use when: Building social proof for marketing campaigns; Creating customer success stories; Gathering testimonials for website and ads; Producing case study videos; Collecting user-generated content

video-sourcing

16
from diegosouzapw/awesome-omni-skill

Run the Video Sourcing Agent with deterministic, concise chat UX for /video_sourcing using a pinned self-bootstrap runtime.

video-outreach

16
from diegosouzapw/awesome-omni-skill

When the user wants to build video-first cold outreach, create personalized video at scale, implement async selling, or use AI demo generation for prospecting. Also use when the user mentions 'video outreach,' 'personalized video,' 'video prospecting,' 'Tavus,' 'Sendspark,' 'HeyGen,' 'video email,' 'async selling,' 'video demo,' or 'made this for you.' This skill covers video-first outreach systems from personalization through conversion optimization.

skill-claude-code

16
from diegosouzapw/awesome-omni-skill

Use when the user asks to run Claude Code CLI (claude) for code analysis, refactoring, debugging, or automated edits, or references Claude Code workflows, sessions, permissions, or CLI flags.

ppt-to-video-workflow

16
from diegosouzapw/awesome-omni-skill

PPT/슬라이드를 나레이션과 자막이 포함된 영상으로 변환합니다. PPTX 파일 또는 slides.json에서 슬라이드 이미지를 추출/렌더링하고, TTS로 나레이션을 생성하며, 자막을 추가하여 최종 MP4 영상을 만듭니다. "PPT를 영상으로 만들어줘", "발표 영상 생성", "자막 포함 영상 만들기" 요청 시 사용합니다.

omega-claude-cli

16
from diegosouzapw/awesome-omni-skill

Use when the user wants to use Claude (Claude Code CLI) for analysis or brainstorming. Triggers on "use Claude", "ask Claude", "analyze with Claude", "brainstorm with Claude".

oatda-video-status

16
from diegosouzapw/awesome-omni-skill

Use when the user wants to check the status of an asynchronous video generation task from OATDA. Returns the video URL when generation is complete.

developing-claude-code-plugins

16
from diegosouzapw/awesome-omni-skill

Build, test, and distribute Claude Code plugins with slash commands, agents, skills, hooks, MCP servers, and LSP servers. MUST be loaded when creating, reviewing, debugging, or distributing plugins. Use PROACTIVELY when user mentions plugins, extensions, marketplaces, or wants to add custom commands/agents to Claude Code.

claude-skill-creator

16
from diegosouzapw/awesome-omni-skill

Guide for creating effective Claude Code skills with proper YAML frontmatter, directory structure, and best practices. Use when creating new skills, updating existing skills, or learning about skill development.

claude-settings-audit

16
from diegosouzapw/awesome-omni-skill

Analyze a repository to generate recommended Claude Code settings.json permissions. Use when setting up a new project, auditing existing settings, or determining which read-only bash commands to allow. Detects tech stack, build tools, and monorepo structure.

claude-plan

16
from diegosouzapw/awesome-omni-skill

Anthropic 風格的專業計畫管理 Skill - 自然語言驅動、預設並行、全自動 GitHub 整合