video-to-claude
Analyze videos with Gemini and get build instructions for Claude. Supports YouTube URLs, GIFs, screen recordings, and local video files. Use when user shares a video/GIF and wants to understand how to build it, clone a UI, reverse engineer a product, or extract tutorial steps. Triggers on "watch this video", "clone this", "reverse engineer", "how do I build this", "analyze this demo".
Best use case
video-to-claude is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using video-to-claude should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/video-to-claude/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
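The manual steps above can be scripted. The following is a minimal sketch, not an official installer: the function name `install_skill` and its parameters are hypothetical, and it assumes you have already downloaded SKILL.md to a local path. It takes the root directory as an argument, so it works for a project directory as well as for a home directory.

```python
import shutil
import tempfile
from pathlib import Path


def install_skill(downloaded: Path, root: Path) -> Path:
    """Copy a downloaded SKILL.md to <root>/.claude/skills/video-to-claude/SKILL.md.

    `install_skill` is a hypothetical helper written for illustration only.
    """
    target = root / ".claude" / "skills" / "video-to-claude" / "SKILL.md"
    # Create .claude/skills/video-to-claude/ if it does not exist yet.
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(downloaded, target)
    return target


if __name__ == "__main__":
    # Dry run inside a throwaway directory so nothing real is modified.
    with tempfile.TemporaryDirectory() as tmp:
        tmp_path = Path(tmp)
        fake_download = tmp_path / "SKILL.md"
        fake_download.write_text("# video-to-claude\n")  # placeholder content
        project = tmp_path / "project"
        project.mkdir()
        print(install_skill(fake_download, project).relative_to(project))
```

After the file is in place, restart the agent as described in the last step.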
How video-to-claude Compares
| Feature / Agent | video-to-claude | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Analyze videos with Gemini and get build instructions for Claude. Supports YouTube URLs, GIFs, screen recordings, and local video files. Use when user shares a video/GIF and wants to understand how to build it, clone a UI, reverse engineer a product, or extract tutorial steps. Triggers on "watch this video", "clone this", "reverse engineer", "how do I build this", "analyze this demo".
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# video-to-claude

Use Gemini's video understanding to extract build instructions from any video.

## Quick Start

```bash
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py <source> [flags]
```

## Flags

- `--micro` - Detailed micro-interaction specs (exact colors, timing, easing)
- `--pro` - Use Gemini Pro instead of Flash (4x cost, better quality)

## Examples

```bash
# YouTube tutorial
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py "https://youtube.com/watch?v=..."

# UI micro-interaction (detailed specs)
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py --micro button.gif

# Complex product demo (best quality)
python ~/.claude/skills/video-to-claude/scripts/video_to_claude.py --pro "https://youtube.com/watch?v=..."
```

## Supported Sources

- YouTube: `https://youtube.com/watch?v=...` (native, no download)
- Local files: `.mp4`, `.mov`, `.webm`, `.gif`
- Direct URLs: `video.twimg.com`, `cloudfront.net`, etc.
- Tweets: `https://x.com/user/status/...` (requires yt-dlp)

## Output

Returns JSON with:

- `product_name` - What was shown
- `description` - What it does
- `features` - List of capabilities
- `workflow` - Step-by-step flow
- `technical_notes` - Stack, APIs, libraries
- `skill_instructions` - How to build it

For `--micro` mode, also includes:

- Exact dimensions, colors (hex), typography
- Animation timing (ms), easing curves (cubic-bezier)
- State transitions and keyframes

## Environment

Requires `GEMINI_API_KEY` in environment.

## Cost

- Flash (default): ~$0.05/min of video
- Pro (`--pro`): ~$0.20/min of video
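To make the flags, cost figures, and output fields above concrete, here is a small Python sketch. It is an illustration under stated assumptions, not part of the skill: the helper names (`build_command`, `estimate_cost`, `has_api_key`, `summarize`) are hypothetical, the per-minute rates are the approximate figures listed under Cost, and the sample JSON is made-up placeholder data that only mirrors the documented key names. Whether `--micro` and `--pro` can be combined is not stated in the source.

```python
import json
import os

# Script path as shown in the Quick Start section.
SCRIPT = "~/.claude/skills/video-to-claude/scripts/video_to_claude.py"

# Approximate USD per minute of video, taken from the Cost section.
RATE_FLASH = 0.05
RATE_PRO = 0.20

# Top-level keys listed under Output.
OUTPUT_KEYS = (
    "product_name", "description", "features",
    "workflow", "technical_notes", "skill_instructions",
)


def build_command(source: str, micro: bool = False, pro: bool = False) -> list[str]:
    """Build an argv list mirroring the documented examples (flags before source)."""
    cmd = ["python", os.path.expanduser(SCRIPT)]
    if micro:
        cmd.append("--micro")
    if pro:
        cmd.append("--pro")
    cmd.append(source)
    return cmd


def estimate_cost(minutes: float, pro: bool = False) -> float:
    """Rough cost estimate in USD; the rates are approximate."""
    return minutes * (RATE_PRO if pro else RATE_FLASH)


def has_api_key() -> bool:
    """The skill requires GEMINI_API_KEY in the environment."""
    return bool(os.environ.get("GEMINI_API_KEY"))


def summarize(raw_json: str) -> dict:
    """Pick the documented keys out of a JSON result; missing keys become None."""
    data = json.loads(raw_json)
    return {key: data.get(key) for key in OUTPUT_KEYS}


if __name__ == "__main__":
    print(build_command("button.gif", micro=True))
    print(f"12 min on Pro: about ${estimate_cost(12, pro=True):.2f}")
    # Placeholder JSON for illustration only, not real tool output.
    sample = '{"product_name": "Example", "features": ["placeholder"]}'
    print(summarize(sample)["product_name"])
```

The command list can be passed to `subprocess.run` once `has_api_key()` returns true. How the script delivers its JSON (for example on standard output) is not specified above, so check that before wiring `summarize` to it.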
Related Skills
wan-t2v-video
Build WAN 2.2 Text-to-Video workflows — dual hi-lo models, lightning LoRAs, VACE modules, and KSamplerAdvanced two-pass
video-testimonial
Capture authentic customer testimonials through guided self-recording workflows, from outreach and briefing to recording and publishing. Use when: Building social proof for marketing campaigns; Creating customer success stories; Gathering testimonials for website and ads; Producing case study videos; Collecting user-generated content
video-sourcing
Run the Video Sourcing Agent with deterministic, concise chat UX for /video_sourcing using a pinned self-bootstrap runtime.
video-outreach
When the user wants to build video-first cold outreach, create personalized video at scale, implement async selling, or use AI demo generation for prospecting. Also use when the user mentions 'video outreach,' 'personalized video,' 'video prospecting,' 'Tavus,' 'Sendspark,' 'HeyGen,' 'video email,' 'async selling,' 'video demo,' or 'made this for you.' This skill covers video-first outreach systems from personalization through conversion optimization.
skill-claude-code
Use when the user asks to run Claude Code CLI (claude) for code analysis, refactoring, debugging, or automated edits, or references Claude Code workflows, sessions, permissions, or CLI flags.
ppt-to-video-workflow
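Converts PPT/slides into a video with narration and subtitles. Extracts or renders slide images from a PPTX file or slides.json, generates narration with TTS, and adds subtitles to produce the final MP4 video. Use for requests such as "turn this PPT into a video", "generate a presentation video", or "make a video with subtitles".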
omega-claude-cli
Use when the user wants to use Claude (Claude Code CLI) for analysis or brainstorming. Triggers on "use Claude", "ask Claude", "analyze with Claude", "brainstorm with Claude".
oatda-video-status
Use when the user wants to check the status of an asynchronous video generation task from OATDA. Returns the video URL when generation is complete.
developing-claude-code-plugins
Build, test, and distribute Claude Code plugins with slash commands, agents, skills, hooks, MCP servers, and LSP servers. MUST be loaded when creating, reviewing, debugging, or distributing plugins. Use PROACTIVELY when user mentions plugins, extensions, marketplaces, or wants to add custom commands/agents to Claude Code.
claude-skill-creator
Guide for creating effective Claude Code skills with proper YAML frontmatter, directory structure, and best practices. Use when creating new skills, updating existing skills, or learning about skill development.
claude-settings-audit
Analyze a repository to generate recommended Claude Code settings.json permissions. Use when setting up a new project, auditing existing settings, or determining which read-only bash commands to allow. Detects tech stack, build tools, and monorepo structure.
claude-plan
Anthropic-style professional plan management Skill: natural-language driven, parallel by default, with fully automated GitHub integration