videodb-skills
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Best use case
videodb-skills is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Teams using videodb-skills should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/videodb-skills/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How videodb-skills Compares
| Feature / Agent | videodb-skills | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# VideoDB Skills ## Purpose The only video skill your agent needs. Upload any video, connect real-time streams, search inside by what was said or shown, build complex editing workflows with overlays, generate AI media, add subtitles, and get instant streaming links — all via the VideoDB Python SDK. ## When to Use This Skill - User wants to upload and process videos from YouTube, URLs, or local files - User needs to search for moments by speech or visual scenes - User asks for transcription, subtitles, or subtitle styling - User wants to edit clips — trim, combine, add text/image/audio overlays - User needs AI-generated media (images, video, music, sound effects, voiceovers) - User wants to transcode, change resolution, or reframe for social platforms - User needs real-time screen or audio capture with AI transcription - User asks for playable streaming links for any video output ## Setup ### Step 1: Install the skill ```bash npx skills add video-db/skills ``` ### Step 2: Run setup ``` /videodb setup ``` The agent guides API key setup ($20 free credits, no credit card), installs the SDK, and verifies the connection. Alternatively, set the API key manually: ```bash export VIDEO_DB_API_KEY=sk-xxx ``` ### Step 3: Install the SDK ```bash pip install "videodb[capture]" python-dotenv ``` ## Capabilities | Capability | Description | | ----------- | ------------------------------------------------------------------------- | | Upload | Ingest videos from YouTube, URLs, or local files | | Search | Find moments by speech (semantic/keyword) or visual scenes | | Transcripts | Generate timestamped transcripts from any video | | Edit | Combine clips, trim, add text/image/audio overlays | | Subtitles | Auto-generate and style subtitles | | AI Generate | Create images, video, music, sound effects, and voiceovers from text | | Capture | Real-time screen and audio capture with AI transcription | | Transcode | Change resolution, quality, aspect ratio, or reframe for social platforms | | Stream | Get playable HLS links for anything you build | ## Examples **Upload and transcribe:** ``` "Upload https://www.youtube.com/watch?v=FgrO9ADPZSA and give me a transcript" ``` **Search across videos:** ``` "Search for 'product demo' in my latest video" ``` **Add subtitles:** ``` "Add subtitles with white text on black background" ``` **Multi-clip editing:** ``` "Take clips from 10s-30s and 45s-60s, add a title card, and combine them" ``` **AI media generation:** ``` "Generate background music and overlay it on my video" ``` **Real-time capture:** ``` "Capture my screen and transcribe it in real-time" ``` **Reframe for social:** ``` "Convert this to vertical for Instagram Reels" ``` ## Repository https://github.com/video-db/skills **Version:** 1.1.0 **Maintained By:** [VideoDB](https://github.com/video-db)
Related Skills
ui-skills
Opinionated, evolving constraints to guide agents when building interfaces
threejs-skills
Create 3D scenes, interactive experiences, and visual effects using Three.js. Use when user requests 3D graphics, WebGL experiences, 3D visualizations, animations, or interactive 3D elements.
videodb
Video and audio perception, indexing, and editing. Ingest files/URLs/live streams, build visual/spoken indexes, search with timestamps, edit timelines, add overlays/subtitles, generate media, and create real-time alerts.
nanobanana-ppt-skills
AI-powered PPT generation with document analysis and styled images
writing-skills
Use when creating, updating, or improving agent skills.
makepad-skills
Makepad UI development skills for Rust apps: setup, patterns, shaders, packaging, and troubleshooting.
aws-skills
AWS development with infrastructure automation and cloud architecture patterns
claude-scientific-skills
Scientific research and analysis skills
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
firecrawl
Official Firecrawl CLI skill for web scraping, search, crawling, and browser automation. Returns clean LLM-optimized markdown. USE FOR: - Web search and research - Scraping pages, docs, and articles - Site mapping and bulk content extraction - Browser automation for interactive pages Must be pre-installed and authenticated. See rules/install.md for setup, rules/security.md for output handling.
super-search
Search your coding memory. Use when user asks about past work, previous sessions, how something was implemented, what they worked on before, or wants to recall information from earlier sessions.
super-save
Save important project knowledge to memory. Use when user wants to preserve architectural decisions, significant bug fixes, design patterns, or important implementation details for team reference.