videodb-skills
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Best use case
videodb-skills is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Teams using videodb-skills should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/videodb-skills/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How videodb-skills Compares
| Feature / Agent | videodb-skills | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# VideoDB Skills ## Purpose The only video skill your agent needs. Upload any video, connect real-time streams, search inside by what was said or shown, build complex editing workflows with overlays, generate AI media, add subtitles, and get instant streaming links — all via the VideoDB Python SDK. ## When to Use This Skill - User wants to upload and process videos from YouTube, URLs, or local files - User needs to search for moments by speech or visual scenes - User asks for transcription, subtitles, or subtitle styling - User wants to edit clips — trim, combine, add text/image/audio overlays - User needs AI-generated media (images, video, music, sound effects, voiceovers) - User wants to transcode, change resolution, or reframe for social platforms - User needs real-time screen or audio capture with AI transcription - User asks for playable streaming links for any video output ## Setup ### Step 1: Install the skill ```bash npx skills add video-db/skills ``` ### Step 2: Run setup ``` /videodb setup ``` The agent guides API key setup ($20 free credits, no credit card), installs the SDK, and verifies the connection. Alternatively, set the API key manually: ```bash export VIDEO_DB_API_KEY=sk-xxx ``` ### Step 3: Install the SDK ```bash pip install "videodb[capture]" python-dotenv ``` ## Capabilities | Capability | Description | | ----------- | ------------------------------------------------------------------------- | | Upload | Ingest videos from YouTube, URLs, or local files | | Search | Find moments by speech (semantic/keyword) or visual scenes | | Transcripts | Generate timestamped transcripts from any video | | Edit | Combine clips, trim, add text/image/audio overlays | | Subtitles | Auto-generate and style subtitles | | AI Generate | Create images, video, music, sound effects, and voiceovers from text | | Capture | Real-time screen and audio capture with AI transcription | | Transcode | Change resolution, quality, aspect ratio, or reframe for social platforms | | Stream | Get playable HLS links for anything you build | ## Examples **Upload and transcribe:** ``` "Upload https://www.youtube.com/watch?v=FgrO9ADPZSA and give me a transcript" ``` **Search across videos:** ``` "Search for 'product demo' in my latest video" ``` **Add subtitles:** ``` "Add subtitles with white text on black background" ``` **Multi-clip editing:** ``` "Take clips from 10s-30s and 45s-60s, add a title card, and combine them" ``` **AI media generation:** ``` "Generate background music and overlay it on my video" ``` **Real-time capture:** ``` "Capture my screen and transcribe it in real-time" ``` **Reframe for social:** ``` "Convert this to vertical for Instagram Reels" ``` ## Repository https://github.com/video-db/skills **Version:** 1.1.0 **Maintained By:** [VideoDB](https://github.com/video-db) ## Limitations - Use this skill only when the task clearly matches the scope described above. - Do not treat the output as a substitute for environment-specific validation, testing, or expert review. - Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
Related Skills
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
vercel-react-native-skills
React Native and Expo best practices for building performant mobile apps. Use when building React Native components, optimizing list performance, implementing animations, or working with native modules. Triggers on tasks involving React Native, Expo, mobile performance, or native platform APIs.
manage-skills
Discover, list, create, edit, toggle, copy, move, and delete AI agent skills across 11 tools (Cursor, Claude, Agents, Windsurf, Copilot, Codex, Cline, Aider, Continue, Roo Code, Augment)
audit-skills
Expert security auditor for AI Skills and Bundles. Performs non-intrusive static analysis to identify malicious patterns, data leaks, system stability risks, and obfuscated payloads across Windows, macOS, Linux/Unix, and Mobile (Android/iOS).
writing-skills
Use when creating, updating, or improving agent skills.
videodb
Video and audio perception, indexing, and editing. Ingest files/URLs/live streams, build visual/spoken indexes, search with timestamps, edit timelines, add overlays/subtitles, generate media, and create real-time alerts.
ui-skills
Opinionated, evolving constraints to guide agents when building interfaces
threejs-skills
Create 3D scenes, interactive experiences, and visual effects using Three.js. Use when user requests 3D graphics, WebGL experiences, 3D visualizations, animations, or interactive 3D elements.
nanobanana-ppt-skills
AI-powered PPT generation with document analysis and styled images
makepad-skills
Makepad UI development skills for Rust apps: setup, patterns, shaders, packaging, and troubleshooting.
lambdatest-agent-skills
Production-grade test automation skills for 46 frameworks across E2E, unit, mobile, BDD, visual, and cloud testing in 15+ languages.
claude-scientific-skills
Scientific research and analysis skills