videodb-skills
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Best use case
videodb-skills is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Teams using videodb-skills should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/videodb-skills/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How videodb-skills Compares
| Feature / Agent | videodb-skills | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# VideoDB Skills ## Purpose The only video skill your agent needs. Upload any video, connect real-time streams, search inside by what was said or shown, build complex editing workflows with overlays, generate AI media, add subtitles, and get instant streaming links — all via the VideoDB Python SDK. ## When to Use This Skill - User wants to upload and process videos from YouTube, URLs, or local files - User needs to search for moments by speech or visual scenes - User asks for transcription, subtitles, or subtitle styling - User wants to edit clips — trim, combine, add text/image/audio overlays - User needs AI-generated media (images, video, music, sound effects, voiceovers) - User wants to transcode, change resolution, or reframe for social platforms - User needs real-time screen or audio capture with AI transcription - User asks for playable streaming links for any video output ## Setup ### Step 1: Install the skill ```bash npx skills add video-db/skills ``` ### Step 2: Run setup ``` /videodb setup ``` The agent guides API key setup ($20 free credits, no credit card), installs the SDK, and verifies the connection. Alternatively, set the API key manually: ```bash export VIDEO_DB_API_KEY=sk-xxx ``` ### Step 3: Install the SDK ```bash pip install "videodb[capture]" python-dotenv ``` ## Capabilities | Capability | Description | | ----------- | ------------------------------------------------------------------------- | | Upload | Ingest videos from YouTube, URLs, or local files | | Search | Find moments by speech (semantic/keyword) or visual scenes | | Transcripts | Generate timestamped transcripts from any video | | Edit | Combine clips, trim, add text/image/audio overlays | | Subtitles | Auto-generate and style subtitles | | AI Generate | Create images, video, music, sound effects, and voiceovers from text | | Capture | Real-time screen and audio capture with AI transcription | | Transcode | Change resolution, quality, aspect ratio, or reframe for social platforms | | Stream | Get playable HLS links for anything you build | ## Examples **Upload and transcribe:** ``` "Upload https://www.youtube.com/watch?v=FgrO9ADPZSA and give me a transcript" ``` **Search across videos:** ``` "Search for 'product demo' in my latest video" ``` **Add subtitles:** ``` "Add subtitles with white text on black background" ``` **Multi-clip editing:** ``` "Take clips from 10s-30s and 45s-60s, add a title card, and combine them" ``` **AI media generation:** ``` "Generate background music and overlay it on my video" ``` **Real-time capture:** ``` "Capture my screen and transcribe it in real-time" ``` **Reframe for social:** ``` "Convert this to vertical for Instagram Reels" ``` ## Repository https://github.com/video-db/skills **Version:** 1.1.0 **Maintained By:** [VideoDB](https://github.com/video-db)
Related Skills
writing-skills
Use when creating, updating, or improving agent skills.
videodb
Video and audio perception, indexing, and editing. Ingest files/URLs/live streams, build visual/spoken indexes, search with timestamps, edit timelines, add overlays/subtitles, generate media, and create real-time alerts.
ui-skills
Opinionated, evolving constraints to guide agents when building interfaces
threejs-skills
Create 3D scenes, interactive experiences, and visual effects using Three.js. Use when user requests 3D graphics, WebGL experiences, 3D visualizations, animations, or interactive 3D elements.
nanobanana-ppt-skills
AI-powered PPT generation with document analysis and styled images
makepad-skills
Makepad UI development skills for Rust apps: setup, patterns, shaders, packaging, and troubleshooting.
claude-scientific-skills
Scientific research and analysis skills
aws-skills
AWS development with infrastructure automation and cloud architecture patterns
zustand-store-ts
Create Zustand stores with TypeScript, subscribeWithSelector middleware, and proper state/action separation. Use when building React state management, creating global stores, or implementing reacti...
zoom-automation
Automate Zoom meeting creation, management, recordings, webinars, and participant tracking via Rube MCP (Composio). Always search tools first for current schemas.
zoho-crm-automation
Automate Zoho CRM tasks via Rube MCP (Composio): create/update records, search contacts, manage leads, and convert leads. Always search tools first for current schemas.
zendesk-automation
Automate Zendesk tasks via Rube MCP (Composio): tickets, users, organizations, replies. Always search tools first for current schemas.