videodb-skills
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Best use case
videodb-skills is best used when you need a repeatable AI agent workflow instead of a one-off prompt. It is especially useful for teams working in multi. Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.
Practical example
Example input
Use the "videodb-skills" skill to help with this workflow task. Context: Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Example output
A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.
When to use this skill
- Use this skill when you want a reusable workflow rather than writing the same prompt again and again.
When not to use this skill
- Do not use this when you only need a one-off answer and do not need a reusable workflow.
- Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/videodb-skills/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How videodb-skills Compares
| Feature / Agent | videodb-skills | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agent for YouTube Script Writing
Find AI agent skills for YouTube script writing, video research, content outlining, and repeatable channel production workflows.
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
AI Agents for Startups
Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.
SKILL.md Source
# VideoDB Skills ## Purpose The only video skill your agent needs. Upload any video, connect real-time streams, search inside by what was said or shown, build complex editing workflows with overlays, generate AI media, add subtitles, and get instant streaming links — all via the VideoDB Python SDK. ## When to Use This Skill - User wants to upload and process videos from YouTube, URLs, or local files - User needs to search for moments by speech or visual scenes - User asks for transcription, subtitles, or subtitle styling - User wants to edit clips — trim, combine, add text/image/audio overlays - User needs AI-generated media (images, video, music, sound effects, voiceovers) - User wants to transcode, change resolution, or reframe for social platforms - User needs real-time screen or audio capture with AI transcription - User asks for playable streaming links for any video output ## Setup ### Step 1: Install the skill ```bash npx skills add video-db/skills ``` ### Step 2: Run setup ``` /videodb setup ``` The agent guides API key setup ($20 free credits, no credit card), installs the SDK, and verifies the connection. Alternatively, set the API key manually: ```bash export VIDEO_DB_API_KEY=sk-xxx ``` ### Step 3: Install the SDK ```bash pip install "videodb[capture]" python-dotenv ``` ## Capabilities | Capability | Description | | ----------- | ------------------------------------------------------------------------- | | Upload | Ingest videos from YouTube, URLs, or local files | | Search | Find moments by speech (semantic/keyword) or visual scenes | | Transcripts | Generate timestamped transcripts from any video | | Edit | Combine clips, trim, add text/image/audio overlays | | Subtitles | Auto-generate and style subtitles | | AI Generate | Create images, video, music, sound effects, and voiceovers from text | | Capture | Real-time screen and audio capture with AI transcription | | Transcode | Change resolution, quality, aspect ratio, or reframe for social platforms | | Stream | Get playable HLS links for anything you build | ## Examples **Upload and transcribe:** ``` "Upload https://www.youtube.com/watch?v=FgrO9ADPZSA and give me a transcript" ``` **Search across videos:** ``` "Search for 'product demo' in my latest video" ``` **Add subtitles:** ``` "Add subtitles with white text on black background" ``` **Multi-clip editing:** ``` "Take clips from 10s-30s and 45s-60s, add a title card, and combine them" ``` **AI media generation:** ``` "Generate background music and overlay it on my video" ``` **Real-time capture:** ``` "Capture my screen and transcribe it in real-time" ``` **Reframe for social:** ``` "Convert this to vertical for Instagram Reels" ``` ## Repository https://github.com/video-db/skills **Version:** 1.1.0 **Maintained By:** [VideoDB](https://github.com/video-db) ## Limitations - Use this skill only when the task clearly matches the scope described above. - Do not treat the output as a substitute for environment-specific validation, testing, or expert review. - Stop and ask for clarification if required inputs, permissions, safety boundaries, or success criteria are missing.
Related Skills
nanobanana-ppt-skills
AI-powered PPT generation with document analysis and styled images
makepad-skills
Makepad UI development skills for Rust apps: setup, patterns, shaders, packaging, and troubleshooting.
claude-scientific-skills
Scientific research and analysis skills
audit-skills
Expert security auditor for AI Skills and Bundles. Performs non-intrusive static analysis to identify malicious patterns, data leaks, system stability risks, and obfuscated payloads across Windows, macOS, Linux/Unix, and Mobile (Android/iOS).
writing-skills
Use when creating, updating, or improving agent skills.
videodb
Video and audio perception, indexing, and editing. Ingest files/URLs/live streams, build visual/spoken indexes, search with timestamps, edit timelines, add overlays/subtitles, generate media, and create real-time alerts.
ui-skills
Opinionated, evolving constraints to guide agents when building interfaces
threejs-skills
Create 3D scenes, interactive experiences, and visual effects using Three.js. Use when user requests 3D graphics, WebGL experiences, 3D visualizations, animations, or interactive 3D elements.
aws-skills
AWS development with infrastructure automation and cloud architecture patterns
nextjs-best-practices
Next.js App Router principles. Server Components, data fetching, routing patterns.
network-101
Configure and test common network services (HTTP, HTTPS, SNMP, SMB) for penetration testing lab environments. Enable hands-on practice with service enumeration, log analysis, and security testing against properly configured target systems.
neon-postgres
Expert patterns for Neon serverless Postgres, branching, connection pooling, and Prisma/Drizzle integration