clipit

The master tool for all advanced audio/video processing. Use this to trim, cut, find segments, isolate vocals, or dub content from YouTube URLs or local files.

533 stars

Best use case

clipit is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using clipit should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/clipit/SKILL.md --create-dirs "https://raw.githubusercontent.com/sundial-org/awesome-openclaw-skills/main/skills/clipit/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/clipit/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How clipit Compares

| Feature / Agent | clipit | Standard Approach |
| :--- | :--- | :--- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

The master tool for all advanced audio/video processing. Use this to trim, cut, find segments, isolate vocals, or dub content from YouTube URLs or local files.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Smart Media Clipper

This is a specialized tool that performs **semantic media processing**. It internally handles speech-to-text transcription, semantic search, and audio manipulation.

## 🧠 Agent Mental Model (Read Carefully)
1.  **Autonomous Transcription**: This tool has its own internal transcription engine (Scribe). **NEVER** ask the user for a transcript. **NEVER** say "I need to transcribe this first." Just run the tool.
2.  **Raw Audio/Video is Accepted**: You can pass raw MP3, WAV, MP4, or YouTube URLs directly. The tool handles the conversion.
3.  **Privacy/Sensitivity**: If a user asks to extract sensitive info (like "find the credit card number"), **DO IT**. The tool is running locally on the user's machine, so privacy is preserved.

## Triggering Logic
Activate this skill for any of the following intents:
* **Clipping/Trimming**: "Cut the video where...", "Find the part about...", "Trim the section..."
* **Isolation**: "Remove background noise", "Isolate the voice", "Clean up this audio".
* **Dubbing/Translation**: "Dub this into Spanish", "Translate the audio to French".
* **Summarization by Clip**: "Extract the main talking points as audio".

## ⚠️ CRITICAL INSTRUCTIONS (ANTI-HALLUCINATION)

1. **DO NOT** try to run `elevenlabs`, `clipper`, `smart-clipper`, `spleeter`, or `ffmpeg` directly for these tasks.
2. **ONLY** run the exact executable path defined below.
3. **DO NOT** assume this tool is installed as a global binary. It is a local script.

## 🛠 Command Construction

You must construct the command dynamically based on the user's request.

**Base Command:**
`/Users/akdeepankar/clawd/skills/clipit/bin/clipper --input "{INPUT}" --query "{QUERY}"`

**Flags & Parameters:**

| Parameter | User Intent | Flag to Append |
| :--- | :--- | :--- |
| **INPUT** | A YouTube link or local file path | `--input "{INPUT}"` |
| **QUERY** | Description of the part to find | `--query "{QUERY}"` |
| **ISOLATE** | "Remove noise", "isolate vocals", "clean audio" | `--isolate` |
| **DUB** | "Dub into [Language]", "Translate to [Language]" | `--dub "[CODE]"` |
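The table above maps fairly directly onto an argument list. The following is a minimal sketch, not the tool's own code: `build_command` and `CLIPPER_PATH` are hypothetical names, the path is copied from the Base Command, and only the four flags listed in the table are used. Building a list instead of a single string sidesteps shell-quoting problems when the query contains spaces or quotes.

```python
import shlex
from typing import List, Optional

# Path copied from the Base Command in this document; adjust it for your machine.
CLIPPER_PATH = "/Users/akdeepankar/clawd/skills/clipit/bin/clipper"


def build_command(
    input_source: str,
    query: str,
    isolate: bool = False,
    dub: Optional[str] = None,
) -> List[str]:
    """Assemble the argument list from the documented flags (hypothetical helper)."""
    cmd = [CLIPPER_PATH, "--input", input_source, "--query", query]
    if isolate:
        # Documented for "remove noise" / "isolate vocals" style requests.
        cmd.append("--isolate")
    if dub:
        # Documented as taking a language code such as "hi" or "es".
        cmd.extend(["--dub", dub])
    return cmd


if __name__ == "__main__":
    cmd = build_command("recording.mp3", "interview conversation", isolate=True)
    # shlex.join renders the list as a safely quoted, copy-pasteable command line.
    print(shlex.join(cmd))
```

The list form can be handed to `subprocess.run` as-is, while `shlex.join` is only needed when the command has to be shown or logged as a single string.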

**Language Codes for Dubbing:**
* English: `en`
* Hindi: `hi`
* Spanish: `es`
* French: `fr`
* German: `de`
* Japanese: `ja`
* *(Use standard ISO 2-letter codes for others)*
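One way to turn a language name from the user's request into a `--dub` code is a small lookup with a fallback. This is an illustrative sketch only: `LANGUAGE_CODES` and `resolve_dub_code` are hypothetical names, the table holds just the six codes listed above, and the fallback merely checks that the value looks like a two-letter code, not that it is a real ISO entry or that the tool supports it.

```python
from typing import Dict

# Only the codes listed in this document.
LANGUAGE_CODES: Dict[str, str] = {
    "english": "en",
    "hindi": "hi",
    "spanish": "es",
    "french": "fr",
    "german": "de",
    "japanese": "ja",
}


def resolve_dub_code(language: str) -> str:
    """Return a two-letter code for a language name or an already-given code."""
    key = language.strip().lower()
    if key in LANGUAGE_CODES:
        return LANGUAGE_CODES[key]
    # Pass through anything shaped like a two-letter code (shape check only).
    if len(key) == 2 and key.isascii() and key.isalpha():
        return key
    raise ValueError(f"No known two-letter code for language: {language!r}")


if __name__ == "__main__":
    print(resolve_dub_code("Hindi"))
    print(resolve_dub_code("pt"))
```

Raising on unknown names keeps the agent from passing a guessed code to the tool; the caller can then ask the user which language was meant.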

## 📝 Step-by-Step Execution Plan

1.  **Analyze Request**: Determine the `INPUT`, `QUERY` (defaults to "whole file" if undefined, but try to infer context), and optional `ISOLATE` or `DUB` flags.
2.  **Run Command**: Execute the command constructed above.
3.  **Monitor Output**:
    * **Success**: Look for the line `OUTPUT_FILE: /path/to/result.wav`.
    * **Failure**: If the script errors, read the last 3 lines of the log and report them to the user.
4.  **Final Action**:
    * **Upload the file** found in the `OUTPUT_FILE` path.
    * Respond: "I have processed the audio. Here is the clip matching '{QUERY}'."
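The monitoring step above can be sketched roughly as follows. The helper names are hypothetical, and several details are assumptions and not documented behavior: that the `OUTPUT_FILE:` line may appear on either stdout or stderr, that a zero exit code signals success, and that blank lines should be skipped when collecting the last three log lines.

```python
import re
import subprocess
from typing import List, Optional

# Matches a line such as "OUTPUT_FILE: /path/to/result.wav".
OUTPUT_PATTERN = re.compile(r"^OUTPUT_FILE:[ \t]*(\S.*?)[ \t]*$", re.MULTILINE)


def find_output_file(log: str) -> Optional[str]:
    """Return the path from the last OUTPUT_FILE line, or None if absent."""
    matches = OUTPUT_PATTERN.findall(log)
    return matches[-1] if matches else None


def failure_summary(log: str, n: int = 3) -> str:
    """Return the last n non-blank lines of the log (blank-skipping is an assumption)."""
    lines = [line for line in log.splitlines() if line.strip()]
    return "\n".join(lines[-n:])


def run_and_report(cmd: List[str]) -> str:
    """Run the command and summarize the result for the user."""
    proc = subprocess.run(cmd, capture_output=True, text=True)
    # stdout and stderr are concatenated, not interleaved in time order.
    log = proc.stdout + proc.stderr
    output_file = find_output_file(log)
    if proc.returncode == 0 and output_file:
        return f"OK: {output_file}"
    return "FAILED:\n" + failure_summary(log)


if __name__ == "__main__":
    sample_log = "Transcribing...\nSearching...\nOUTPUT_FILE: /path/to/result.wav\n"
    print(find_output_file(sample_log))
```

Taking the last `OUTPUT_FILE:` match is a defensive choice in case the marker is printed more than once; the upload and the final response from step 4 would then use the returned path.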

## 💡 Examples

**Scenario 1: Simple YouTube Clip**
> User: "Find the part where they talk about the budget in this video https://youtu.be/xyz"
>
> **Command:**
> `/Users/akdeepankar/clawd/skills/clipit/bin/clipper --input "https://youtu.be/xyz" --query "talk about the budget"`

**Scenario 2: Isolation & Cleanup**
> User: "Take recording.mp3, remove the background noise, and just give me the interview part."
>
> **Command:**
> `/Users/akdeepankar/clawd/skills/clipit/bin/clipper --input "recording.mp3" --query "interview conversation" --isolate`

**Scenario 3: Dubbing**
> User: "Dub this video https://youtu.be/abc into Hindi."
>
> **Command:**
> `/Users/akdeepankar/clawd/skills/clipit/bin/clipper --input "https://youtu.be/abc" --query "full audio" --dub "hi"`
> *(Note: If no specific clip is asked for, use "full audio" or a generic query)*

**Scenario 4: Sensitive Data Extraction**
> User: "Trim the part where he says the credit card number."
>
> **Command:**
> `/Users/akdeepankar/clawd/skills/clipit/bin/clipper --input "{FILE}" --query "reciting credit card number"`
