audio-processing
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
Best use case
audio-processing is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
Teams using audio-processing should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/audio-processing/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How audio-processing Compares
| Feature / Agent | audio-processing | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# audio-processing Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features). ## Install ``` npx clawhub@latest install audio-processing ```
Related Skills
iyeque-audio-processing
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
mlx-audio-server
A fast, accurate, and fully local OpenAI-compatible API server for speech-to-text and text-to-speech, powered by MLX on Apple Silicon and open-source models.
eachlabs-voice-audio
TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.
audio-visualization
Generate audio visualization videos using each::sense AI.
webchat-audio-notifications
Add browser audio notifications to Moltbot/Clawdbot webchat with 5 intensity levels - from whisper to impossible-to-miss (only when tab is backgrounded).
Context Optimizer & Task Processing Skills Package
## Overview
audio-transcribe
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
paylock
Non-custodial SOL escrow for AI agent deals.
agent-reputation
summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.
Telecom Agent Skill
Turn your AI Agent into a Telecom Operator. Bulk calling, ChatOps, and Field Monitoring.
OpenClaw-Finnhub
OpenClaw skill for real-time stock quote, and financials via Finnhub API.
```markdown
# OpenClaw-Last.fm