audio-transcribe
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
Best use case
audio-transcribe is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
Teams using audio-transcribe should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/audio-transcribe/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How audio-transcribe Compares
| Feature / Agent | audio-transcribe | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Auto-transcribe voice messages using faster-whisper (local, no API key needed).
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# audio-transcribe Auto-transcribe voice messages using faster-whisper (local, no API key needed). ## Install ``` npx clawhub@latest install audio-transcribe ```
Related Skills
gettr-transcribe-summarize
Download audio from a GETTR post (via HTML og:video), transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT), and summarize the transcript into bullet points and/or a timestamped outline. Use when given a GETTR post URL and asked to produce a transcript or summary.
transcribe
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.
iyeque-audio-processing
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
audio-processing
Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).
transcribee
Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.
mlx-audio-server
A fast, accurate, and fully local OpenAI-compatible API server for speech-to-text and text-to-speech, powered by MLX on Apple Silicon and open-source models.
eachlabs-voice-audio
TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.
audio-visualization
Generate audio visualization videos using each::sense AI.
voice-transcribe
Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).
webchat-audio-notifications
Add browser audio notifications to Moltbot/Clawdbot webchat with 5 intensity levels - from whisper to impossible-to-miss (only when tab is backgrounded).
paylock
Non-custodial SOL escrow for AI agent deals.
agent-reputation
summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.