audio-transcribe

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

7 stars

byDemerzels-lab

View on GitHub Installation ↓

Best use case

audio-transcribe is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Teams using audio-transcribe should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/audio-transcribe/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/aktheknight/audio-transcribe/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/audio-transcribe/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How audio-transcribe Compares

Feature / Agent	audio-transcribe	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# audio-transcribe

Auto-transcribe voice messages using faster-whisper (local, no API key needed).

## Install

```
npx clawhub@latest install audio-transcribe
```

Related Skills

gettr-transcribe-summarize

from Demerzels-lab/elsamultiskillagent

Download audio from a GETTR post (via HTML og:video), transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT), and summarize the transcript into bullet points and/or a timestamped outline. Use when given a GETTR post URL and asked to produce a transcript or summary.

transcribe

from Demerzels-lab/elsamultiskillagent

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

iyeque-audio-processing

from Demerzels-lab/elsamultiskillagent

Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).

audio-processing

from Demerzels-lab/elsamultiskillagent

Audio ingestion, analysis, transformation, and generation (Transcribe, TTS, VAD, Features).

transcribee

from Demerzels-lab/elsamultiskillagent

Transcribe YouTube videos and local audio/video files with speaker diarization. Use when user asks to transcribe a YouTube URL, podcast, video, or audio file. Outputs clean speaker-labeled transcripts ready for LLM analysis.

mlx-audio-server

from Demerzels-lab/elsamultiskillagent

A fast, accurate, and fully local OpenAI-compatible API server for speech-to-text and text-to-speech, powered by MLX on Apple Silicon and open-source models.

eachlabs-voice-audio

from Demerzels-lab/elsamultiskillagent

TTS, STT, voice conversion using ElevenLabs, Whisper, RVC.

audio-visualization

from Demerzels-lab/elsamultiskillagent

Generate audio visualization videos using each::sense AI.

voice-transcribe

from Demerzels-lab/elsamultiskillagent

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

webchat-audio-notifications

from Demerzels-lab/elsamultiskillagent

Add browser audio notifications to Moltbot/Clawdbot webchat with 5 intensity levels - from whisper to impossible-to-miss (only when tab is backgrounded).

paylock

from Demerzels-lab/elsamultiskillagent

Non-custodial SOL escrow for AI agent deals.

agent-reputation

from Demerzels-lab/elsamultiskillagent

summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.