openai-whisper

Local speech-to-text with the Whisper CLI (no API key).

40 stars

Best use case

openai-whisper is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Local speech-to-text with the Whisper CLI (no API key).

Teams using openai-whisper should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/openai-whisper/SKILL.md --create-dirs "https://raw.githubusercontent.com/lllooollpp/clawdbot-cn/main/skills/openai-whisper/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/openai-whisper/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How openai-whisper Compares

Feature / Agentopenai-whisperStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Local speech-to-text with the Whisper CLI (no API key).

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Whisper (CLI)

Use `whisper` to transcribe audio locally.

Quick start
- `whisper /path/audio.mp3 --model medium --output_format txt --output_dir .`
- `whisper /path/audio.m4a --task translate --output_format srt`

Notes
- Models download to `~/.cache/whisper` on first run.
- `--model` defaults to `turbo` on this install.
- Use smaller models for speed, larger for accuracy.

Related Skills

openai-whisper-api

40
from lllooollpp/clawdbot-cn

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

openai-image-gen

40
from lllooollpp/clawdbot-cn

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

mijia-control

40
from lllooollpp/clawdbot-cn

Control and monitor Xiaomi Mijia smart home devices. Use this skill when the user wants to: 1) Switch device status (on/off, brightness, etc.) 2) List available home devices 3) Run automation scenes 4) Check environmental statistics.

Smart Home & IoT

weather

40
from lllooollpp/clawdbot-cn

Get current weather and forecasts (no API key required).

wacli

40
from lllooollpp/clawdbot-cn

Send WhatsApp messages to other people or search/sync WhatsApp history via the wacli CLI (not for normal user chats).

voice-call

40
from lllooollpp/clawdbot-cn

Start voice calls via the Clawdbot voice-call plugin.

video-frames

40
from lllooollpp/clawdbot-cn

Extract frames or short clips from videos using ffmpeg.

```skill

40
from lllooollpp/clawdbot-cn

---

things-mac

40
from lllooollpp/clawdbot-cn

Manage Things 3 via the `things` CLI on macOS (add/update projects+todos via URL scheme; read/search/list from the local Things database). Use when a user asks Clawdbot to add a task to Things, list inbox/today/upcoming, search tasks, or inspect projects/areas/tags.

spotify-player

40
from lllooollpp/clawdbot-cn

Terminal Spotify playback/search via spogo (preferred) or spotify_player.

sonoscli

40
from lllooollpp/clawdbot-cn

Control Sonos speakers (discover/status/play/volume/group).

songsee

40
from lllooollpp/clawdbot-cn

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.