openai-whisper

Local speech-to-text with the Whisper CLI (no API key).

533 stars

bysundial-org

View on GitHub Installation ↓

Best use case

openai-whisper is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Local speech-to-text with the Whisper CLI (no API key).

Teams using openai-whisper should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/openai-whisper/SKILL.md --create-dirs "https://raw.githubusercontent.com/sundial-org/awesome-openclaw-skills/main/skills/openai-whisper/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/openai-whisper/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How openai-whisper Compares

Feature / Agent	openai-whisper	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Local speech-to-text with the Whisper CLI (no API key).

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Whisper (CLI)

Use `whisper` to transcribe audio locally.

Quick start
- `whisper /path/audio.mp3 --model medium --output_format txt --output_dir .`
- `whisper /path/audio.m4a --task translate --output_format srt`

Notes
- Models download to `~/.cache/whisper` on first run.
- `--model` defaults to `turbo` on this install.
- Use smaller models for speed, larger for accuracy.

Related Skills

openai-whisper-api

533

from sundial-org/awesome-openclaw-skills

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

openai-tts

533

from sundial-org/awesome-openclaw-skills

Text-to-speech via OpenAI Audio Speech API.

openai-image-gen

533

from sundial-org/awesome-openclaw-skills

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

openai-docs-skill

533

from sundial-org/awesome-openclaw-skills

Query the OpenAI developer documentation via the OpenAI Docs MCP server using CLI (curl/jq). Use whenever a task involves the OpenAI API (Responses, Chat Completions, Realtime, etc.), OpenAI SDKs, ChatGPT Apps SDK, Codex, MCP integrations, endpoint schemas, parameters, limits, or migrations and you need up-to-date official guidance.

mlx-whisper

533

from sundial-org/awesome-openclaw-skills

Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).

local-whisper

533

from sundial-org/awesome-openclaw-skills

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

llmwhisperer

533

from sundial-org/awesome-openclaw-skills

Extract text and layout from images and PDFs using LLMWhisperer API. Good for handwriting and complex forms.

portfolio-watcher

533

from sundial-org/awesome-openclaw-skills

Monitor stock/crypto holdings, get price alerts, track portfolio performance

portainer

533

from sundial-org/awesome-openclaw-skills

Control Docker containers and stacks via Portainer API. List containers, start/stop/restart, view logs, and redeploy stacks from git.

portable-tools

533

from sundial-org/awesome-openclaw-skills

Build cross-device tools without hardcoding paths or account names

polymarket

533

from sundial-org/awesome-openclaw-skills

Trade prediction markets on Polymarket. Analyze odds, place bets, track positions, automate alerts, and maximize returns from event outcomes. Covers sports, politics, entertainment, and more.

polymarket-traiding-bot

533

from sundial-org/awesome-openclaw-skills

No description provided.