openai-whisper

Local speech-to-text with the Whisper CLI (no API key).

3,891 stars

by openclaw

Best use case

openai-whisper is best used when an AI agent needs to transcribe or translate audio files locally as a repeatable workflow, rather than through a one-off prompt.

Teams using openai-whisper should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/openai-whisper-andy27725/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/andy27725/openai-whisper-andy27725/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/openai-whisper-andy27725/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill
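
To confirm that either installation route put the file where the agent looks for it, a small check like the following can help. This is a minimal sketch: `find_skill` is a hypothetical helper name, and it assumes only the two locations mentioned above (the project-local `.claude/skills/...` path and the per-user `~/.claude/skills/...` path).

```shell
# Hypothetical helper: print the path of the installed SKILL.md, if any.
# Checks the project directory (default: current directory) first,
# then the per-user location under $HOME.
find_skill() {
    rel=".claude/skills/openai-whisper-andy27725/SKILL.md"
    for base in "${1:-.}" "$HOME"; do
        # -s is true only for a file that exists and is not empty
        if [ -s "$base/$rel" ]; then
            printf '%s\n' "$base/$rel"
            return 0
        fi
    done
    echo "SKILL.md not found" >&2
    return 1
}
```

Running `find_skill` from the project root prints the first matching path, or exits non-zero if the file is missing or empty (for example after a failed download).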

How openai-whisper Compares

Feature / Agent	openai-whisper	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Local speech-to-text with the Whisper CLI (no API key).

Where can I find the source code?

The source code is in the openclaw/skills repository on GitHub, under skills/andy27725/openai-whisper-andy27725.

SKILL.md Source

# Whisper (CLI)

Use `whisper` to transcribe audio locally.

Quick start

- `whisper /path/audio.mp3 --model medium --output_format txt --output_dir .`
- `whisper /path/audio.m4a --task translate --output_format srt`
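
For more than one file, the first command above can be wrapped in a loop. The sketch below is illustrative, not part of the skill: `transcribe_dir` is a hypothetical function name, the `.mp3`/`.m4a` filter simply mirrors the two examples, and the `medium` default is taken from the quick-start command. It uses only the flags already shown (`--model`, `--output_format`, `--output_dir`).

```shell
# Hypothetical helper: transcribe every .mp3 and .m4a file in a directory.
# Usage: transcribe_dir <input_dir> [output_dir] [model]
transcribe_dir() {
    in_dir="$1"
    out_dir="${2:-.}"        # default: current directory
    model="${3:-medium}"     # default taken from the quick-start example
    mkdir -p "$out_dir"
    for f in "$in_dir"/*.mp3 "$in_dir"/*.m4a; do
        [ -e "$f" ] || continue   # skip a pattern that matched nothing
        whisper "$f" --model "$model" --output_format txt --output_dir "$out_dir"
    done
}
```

For example, `transcribe_dir ./recordings ./transcripts small` would write one `.txt` file per recording into `./transcripts`. Passing the model explicitly keeps runs reproducible across machines whose default model differs.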

Notes

- Models download to `~/.cache/whisper` on first run.
- `--model` defaults to `turbo` on this install.
- Use smaller models for speed, larger for accuracy.
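
Because the first run of each model triggers a download, it can be useful to see what is already cached before picking a model. The following is a rough sketch: `cached_models` is a hypothetical helper, and it assumes checkpoints are stored as `.pt` files directly inside the cache directory named above. The file name may not match the alias passed to `--model`.

```shell
# Hypothetical helper: list model checkpoints already present in the cache.
# Usage: cached_models [cache_dir]
cached_models() {
    cache_dir="${1:-$HOME/.cache/whisper}"
    for f in "$cache_dir"/*.pt; do      # assumption: checkpoints end in .pt
        [ -e "$f" ] || continue         # nothing cached yet
        basename "$f" .pt
    done
}
```

An empty result means no model has been downloaded yet, so the next `whisper` run will fetch one first.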
