openai-whisper-api

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

533 stars

Best use case

openai-whisper-api is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using openai-whisper-api should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/openai-whisper-api/SKILL.md --create-dirs "https://raw.githubusercontent.com/sundial-org/awesome-openclaw-skills/main/skills/openai-whisper-api/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/openai-whisper-api/SKILL.md inside your project
  3. Restart your AI agent; it will auto-discover the skill

How openai-whisper-api Compares

| Feature / Agent | openai-whisper-api | Standard Approach |
| --- | --- | --- |
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

Where can I find the source code?

The source code is on GitHub in the sundial-org/awesome-openclaw-skills repository, under skills/openai-whisper-api.

SKILL.md Source

# OpenAI Whisper API (curl)

Transcribe an audio file via OpenAI’s `/v1/audio/transcriptions` endpoint.

## Quick start

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a
```

Defaults:
- Model: `whisper-1`
- Output: `<input>.txt`
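
To run the quick-start command over many recordings, a small wrapper can loop over a directory. This is a minimal sketch, not part of the skill: `transcribe_dir` is a hypothetical helper, and it assumes you pass the resolved path to `transcribe.sh` (wherever `{baseDir}` points on your machine) and that your recordings use the `.m4a` extension.

```bash
# Hypothetical wrapper (not shipped with the skill).
# $1 = path to transcribe.sh, $2 = directory containing .m4a files.
transcribe_dir() {
  td_script=$1
  td_dir=$2
  for td_file in "$td_dir"/*.m4a; do
    # If the glob matched nothing, the literal pattern is returned; skip it.
    [ -e "$td_file" ] || continue
    # Keep going on failure so one bad file does not stop the batch.
    "$td_script" "$td_file" || echo "failed: $td_file" >&2
  done
}
```

Each file is transcribed with the defaults listed above, so every run uses `whisper-1` and writes its own `.txt` output.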

## Useful flags

```bash
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --model whisper-1 --out /tmp/transcript.txt
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --language en
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --prompt "Speaker names: Peter, Daniel"
{baseDir}/scripts/transcribe.sh /path/to/audio.m4a --json --out /tmp/transcript.json
```
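
The script's source is not reproduced here, so the following is only a sketch of how these flags could map onto the multipart form fields of the `/v1/audio/transcriptions` endpoint (`file`, `model`, `language`, `prompt`, `response_format`). The function name `transcribe_sketch`, its argument order, the `DRY_RUN` switch, and the mapping of `--json` to `response_format=json` are assumptions made for illustration.

```bash
# Hypothetical helper, NOT the skill's actual transcribe.sh.
# Usage: transcribe_sketch <audio-file> [language] [prompt] [response_format]
transcribe_sketch() {
  ts_file=$1
  ts_language=${2:-}
  ts_prompt=${3:-}
  ts_format=${4:-text}

  # Build the curl argument list in the function's positional parameters.
  set -- -sS https://api.openai.com/v1/audio/transcriptions \
    -H "Authorization: Bearer ${OPENAI_API_KEY:-}" \
    -F "file=@${ts_file}" \
    -F "model=whisper-1" \
    -F "response_format=${ts_format}"

  # Optional fields are sent only when a value was given.
  if [ -n "$ts_language" ]; then set -- "$@" -F "language=${ts_language}"; fi
  # --form-string sends the value literally (no @file or ;type= parsing).
  if [ -n "$ts_prompt" ]; then set -- "$@" --form-string "prompt=${ts_prompt}"; fi

  if [ "${DRY_RUN:-0}" = "1" ]; then
    printf '%s\n' "$@"   # print the request arguments instead of sending them
  else
    curl "$@"
  fi
}
```

Setting `DRY_RUN=1` before calling `transcribe_sketch /path/to/audio.m4a en` prints the arguments that would be passed to curl, which is a cheap way to inspect the request without uploading audio. The printed list includes the `Authorization` header, so avoid pasting it into logs.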

## API key

Set `OPENAI_API_KEY`, or configure it in `~/.clawdbot/clawdbot.json`:

```json5
{
  skills: {
    "openai-whisper-api": {
      apiKey: "OPENAI_KEY_HERE"
    }
  }
}
```
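
When relying on the environment variable, a pre-flight check makes a missing key fail fast with a readable message instead of an HTTP 401. This is a minimal sketch: `require_api_key` is a hypothetical helper, and it only inspects `OPENAI_API_KEY`; it does not read `clawdbot.json`.

```bash
# Hypothetical pre-flight check (not part of the skill).
# Returns non-zero and prints a hint if OPENAI_API_KEY is empty or unset.
require_api_key() {
  if [ -z "${OPENAI_API_KEY:-}" ]; then
    echo "OPENAI_API_KEY is not set; export it or configure the skill's apiKey" >&2
    return 1
  fi
}
```

A typical use is `require_api_key && {baseDir}/scripts/transcribe.sh /path/to/audio.m4a`, so the script only runs once a key is present.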

Related Skills

openai-whisper

533
from sundial-org/awesome-openclaw-skills

Local speech-to-text with the Whisper CLI (no API key).

openai-tts

533
from sundial-org/awesome-openclaw-skills

Text-to-speech via OpenAI Audio Speech API.

openai-image-gen

533
from sundial-org/awesome-openclaw-skills

Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.

openai-docs-skill

533
from sundial-org/awesome-openclaw-skills

Query the OpenAI developer documentation via the OpenAI Docs MCP server using CLI (curl/jq). Use whenever a task involves the OpenAI API (Responses, Chat Completions, Realtime, etc.), OpenAI SDKs, ChatGPT Apps SDK, Codex, MCP integrations, endpoint schemas, parameters, limits, or migrations and you need up-to-date official guidance.

mlx-whisper

533
from sundial-org/awesome-openclaw-skills

Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).

local-whisper

533
from sundial-org/awesome-openclaw-skills

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

llmwhisperer

533
from sundial-org/awesome-openclaw-skills

Extract text and layout from images and PDFs using LLMWhisperer API. Good for handwriting and complex forms.

portfolio-watcher

533
from sundial-org/awesome-openclaw-skills

Monitor stock/crypto holdings, get price alerts, and track portfolio performance.

portainer

533
from sundial-org/awesome-openclaw-skills

Control Docker containers and stacks via Portainer API. List containers, start/stop/restart, view logs, and redeploy stacks from git.

portable-tools

533
from sundial-org/awesome-openclaw-skills

Build cross-device tools without hardcoding paths or account names.

polymarket

533
from sundial-org/awesome-openclaw-skills

Trade prediction markets on Polymarket. Analyze odds, place bets, track positions, automate alerts, and maximize returns from event outcomes. Covers sports, politics, entertainment, and more.

polymarket-traiding-bot

533
from sundial-org/awesome-openclaw-skills

No description provided.