Speech is Cheap (SIC) Skill

Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.

7 stars

Best use case

Speech is Cheap (SIC) Skill is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.

Teams using Speech is Cheap (SIC) Skill should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/asr/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/ilyakam/asr/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/asr/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Speech is Cheap (SIC) Skill Compares

Feature / AgentSpeech is Cheap (SIC) SkillStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Speech is Cheap (SIC) Skill

Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.

## 🚀 Why use this skill?
- **Disruptive Pricing:** $0.06 - $0.12 per hour (2-15x cheaper than Deepgram or OpenAI).
- **Extreme Speed:** 100 minutes of audio transcribes in ~1 minute.
- **Multilingual:** Supports 100 languages with auto-detection.
- **Agent-Ready:** Designed for high-volume, automated pipelines.

## 🛠 Setup

### 1. Get an API Key
Sign up at [speechischeap.com](https://speechischeap.com). Use code `CH5` for $5 off.

### 2. Configure Authentication
This skill looks for your API key in the `SIC_API_KEY` environment variable.

Add this to your `.env` or agent config:
```bash
SIC_API_KEY=your_key_here
```

## 📖 Usage
### 🤖 TL;DR for Agents
When this skill is installed, you can transcribe any URL from an OpenClaw session and get the JSON results immediately by running:
`./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"`

### Transcribe a URL
```bash
# Basic transcription
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"

# Advanced transcription with options
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" \
  --speakers --words --labels \
  --language "en" \
  --format "srt" \
  --private
```

### Transcribe a Local File
Perfect for processing audio already on your disk. This handles the upload automatically.
```bash
# Upload and transcribe local media
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav"

# Upload with webhook callback
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav" --webhook "https://mysite.com/callback"

# Note: For local files, the skill handles the multi-part upload to
# https://upload.speechischeap.com before starting the transcription.
```

### Supported Options
- `--speakers`: Enable speaker diarization
- `--words`: Enable word-level timestamps
- `--labels`: Enable audio labeling (music, noise, etc.)
- `--stream`: Enable streaming output
- `--private`: Do not store audio/transcript (privacy mode)
- `--language <code>`: ISO language code (e.g., 'en', 'es')
- `--confidence <float>`: Minimum confidence threshold (default 0.5)
- `--format <fmt>`: Output format (json, srt, vtt, webvtt)
- `--webhook <url>`: URL to receive job completion payload
- `--segment-duration <n>`: Segment duration in seconds (default 30)

### Check Job Status
```bash
./skills/asr/scripts/asr.sh status "job-id-here"
```

## 🤖 For Agents
The `asr.sh` command-line tool returns JSON by default when successful, making it easy to pipe into other tools or parse directly.

If the `SIC_API_KEY` is missing, the tool will provide a clear error message and a direct link to the signup page.

Related Skills

miranda-elevenlabs-speech

7
from Demerzels-lab/elsamultiskillagent

Text-to-Speech and Speech-to-Text using ElevenLabs AI.

namecheap-dns

7
from Demerzels-lab/elsamultiskillagent

Safe DNS record management for Namecheap domains.

chichi-speech

7
from Demerzels-lab/elsamultiskillagent

A RESTful service for high-quality text-to-speech using Qwen3.

cheapest-image-generation

7
from Demerzels-lab/elsamultiskillagent

Possibly the cheapest AI image generation (~$0.0036/image)

salute-speech

7
from Demerzels-lab/elsamultiskillagent

Transcribe audio files using Sber Salute Speech async API.

speechall-cli

7
from Demerzels-lab/elsamultiskillagent

Install and use the speechall CLI tool for speech-to-text transcription.

elevenlabs-speech

7
from Demerzels-lab/elsamultiskillagent

Text-to-Speech and Speech-to-Text using ElevenLabs AI. Use when the user wants to convert text to speech, transcribe voice messages, or work with voice in multiple languages. Supports high-quality AI voices and accurate transcription.

paylock

7
from Demerzels-lab/elsamultiskillagent

Non-custodial SOL escrow for AI agent deals.

agent-reputation

7
from Demerzels-lab/elsamultiskillagent

summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.

Telecom Agent Skill

7
from Demerzels-lab/elsamultiskillagent

Turn your AI Agent into a Telecom Operator. Bulk calling, ChatOps, and Field Monitoring.

OpenClaw-Finnhub

7
from Demerzels-lab/elsamultiskillagent

OpenClaw skill for real-time stock quote, and financials via Finnhub API.

```markdown

7
from Demerzels-lab/elsamultiskillagent

# OpenClaw-Last.fm