livekit-voice
LiveKit real-time voice and video infrastructure — create rooms, generate JWT access tokens, manage participants, and record sessions. Open source WebRTC for voice AI agents and real-time communication. Use for building voice agents, video rooms, or real-time audio.
Best use case
livekit-voice is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using livekit-voice should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/livekit-voice/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How livekit-voice Compares
| Feature / Agent | livekit-voice | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
LiveKit real-time voice and video infrastructure — create rooms, generate JWT access tokens, manage participants, and record sessions. Open source WebRTC for voice AI agents and real-time communication. Use for building voice agents, video rooms, or real-time audio.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
AI Agent for YouTube Script Writing
Find AI agent skills for YouTube script writing, video research, content outlining, and repeatable channel production workflows.
SKILL.md Source
# 🎧 LiveKit Voice
LiveKit real-time voice/video infrastructure for OpenClaw agents. Create rooms, generate tokens, manage participants, and integrate with voice AI platforms.
## What is LiveKit?
[LiveKit](https://livekit.io) is an open-source WebRTC infrastructure platform for building real-time audio/video applications. It powers voice AI agents, video conferencing, live streaming, and more.
**Self-hosted vs Cloud:**
- **LiveKit Cloud** — Managed service, no infrastructure to maintain
- **Self-hosted** — Deploy on your own servers via Docker/Kubernetes
## Requirements
| Variable | Required | Description |
|----------|----------|-------------|
| `LIVEKIT_API_KEY` | ✅ | LiveKit API key |
| `LIVEKIT_API_SECRET` | ✅ | LiveKit API secret |
| `LIVEKIT_URL` | ✅ | LiveKit server URL (e.g. `wss://your-project.livekit.cloud`) |
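A minimal sketch of how a script might check these three variables before making any API call. The helper name `load_livekit_config` and its `env` parameter are hypothetical and are not taken from `livekit_api.py`; only the variable names come from the table above.

```python
import os

# Variable names come from the Requirements table.
REQUIRED_VARS = ("LIVEKIT_API_KEY", "LIVEKIT_API_SECRET", "LIVEKIT_URL")


def load_livekit_config(env=None):
    """Return the three required settings, or raise if any is missing or empty.

    `env` defaults to os.environ; passing a dict makes the check easy to test.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {name: env[name] for name in REQUIRED_VARS}
```

Failing early with the full list of missing names tends to be friendlier than an authentication error surfacing later from the server.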
## Quick Start
```bash
# Create a room
python3 {baseDir}/scripts/livekit_api.py create-room my-room
# Create room with options
python3 {baseDir}/scripts/livekit_api.py create-room my-room --max-participants 10 --empty-timeout 300
# Generate access token for a participant
python3 {baseDir}/scripts/livekit_api.py token my-room --identity user123 --name "John"
# Generate token with specific grants
python3 {baseDir}/scripts/livekit_api.py token my-room --identity agent --can-publish --can-subscribe
# List active rooms
python3 {baseDir}/scripts/livekit_api.py list-rooms
# List participants in a room
python3 {baseDir}/scripts/livekit_api.py participants my-room
# Delete a room
python3 {baseDir}/scripts/livekit_api.py delete-room my-room
# Start recording (Egress)
python3 {baseDir}/scripts/livekit_api.py record my-room --output s3://bucket/recording.mp4
```
## Commands
### `create-room <name>`
Create a new LiveKit room.
- `--max-participants N` — limit participants
- `--empty-timeout N` — seconds before empty room auto-closes (default 300)
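For orientation, here is a sketch of the HTTP request that a command like this could translate into. It assumes the script talks to LiveKit's Twirp-based `RoomService/CreateRoom` endpoint; the source does not show how `livekit_api.py` is implemented, and the helper name `build_create_room_request` is hypothetical. The sketch only builds the URL and JSON body and sends nothing.

```python
import json
from urllib.parse import urlparse


def build_create_room_request(livekit_url, name, max_participants=None, empty_timeout=300):
    """Build (url, json_body) for a CreateRoom call without sending it."""
    parsed = urlparse(livekit_url)
    # LIVEKIT_URL is a WebSocket URL; the server API is plain HTTP(S),
    # so map ws -> http and wss -> https and keep the host unchanged.
    scheme = {"ws": "http", "wss": "https"}.get(parsed.scheme, parsed.scheme)
    url = f"{scheme}://{parsed.netloc}/twirp/livekit.RoomService/CreateRoom"
    payload = {"name": name, "empty_timeout": empty_timeout}
    if max_participants is not None:
        payload["max_participants"] = max_participants
    return url, json.dumps(payload)


url, body = build_create_room_request(
    "wss://your-project.livekit.cloud", "my-room", max_participants=10
)
```

To actually create the room, the body would be sent as a `POST` with `Content-Type: application/json` and an `Authorization: Bearer <token>` header, where the token carries a room-creation grant.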
### `token <room>`
Generate a JWT access token for a participant.
- `--identity ID` — participant identity (required)
- `--name NAME` — display name
- `--can-publish` — allow publishing audio/video
- `--can-subscribe` — allow subscribing to others
- `--ttl N` — token TTL in seconds (default 3600)
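To illustrate what such a token contains, the sketch below signs an HS256 JWT using only the standard library. It assumes the usual LiveKit claim layout (`iss` set to the API key, `sub` set to the participant identity, and a `video` grant object); the function name `make_token`, its defaults for the publish and subscribe grants, and the `now` parameter are hypothetical and may differ from what `livekit_api.py` does.

```python
import base64
import hashlib
import hmac
import json
import time


def _b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWT requires."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def make_token(api_key, api_secret, room, identity, name=None,
               can_publish=True, can_subscribe=True, ttl=3600, now=None):
    """Return a signed JWT granting `identity` access to `room`."""
    now = int(time.time()) if now is None else now
    header = {"alg": "HS256", "typ": "JWT"}
    claims = {
        "iss": api_key,       # API key identifies the issuer
        "sub": identity,      # participant identity (--identity)
        "nbf": now,
        "exp": now + ttl,     # expiry derived from --ttl
        "video": {
            "room": room,
            "roomJoin": True,
            "canPublish": can_publish,
            "canSubscribe": can_subscribe,
        },
    }
    if name:
        claims["name"] = name  # display name (--name)
    encode = lambda obj: _b64url(json.dumps(obj, separators=(",", ":")).encode("utf-8"))
    signing_input = encode(header) + "." + encode(claims)
    signature = hmac.new(
        api_secret.encode("utf-8"), signing_input.encode("ascii"), hashlib.sha256
    ).digest()
    return signing_input + "." + _b64url(signature)
```

In practice an official LiveKit server SDK is usually the safer way to mint tokens; the point here is only to show that the token is an ordinary JWT whose signature depends on `LIVEKIT_API_SECRET`.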
### `list-rooms`
List all active rooms with participant counts.
### `participants <room>`
List participants in a room with their connection state and tracks.
### `delete-room <name>`
Delete/close a room and disconnect all participants.
### `record <room>`
Start an Egress recording of a room.
- `--output URL` — output destination (S3, GCS, or local path)
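Since `--output` accepts several destination kinds, a script needs some way to tell them apart. The following is one possible sketch: the function `classify_output` is hypothetical, and treating `gs://` as the GCS scheme is an assumption, since the source only shows an `s3://` example.

```python
from urllib.parse import urlparse


def classify_output(output):
    """Return (kind, bucket, path) where kind is 's3', 'gcs', or 'local'."""
    parsed = urlparse(output)
    if parsed.scheme == "s3":
        return "s3", parsed.netloc, parsed.path.lstrip("/")
    if parsed.scheme == "gs":  # assumed URL scheme for Google Cloud Storage
        return "gcs", parsed.netloc, parsed.path.lstrip("/")
    # Anything else is treated as a local filesystem path, unchanged.
    return "local", None, output
```

For example, `classify_output("s3://bucket/recording.mp4")` separates the bucket name from the object key, which are typically configured as separate fields in an upload target.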
## Voice AI Integration
LiveKit is the backbone for many voice AI platforms:
- **Vapi** — Uses LiveKit for real-time voice AI agent calls
- **ElevenLabs** — Stream TTS audio into LiveKit rooms
- **OpenAI Realtime** — Connect GPT-4o voice to LiveKit participants
### Agent Pattern
1. Create a LiveKit room
2. Generate tokens for both human and AI agent
3. AI agent joins, subscribes to human audio
4. Agent runs each audio segment through STT → LLM → TTS and publishes the synthesized reply back to the room
5. Result: real-time voice conversation with AI
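The processing step in the pattern above can be sketched as a small loop. This is only an outline under stated assumptions: `run_voice_turns` and the three stand-in stages are hypothetical, real STT, LLM, and TTS services would replace them, and the room connection itself (joining, subscribing, publishing) is left out.

```python
from typing import Callable, Iterable, Iterator


def run_voice_turns(
    audio_chunks: Iterable[bytes],
    stt: Callable[[bytes], str],
    llm: Callable[[str], str],
    tts: Callable[[str], bytes],
) -> Iterator[bytes]:
    """Yield synthesized reply audio for each non-empty chunk of human audio."""
    for chunk in audio_chunks:
        text = stt(chunk)          # speech-to-text
        if not text.strip():
            continue               # skip silence or empty transcripts
        reply = llm(text)          # generate a response
        yield tts(reply)           # text-to-speech, ready to publish


# Stand-in stages so the sketch runs without any external service.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


replies = list(run_voice_turns([b"hello", b"  ", b"bye"], fake_stt, fake_llm, fake_tts))
```

Passing the three stages in as callables keeps the loop independent of any particular provider, so swapping one TTS or LLM vendor for another does not touch the control flow.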
## Credits
Built by [M. Abidi](https://www.linkedin.com/in/mohammad-ali-abidi) | [agxntsix.ai](https://www.agxntsix.ai)
[YouTube](https://youtube.com/@aiwithabidi) | [GitHub](https://github.com/aiwithabidi)
Part of the **AgxntSix Skill Suite** for OpenClaw agents.
📅 **Need help setting up OpenClaw for your business?** [Book a free consultation](https://cal.com/agxntsix/abidi-openclaw)
Related Skills
Invoice Generator
Creates professional invoices in markdown and HTML
brand-voice-generator
Creates consistent brand voice guidelines and content. Generates copy that matches your brand personality across all channels. Perfect for startups building their identity.
invoice-ocr
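Invoice OCR recognition skill. Scans invoice files (PDF/images) in a folder, calls the Alibaba Cloud OCR API to extract invoice information, and exports it to an Excel spreadsheet. Supports 17+ invoice types (VAT invoices, train tickets, taxi receipts, flight itineraries, fixed-amount invoices, motor vehicle sales invoices, road and bridge toll invoices, etc.). Use cases: (1) the user mentions "invoice recognition", "invoice statistics", "invoice organization", or "invoice summary"; (2) the user needs to batch-process invoices; (3) the user mentions Alibaba Cloud OCR invoice recognition. **Important: Alibaba Cloud credentials must be configured before first use. Proactively ask the user for the AccessKey ID and AccessKey Secret, or guide the user to run the --config command to configure them.**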
Bland AI — Voice Calling Skill
Make and manage AI-powered phone calls via the Bland AI API.
afrexai-invoice-engine
Generate, manage, and track professional invoices with payment terms, recurring billing, overdue automation, and financial reporting. Use when creating invoices, tracking payments, managing clients, or reviewing revenue.
voice-tts
Voice input (Whisper ASR) plus voice output (Edge TTS) skill. Supports agent-specific voices and can call send_voice_reply.mjs to send Telegram voice messages.
amber-voice-assistant
AI phone assistant and virtual receptionist for OpenClaw. Answers inbound phone calls, screens callers, makes outbound phone calls, and books appointments — all over Twilio + OpenAI Realtime voice. Full telephone workflow: phone call screening, live call transcripts, CRM contact memory, calendar integration. Ideal for anyone who wants an AI to answer their phone, handle call screening, or make phone calls autonomously. Includes interactive setup wizard, live call dashboard, and human-in-the-loop escalation. Also ships as a Claude Desktop MCP plugin — dial phone numbers, check call history, query CRM, and manage calendar directly from Claude Desktop.
discord-voice
Real-time voice conversations in Discord voice channels with Claude AI
feishu-voice-assistant
Sends voice messages (audio) to Feishu chats using Duby TTS.
invoice-chaser
Automated invoice follow-up sequences that escalate from friendly to firm. Track unpaid invoices, send timed reminder emails with escalating tone, log payment interactions, and generate AR aging reports. Your agent handles the awkward conversations so you don't have to — preserving cash flow and client relationships while you focus on actual work. Configure invoice tracking, email templates per stage (friendly → firm → final notice), timing rules, and let your agent chase payments 24/7. Use when adding invoices, running payment chases, checking status, or generating accounts receivable reports.
voiceclaw
Local voice I/O for OpenClaw agents. Transcribe inbound audio/voice messages using local Whisper (whisper.cpp) and generate voice replies using local Piper TTS. Requires whisper, piper, and ffmpeg pre-installed on the system. All inference runs on-device — no network calls, no cloud APIs, no API keys. Use when an agent receives a voice/audio message and should respond in both voice and text, or when any text response should be synthesized and sent as audio. Triggers on: voice messages, audio attachments, respond in voice, send as audio, speak this, voiceclaw.
anvevoice
Add AI voice assistants to your website. Engage visitors with natural voice conversations, capture leads, automate support, and boost conversions.