voice-quality-audition
Audition Kokoro TTS voices to compare quality and grade. TRIGGERS - audition voices, kokoro voices, voice comparison, tts voice, voice quality, compare voices.
Best use case
voice-quality-audition is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Audition Kokoro TTS voices to compare quality and grade. TRIGGERS - audition voices, kokoro voices, voice comparison, tts voice, voice quality, compare voices.
Teams using voice-quality-audition should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/voice-quality-audition/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How voice-quality-audition Compares
| Feature / Agent | voice-quality-audition | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Audition Kokoro TTS voices to compare quality and grade. TRIGGERS - audition voices, kokoro voices, voice comparison, tts voice, voice quality, compare voices.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Voice Quality Audition Compare Kokoro TTS voice quality across all available voices. Runs `tts_kokoro_audition.sh` which plays a passage with each top voice. Each voice announces its name before reading the passage. Uses clipboard text or a default passage. > **Platform**: macOS (Apple Silicon) --- > **Self-Evolving Skill**: This skill improves through use. If instructions are wrong, parameters drifted, or a workaround was needed — fix this file immediately, don't defer. Only update for real, reproducible issues. ## When to Use This Skill - Audition all available Kokoro voices to hear quality differences - Compare specific voices side-by-side for a project - Re-evaluate voice grades after a Kokoro engine upgrade - Select a new default voice for TTS_VOICE_EN or TTS_VOICE_ZH - Test how a particular passage sounds across multiple voices --- ## Requirements - Kokoro TTS engine installed and healthy (`kokoro-install.sh --health`) - Apple Silicon Mac with MLX Metal available - `afplay` available (ships with macOS) - Audition script at plugin `scripts/tts_kokoro_audition.sh` - Shared library at plugin `scripts/lib/tts-common.sh` --- ## Voice Catalog | Voice ID | Name | Grade | Gender | | ---------- | ------- | ------- | ------ | | af_heart | Heart | A | Female | | af_bella | Bella | A- | Female | | af_nicole | Nicole | B- | Female | | af_aoede | Aoede | C+ | Female | | af_kore | Kore | C+ | Female | | af_sarah | Sarah | C+ | Female | | am_adam | Adam | F+ | Male | | am_michael | Michael | unrated | Male | | am_echo | Echo | D | Male | | am_puck | Puck | unrated | Male | **Current defaults** (configured in `~/.claude/automation/claude-telegram-sync/mise.toml`): - English voice: `af_heart` (Grade A) via `TTS_VOICE_EN` - Chinese voice: `zf_xiaobei` via `TTS_VOICE_ZH` - macOS `say` fallback EN: `Samantha` via `TTS_VOICE_SAY_EN` - macOS `say` fallback ZH: `Ting-Ting` via `TTS_VOICE_SAY_ZH` See [Voice Catalog](./references/voice-catalog.md) for detailed characteristics and grade criteria. --- ## Workflow Phases ### Phase 1: Preflight Verify Kokoro is installed and healthy: ```bash kokoro-install.sh --health ``` All 6 checks must pass (venv, Python 3.13, mlx_audio importable, kokoro_common.py, tts_generate.py, version.json). ### Phase 2: Text Selection The audition script reads from the macOS clipboard (`pbpaste`). If the clipboard is empty or not text, it falls back to a built-in passage about reading in a library. To audition with custom text, copy the desired passage to the clipboard before running. ### Phase 3: Ask User — Full or Selective Audition Use `AskUserQuestion` to determine scope: - **Full audition** — Play all 10 voices sequentially (takes several minutes) - **Select specific voices** — Run only a subset (e.g., top 3 female voices) For a selective audition, edit the `VOICES` array in the script or pass voice IDs manually. ### Phase 4: Execute Audition ```bash ~/.local/bin/tts_kokoro_audition.sh ``` Or directly from the plugin source: ```bash /path/to/plugins/tts-tg-sync/scripts/tts_kokoro_audition.sh ``` The script acquires the TTS lock, plays each voice sequentially with a 1-second gap, then releases the lock on exit. ### Phase 5: Feedback Use `AskUserQuestion` to collect the user's preference: - Which voice sounded best? - Any voices to eliminate from future consideration? - Should we update grade assignments? ### Phase 6: Apply Configuration Optionally update the default voice in mise.toml: ```toml # ~/.claude/automation/claude-telegram-sync/mise.toml [env] TTS_VOICE_EN = "af_heart" # Change to preferred voice ID TTS_VOICE_ZH = "zf_xiaobei" ``` After changing mise.toml, restart the Telegram bot for the new voice to take effect. --- ## TodoWrite Task Templates ``` 1. [Preflight] Verify Kokoro TTS is installed and healthy (kokoro-install.sh --health) 2. [Text] Check clipboard for passage, fall back to default if empty 3. [Select] Ask user: full audition (all 10 voices) or specific voices 4. [Audition] Run tts_kokoro_audition.sh and let user listen 5. [Feedback] Ask user which voice they prefer and collect grade feedback 6. [Apply] Optionally update TTS_VOICE_EN in mise.toml and restart bot ``` --- ## Post-Change Checklist - [ ] Kokoro health check passed before audition - [ ] All selected voices played without errors - [ ] User confirmed preferred voice - [ ] mise.toml updated with new voice ID (if changed) - [ ] Bot restarted after configuration change (if applicable) - [ ] Voice catalog grades updated in reference doc (if re-graded) ## Troubleshooting | Issue | Cause | Solution | | ------------------------------- | ---------------------------------- | ----------------------------------------------------------------------- | | No audio plays | Kokoro not installed | Run `kokoro-install.sh --install` or use `full-stack-bootstrap` skill | | Audio cuts off mid-sentence | TTS lock stolen by another process | Check for competing TTS processes: `pgrep -la afplay` | | Voice sounds wrong | Invalid voice ID in Kokoro model | Verify voice ID exists in `VOICES` array; check Kokoro version | | Clipboard empty | No text copied | Script uses default passage automatically; no action needed | | "ERROR: Local Kokoro not found" | Venv or script missing | Run `kokoro-install.sh --health` to diagnose; `--install` to fix | | FAILED for a specific voice | Voice not available in model | Voice may require a different Kokoro version; check model compatibility | | Lock not released | Script crashed without cleanup | Remove stale lock: `rm -f /tmp/kokoro-tts.lock` | | All voices sound identical | Kokoro model not loaded properly | Re-download model: `kokoro-install.sh --upgrade` | --- ## Reference Documentation - [Voice Catalog](./references/voice-catalog.md) - Comprehensive voice listing with quality grades, characteristics, and selection guidance - [Evolution Log](./references/evolution-log.md) - Change history for this skill ## Post-Execution Reflection After this skill completes, reflect before closing the task: 0. **Locate yourself.** — Find this SKILL.md's canonical path (Glob for this skill's name) before editing. All corrections target THIS file and its sibling references/ — never other documentation. 1. **What failed?** — Fix the instruction that caused it. If it could recur, add it as an anti-pattern. 2. **What worked better than expected?** — Promote it to recommended practice. Document why. 3. **What drifted?** — Any script, reference, or external dependency that no longer matches reality gets fixed now. 4. **Log it.** — Every change gets an evolution-log entry with trigger, fix, and evidence. Do NOT defer. The next invocation inherits whatever you leave behind. --- ---
Related Skills
settings-and-tuning
Configure TTS voices, speed, timeouts, queue depth, and bot settings. TRIGGERS - configure tts, change voice, tts speed, queue depth, tts timeout, bot config, tune settings, adjust parameters.
full-stack-bootstrap
One-time bootstrap for Kokoro TTS engine, Telegram bot, and BotFather setup. TRIGGERS - setup tts, install kokoro, botfather, bootstrap tts-tg-sync, configure telegram bot, full stack setup.
diagnostic-issue-resolver
Diagnose and resolve TTS and Telegram bot issues. TRIGGERS - tts not working, bot not responding, kokoro error, audio not playing, lock stuck, telegram bot troubleshoot, diagnose issue.
component-version-upgrade
Upgrade Kokoro model, bot dependencies, or TTS components. TRIGGERS - upgrade kokoro, update model, upgrade bot, update dependencies, version bump, component update.
clean-component-removal
Remove TTS and Telegram sync components cleanly. TRIGGERS - uninstall tts, remove telegram bot, uninstall kokoro, clean tts, teardown, component removal.
send-message
Use when user wants to send a text message on Telegram as their personal account via MTProto, text someone, or message a contact by username, phone, or chat ID.
send-media
Use when user wants to send or upload a file, photo, video, voice note, or document on Telegram via their personal account.
search-messages
Use when user wants to search for messages across all Telegram chats or within a specific chat, find old messages by text, or look up Telegram message history filtered by sender.
pin-message
Use when user wants to pin or unpin a message in a Telegram chat, group, or channel, or manage pinned messages.
mark-read
Use when user wants to mark Telegram chats as read, clear unread badges and mentions, dismiss notifications, or acknowledge messages to remove the unread counter.
manage-members
Use when user wants to manage Telegram group or channel members, including inviting users, kicking or banning someone, listing members, or filtering admins.
list-dialogs
Use when user wants to list all Telegram chats, groups, and channels, see their contacts, find a chat ID, browse conversations, or check account info via whoami.