callmac

Remote voice control for Mac from mobile devices using commands like /callmac or /voice. Broadcast announcements, play alarms, tell stories, wake up kids - all triggered from Telegram/WhatsApp messages. Uses edge-tts with mixed Chinese/English support, local playback, loops, and volume control.

7 stars

by Demerzels-lab


Best use case

callmac is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using callmac should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/callmac/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/jooey/callmac/SKILL.md"

Manual Installation

1. Download SKILL.md from GitHub
2. Place it in .claude/skills/callmac/SKILL.md inside your project
3. Restart your AI agent; it will auto-discover the skill

How callmac Compares

| Feature / Agent | callmac | Standard Approach |
| --- | --- | --- |
| Platform Support | macOS (playback relies on `afplay` and `osascript`) | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

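What does this skill do?

callmac turns a message sent from Telegram or WhatsApp (for example /callmac or /voice) into speech played on your Mac. It generates the audio with edge-tts, supports mixed Chinese/English text, and plays it locally with optional looping and volume control.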

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Callmac Skill

Remote voice control for your Mac from mobile devices. Broadcast announcements, play alarms, tell stories, wake up kids - all triggered from messaging apps like Telegram or WhatsApp. Uses edge-tts with mixed Chinese/English support.


## Quick Start

Get started with the voice skill by generating and playing mixed Chinese/English TTS audio.

### Basic Usage

```bash
# Generate and play a simple announcement
python3 scripts/generate_tts.py --text "Hello world" --play

# Generate mixed Chinese/English content
python3 scripts/generate_tts.py --text "Hello 你好" --play

# Save to file
python3 scripts/generate_tts.py --text "Your message" --output announcement.mp3
```

### Voice Selection

Edge TTS provides high-quality neural voices:
- **English (US):** `en-US-JennyNeural` (friendly), `en-US-AriaNeural` (confident)
- **Chinese (Mandarin):** `zh-CN-XiaoxiaoNeural` (warm), `zh-CN-XiaoyiNeural` (lively)
- **Other languages:** See [VOICES.md](references/VOICES.md) for the complete list

## Features

### 1. Mixed Language Support
Automatically detects language segments and uses appropriate voices:
- English segments → English neural voice
- Chinese segments → Chinese neural voice
- Other languages → Default or specified voice
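
The segment detection described above could be sketched as follows. This is a minimal illustration, not the code in `scripts/generate_tts.py`: the function names `is_cjk` and `split_by_language` are hypothetical, and the sketch assumes that "Chinese" means characters in the main CJK Unified Ideographs block, that every other letter counts as English, and that digits, spaces, and punctuation stay with the run they follow.

```python
from typing import List, Optional, Tuple


def is_cjk(ch: str) -> bool:
    """True for characters in the main CJK Unified Ideographs block (assumption)."""
    return "\u4e00" <= ch <= "\u9fff"


def split_by_language(text: str) -> List[Tuple[str, str]]:
    """Split text into ("zh" | "en", segment) runs (hypothetical helper)."""
    segments: List[Tuple[str, str]] = []
    current_lang: Optional[str] = None
    buffer = ""
    for ch in text:
        if is_cjk(ch):
            lang = "zh"
        elif ch.isalpha():
            lang = "en"
        else:
            lang = current_lang  # neutral: digits, spaces, punctuation
        # A change of language closes the current run.
        if current_lang is not None and lang != current_lang:
            segments.append((current_lang, buffer.strip()))
            buffer = ""
        current_lang = lang
        buffer += ch
    if buffer.strip():
        # Text with no letters at all falls back to "en".
        segments.append((current_lang or "en", buffer.strip()))
    return segments


print(split_by_language("Hello 你好"))
```

Each returned pair can then be sent to the voice configured for that language.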

### 2. Playback Control
- Local playback on Mac using `afplay`
- Loop playback support (1-∞ times)
- Volume control (0-100%)
- Background/foreground playback options
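
As a rough sketch of how looped playback and volume control might be wired together on macOS: the helper names below are hypothetical, and the sketch assumes volume is set system-wide through `osascript` before playback starts, with `afplay` run once per loop in the foreground.

```python
import subprocess
from typing import List, Optional


def volume_command(percent: int) -> List[str]:
    """osascript call that sets the macOS output volume (0-100)."""
    if not 0 <= percent <= 100:
        raise ValueError("volume must be between 0 and 100")
    return ["osascript", "-e", f"set volume output volume {percent}"]


def play_command(path: str) -> List[str]:
    """afplay call that plays one file and blocks until it finishes."""
    return ["afplay", path]


def play(path: str, loops: int = 1, volume: Optional[int] = None) -> None:
    """Play a file `loops` times in the foreground (macOS only)."""
    if loops < 1:
        raise ValueError("loops must be at least 1")
    if volume is not None:
        subprocess.run(volume_command(volume), check=True)
    for _ in range(loops):
        subprocess.run(play_command(path), check=True)
```

For background playback, the same command list could be handed to `subprocess.Popen` instead of `subprocess.run`, so the caller does not block.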

### 3. File Management
- Save as MP3 files
- Concatenate multiple audio segments
- Batch processing support

### 4. Advanced Features
- Custom voice selection per segment
- Speech rate adjustment
- Pitch modification
- SSML support for advanced control

## Workflows

### Workflow 1: Simple Announcement Playback
1. User provides text to speak
2. System detects language(s)
3. Generates appropriate TTS audio
4. Plays locally on Mac
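
The four steps above might look like this in Python. It is a sketch under stated assumptions rather than the skill's actual implementation: `pick_voice`, `synthesize`, and `announce` are hypothetical names, the default voices are simply the first ones listed under Voice Selection, and the whole text is sent to a single voice chosen by whether it contains any Chinese character. `edge_tts.Communicate(...).save(...)` is the edge-tts library call; it needs the package installed and a network connection.

```python
import asyncio
import subprocess

# Assumed defaults, taken from the Voice Selection list.
DEFAULT_VOICES = {"en": "en-US-JennyNeural", "zh": "zh-CN-XiaoxiaoNeural"}


def pick_voice(text: str) -> str:
    """Choose the Chinese voice if the text contains any CJK ideograph."""
    has_cjk = any("\u4e00" <= ch <= "\u9fff" for ch in text)
    return DEFAULT_VOICES["zh" if has_cjk else "en"]


async def synthesize(text: str, voice: str, out_path: str) -> None:
    """Generate an MP3 with edge-tts."""
    import edge_tts  # imported lazily so pick_voice works without the package

    await edge_tts.Communicate(text, voice).save(out_path)


def announce(text: str, out_path: str = "announcement.mp3") -> None:
    """Detect the language, generate the audio, then play it with afplay."""
    asyncio.run(synthesize(text, pick_voice(text), out_path))
    subprocess.run(["afplay", out_path], check=True)
```

Calling `announce("Hello world")` on a Mac with edge-tts installed would write `announcement.mp3` and play it once.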

### Workflow 2: Mixed Language Audio Creation
1. User provides Chinese/English mixed text
2. System splits by language segments
3. Generates separate audio for each language
4. Concatenates into single MP3 file
5. Optionally plays or saves

### Workflow 3: Scheduled/Repeated Announcements
1. User provides text and playback schedule
2. System creates cron job or loop
3. Plays at specified intervals
4. Can be stopped on demand
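
One way to manage the cron entry for a repeated announcement is to treat the crontab as text, adding and removing a single line. The helpers below are hypothetical and only build strings; the sketch assumes `afplay` lives at `/usr/bin/afplay` and that the interval is a whole number of minutes under one hour.

```python
import shlex


def cron_line(every_minutes: int, audio_path: str) -> str:
    """Build a crontab entry that plays a file every N minutes."""
    if not 1 <= every_minutes <= 59:
        raise ValueError("interval must be between 1 and 59 minutes")
    return f"*/{every_minutes} * * * * /usr/bin/afplay {shlex.quote(audio_path)}"


def add_job(existing_crontab: str, line: str) -> str:
    """Return crontab text with `line` appended once; other entries are kept."""
    lines = [entry for entry in existing_crontab.splitlines() if entry.strip()]
    if line not in lines:
        lines.append(line)
    return "\n".join(lines) + "\n"


def remove_job(existing_crontab: str, line: str) -> str:
    """Return crontab text without `line`, which stops the announcement."""
    lines = [
        entry
        for entry in existing_crontab.splitlines()
        if entry.strip() and entry != line
    ]
    return "\n".join(lines) + "\n" if lines else ""


print(cron_line(30, "/path/to/reminder.mp3"))
```

The resulting text can be installed by piping it to `crontab -`, with the current contents read from `crontab -l` first so existing jobs survive.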

## Scripts

### `scripts/generate_tts.py`
Main script for TTS generation and playback.

**Usage:**
```bash
# Basic generation and playback
python3 generate_tts.py --text "Message" --play

# Save to file
python3 generate_tts.py --text "Message" --output file.mp3

# Mixed language with custom voices
python3 generate_tts.py --text "Hello 你好" --voice-en "en-US-AriaNeural" --voice-zh "zh-CN-XiaoyiNeural"

# Loop playback
python3 generate_tts.py --text "Message" --play --loops 5

# Volume control
python3 generate_tts.py --text "Message" --play --volume 80

# SSML support
python3 generate_tts.py --ssml "<speak>Hello <break time='500ms'/> world</speak>"
```

**Parameters:**
- `--text`: Text to convert to speech
- `--ssml`: SSML markup for advanced control
- `--output`: Save to MP3 file
- `--play`: Play immediately after generation
- `--voice`: Default voice (overrides auto-detection)
- `--voice-en`: English voice override
- `--voice-zh`: Chinese voice override
- `--loops`: Number of times to play (default: 1)
- `--volume`: Playback volume 0-100 (default: system volume)
- `--rate`: Speech rate adjustment (+/- %)
- `--pitch`: Pitch adjustment (+/- Hz)
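
For readers who want to see how these flags could be declared, here is a hypothetical `argparse` reconstruction. It mirrors the documented parameter names, but the types, defaults for `--rate` and `--pitch`, and the rule that `--text` and `--ssml` are mutually exclusive are assumptions, not taken from the script itself.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the documented flags (hypothetical reconstruction)."""
    parser = argparse.ArgumentParser(description="Generate and play TTS audio")
    # Assumption: exactly one of --text / --ssml must be given.
    source = parser.add_mutually_exclusive_group(required=True)
    source.add_argument("--text", help="Text to convert to speech")
    source.add_argument("--ssml", help="SSML markup for advanced control")
    parser.add_argument("--output", help="Save to MP3 file")
    parser.add_argument("--play", action="store_true", help="Play after generation")
    parser.add_argument("--voice", help="Default voice (overrides auto-detection)")
    parser.add_argument("--voice-en", help="English voice override")
    parser.add_argument("--voice-zh", help="Chinese voice override")
    parser.add_argument("--loops", type=int, default=1, help="Number of times to play")
    parser.add_argument("--volume", type=int, choices=range(0, 101), metavar="0-100",
                        help="Playback volume; omitted means leave system volume alone")
    parser.add_argument("--rate", default="+0%", help="Speech rate adjustment")
    parser.add_argument("--pitch", default="+0Hz", help="Pitch adjustment")
    return parser


args = build_parser().parse_args(["--text", "Hello 你好", "--play", "--loops", "5"])
print(args.text, args.play, args.loops)
```

With `argparse`, a negative adjustment has to be written as `--rate=-10%`, because a bare `-10%` after the flag is parsed as another option.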

### `scripts/play_audio.py`
Audio playback control utilities.

**Usage:**
```bash
# Play existing audio file
python3 play_audio.py --file audio.mp3

# Loop playback
python3 play_audio.py --file audio.mp3 --loops 3

# Volume control
python3 play_audio.py --file audio.mp3 --volume 75

# Stop all playback
python3 play_audio.py --stop
```

### `scripts/merge_audio.py`
Merge multiple audio files.

**Usage:**
```bash
# Merge two files
python3 merge_audio.py --files part1.mp3 part2.mp3 --output combined.mp3

# Create playlist and merge
python3 merge_audio.py --playlist playlist.txt --output combined.mp3
```
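
Merging with ffmpeg is commonly done through its concat demuxer, which reads a list file of inputs. The sketch below shows that approach; whether `merge_audio.py` works this way is an assumption, and `concat_list`, `merge_command`, and `merge` are hypothetical names. Stream copy (`-c copy`) only works cleanly when all inputs share the same codec settings, which is likely for files generated by the same TTS voice pipeline.

```python
import subprocess
from pathlib import Path
from typing import List


def concat_list(files: List[str]) -> str:
    """Build the concat-demuxer list text, one `file '...'` line per input."""
    lines = []
    for name in files:
        escaped = name.replace("'", "'\\''")  # ffmpeg's quoting for a literal '
        lines.append(f"file '{escaped}'")
    return "\n".join(lines) + "\n"


def merge_command(list_path: str, output: str) -> List[str]:
    """ffmpeg call that joins the listed files without re-encoding."""
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_path, "-c", "copy", output]


def merge(files: List[str], output: str) -> None:
    """Write the list file next to the output and run ffmpeg."""
    list_path = Path(output).with_suffix(".txt")
    absolute = [str(Path(name).resolve()) for name in files]
    list_path.write_text(concat_list(absolute), encoding="utf-8")
    subprocess.run(merge_command(str(list_path), output), check=True)
```

Absolute paths are written to the list because the concat demuxer resolves relative entries against the list file's directory, not the current working directory.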

## References

### [VOICES.md](references/VOICES.md)
Complete list of available Edge TTS voices with language, gender, and style information.

### [INSTALLATION.md](references/INSTALLATION.md)
Setup instructions for edge-tts and ffmpeg dependencies.

### [EXAMPLES.md](references/EXAMPLES.md)
Example usage patterns and common scenarios.

## Dependencies

1. **edge-tts**: `pip3 install edge-tts`
2. **ffmpeg**: `brew install ffmpeg` (for audio merging)
3. **macOS tools**: `afplay` (built-in), `osascript` (volume control)
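
A quick preflight check for these dependencies can be written with the standard library alone. This is an illustrative helper, not part of the skill; `check_dependencies` is a hypothetical name.

```python
import importlib.util
import shutil
from typing import Dict


def check_dependencies() -> Dict[str, bool]:
    """Report which of the required tools are available on this machine."""
    return {
        "edge-tts": importlib.util.find_spec("edge_tts") is not None,
        "ffmpeg": shutil.which("ffmpeg") is not None,
        "afplay": shutil.which("afplay") is not None,
        "osascript": shutil.which("osascript") is not None,
    }


for name, available in check_dependencies().items():
    print(f"{name}: {'ok' if available else 'missing'}")
```

On a machine other than a Mac, `afplay` and `osascript` will be reported as missing, which matches the skill's macOS-only playback.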

## Common Patterns

### Pattern 1: Welcome Message
```bash
python3 scripts/generate_tts.py --text "Welcome to the system. 系统欢迎您。" --play --voice-en "en-US-JennyNeural" --voice-zh "zh-CN-XiaoxiaoNeural"
```

### Pattern 2: Scheduled Reminder
```bash
# Create audio file
python3 scripts/generate_tts.py --text "Time for your meeting. 会议时间到了。" --output reminder.mp3

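# Schedule playback every 30 minutes (using cron), keeping existing crontab entries.
# Piping a single line straight into `crontab -` would replace the whole crontab.
(crontab -l 2>/dev/null; echo "*/30 * * * * afplay /path/to/reminder.mp3") | crontab -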
```

### Pattern 3: Multi-language Announcement
```bash
python3 scripts/generate_tts.py --text "Alert: System update in progress. 警报：系统更新进行中。" --play --loops 3 --volume 90
```

## Troubleshooting

### No Audio Output
1. Check system volume: `osascript -e "output volume of (get volume settings)"`
2. Check mute status: `osascript -e "output muted of (get volume settings)"`
3. Test with simple audio: `afplay /System/Library/Sounds/Ping.aiff`

### edge-tts Not Working
1. Verify installation: `python3 -m edge_tts --list-voices`
2. Check internet connection (edge-tts requires online access)
3. Update package: `pip3 install --upgrade edge-tts`

### Audio Quality Issues
1. Use neural voices for best quality
2. Adjust speech rate if too fast/slow
3. Consider SSML for precise control

## Notes

- Edge TTS requires internet connection for voice synthesis
- Local playback uses macOS built-in `afplay` command
- Volume control uses `osascript` to adjust system volume
- For offline use, consider alternative TTS solutions
- Large texts may be split into multiple segments automatically
