IMA Music Generator

Generate voiceovers, narration, and spoken audio for videos, explainers, ads, and social content.

3,891 stars

Best use case

IMA Music Generator is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generate voiceovers, narration, and spoken audio for videos, explainers, ads, and social content.

Teams using IMA Music Generator should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/ima-voice-ai/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/allenfancy-gan/ima-voice-ai/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/ima-voice-ai/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How IMA Music Generator Compares

Feature / AgentIMA Music GeneratorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generate voiceovers, narration, and spoken audio for videos, explainers, ads, and social content.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# IMA Voice AI — Music Generator

**For complete API documentation, security details, all parameters, and Python examples, read `SKILL-DETAIL.md`.**

## Model ID Reference (CRITICAL)

Use **exact model_id** from this table. Do NOT infer from friendly names.

| Friendly Name | model_id | Notes |
|---------------|----------|-------|
| Suno | `sonic` | ✅ Default, full songs with vocals |
| DouBao BGM | `GenBGM` | ⚠️ Instrumental only, background music |
| DouBao Song | `GenSong` | ⚠️ Songs with vocals |

**User input aliases:** BGM/背景音乐/纯音乐 → `GenBGM` · 歌曲/人声/Song → `sonic` or `GenSong` · 默认 → `sonic`

## Music Generation Mode

| User intent | model_id | When to use |
|-------------|----------|-------------|
| Background music, instrumental | `GenBGM` | "做一段BGM" / "纯音乐" / "背景音乐" |
| Song with vocals | `sonic` | "写首歌" / "带人声" / "歌曲" |
| Song (alternative) | `GenSong` | "豆包歌曲" / "GenSong" |

## Model Selection Priority

1. **User preference** (if explicitly stated) → highest priority
2. **Fallback default:** `sonic` (Suno)

| Task | Default Model | model_id | Notes |
|------|--------------|----------|-------|
| General music | Suno | `sonic` | Full songs, vocals |
| Instrumental/BGM | DouBao BGM | `GenBGM` | No vocals |
| Chinese songs | DouBao Song | `GenSong` | Alternative to Suno |

## Script Usage

```bash
# Generate music (default: sonic/Suno)
python3 {baseDir}/scripts/ima_voice_create.py \
  --model-id sonic \
  --prompt "upbeat lo-fi hip hop, 90 BPM, no vocals" \
  --output-json

# List available models
python3 {baseDir}/scripts/ima_voice_create.py --list-models

# Generate BGM
python3 {baseDir}/scripts/ima_voice_create.py \
  --model-id GenBGM \
  --prompt "calm piano background music for meditation" \
  --output-json
```

## Sending Results to User

```python
# ✅ CORRECT: Use remote URL directly for inline audio display
message(action="send", media=audio_url, caption="✅ 音乐生成成功!\n• 模型:[Name]\n• 耗时:[X]s\n• 积分:[N pts]\n\n🔗 原始链接:[url]")

# ❌ WRONG: Never download to local file
```

## UX Protocol (Brief)

1. **Pre-generation:** "🎵 开始生成音乐… 模型:[Name],预计[X~Y]秒,消耗[N]积分"
2. **Progress:** Every 30-60s: "⏳ 正在生成中… [P]%" (cap at 95%)
3. **Success:** Send audio via `media=audio_url` + include link in caption
4. **Failure:** Natural language error + suggest alternative models. See SKILL-DETAIL.md for error translation.

**Never say to users:** script names, API endpoints, attribute_id, technical parameter names. Only: model name · time · credits · result · status.

## Environment

Base URL: `https://api.imastudio.com`
Headers: `Authorization: Bearer $IMA_API_KEY` · `x-app-source: ima_skills` · `x_app_language: en`

## Core Flow

1. `GET /open/v1/product/list?app=ima&platform=web&category=text_to_music` → get `attribute_id`, `credit`, `model_version`
2. `POST /open/v1/tasks/create` → get `task_id`
3. `POST /open/v1/tasks/detail` → poll every 5s until `resource_status==1`

**MANDATORY:** Always query product list first. `attribute_id` is required.

## Defaults and Timeouts

- Task type: `text_to_music` (fixed)
- Poll interval: 5 seconds
- Max poll wait: 8 minutes
- Default model: `sonic` (if `--model-id` omitted)

## Estimated Generation Time

| Model | Estimated Time | Poll Every |
|-------|---------------|------------|
| Suno (sonic) | 60~180s | 5s |
| DouBao BGM (GenBGM) | 30~90s | 5s |
| DouBao Song (GenSong) | 60~120s | 5s |

Related Skills

Invoice Generator

3891
from openclaw/skills

Creates professional invoices in markdown and HTML

Workflow & Productivity

Incident Postmortem Generator

3891
from openclaw/skills

Generate blameless incident postmortems from raw notes, Slack threads, or bullet points.

DevOps & Infrastructure

Partnership Agreement Generator

3891
from openclaw/skills

Generate comprehensive partnership agreements, joint venture frameworks, and strategic alliance documents for B2B relationships.

Legal Documents & Agreements

Employee Onboarding Generator

3891
from openclaw/skills

Build a structured 90-day onboarding plan for any role. Covers pre-boarding, Day 1, Week 1, 30/60/90-day milestones, buddy assignments, and success metrics.

Workflow & Productivity

Employee Handbook Generator

3891
from openclaw/skills

Build a complete, customized employee handbook for your company. Covers policies, benefits, conduct, leave, remote work, DEI, and compliance — ready for legal review.

Content & Documentation

IT Disaster Recovery Plan Generator

3891
from openclaw/skills

Build production-ready disaster recovery plans that actually get followed when things break.

DevOps & Infrastructure

Compliance Audit Generator

3891
from openclaw/skills

Run internal compliance audits against major frameworks without hiring a consultant.

Security

API Documentation Generator

3891
from openclaw/skills

Generate production-ready API documentation from endpoint descriptions. Outputs OpenAPI 3.0, markdown reference docs, and SDK quickstart guides.

Coding & Development

Annual Report Generator

3891
from openclaw/skills

Build a complete annual business report from raw data. Covers financial performance, operational metrics, strategic highlights, and forward-looking guidance.

Workflow & Productivity

daily-report-generator

3891
from openclaw/skills

Automatically generate daily/weekly work reports from git commits, calendar events, and task lists. Use when you need to quickly create professional work reports without manual effort.

Workflow & Productivity

hr-policy-generator

3891
from openclaw/skills

Comprehensive HR policy development covering attendance, time-off, overtime, remote work, and compliance. Generates structured policy documents, legal checklists, exception handling frameworks, and employee communication plans tailored to company size, work arrangement, and jurisdiction.

Workflow & Productivity

hr-policy-generator-cn

3891
from openclaw/skills

综合性 HR 政策设计工具,覆盖考勤、休假、加班、远程办公及合规要求。根据公司规模、办公模式、适用法律等输入,生成完整的政策文档、法律合规清单、例外处理机制及员工沟通方案。

Workflow & Productivity