IMA Studio Music Generation
Generate music from text with IMA Open API. Supports Suno (sonic), DouBao BGM (GenBGM), and DouBao Song (GenSong). Uses only api.imastudio.com.
Best use case
IMA Studio Music Generation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate music from text with IMA Open API. Supports Suno (sonic), DouBao BGM (GenBGM), and DouBao Song (GenSong). Uses only api.imastudio.com.
Teams using IMA Studio Music Generation should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/ima-voice-ai/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How IMA Studio Music Generation Compares
| Feature / Agent | IMA Studio Music Generation | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate music from text with IMA Open API. Supports Suno (sonic), DouBao BGM (GenBGM), and DouBao Song (GenSong). Uses only api.imastudio.com.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
AI Agents for Startups
Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
SKILL.md Source
# IMA Voice AI Creation
## Capability
This skill generates music/audio from text prompts (`text_to_music`) through IMA Open API.
Supported model IDs:
- `sonic` (Suno)
- `GenBGM` (DouBao BGM)
- `GenSong` (DouBao Song)
## Network and Credential Transparency
- API domain used: `https://api.imastudio.com`
- Required key: `IMA_API_KEY` (environment variable)
- The script does not call secondary upload domains.
- The script does not read other skills' files.
## Runtime Rules
1. Always query `/open/v1/product/list` first.
2. Resolve `attribute_id`, `credit`, and latest `model_version` from product list.
3. Create task via `/open/v1/tasks/create`.
4. Poll `/open/v1/tasks/detail` until completed or timeout.
## Defaults and Timeouts
- Task type is fixed to `text_to_music`.
- Poll interval: 5 seconds.
- Max poll wait: 8 minutes.
- If `--model-id` is omitted, default model is `sonic`.
## User Input Mapping
- BGM / instrumental / 背景音乐 / 纯音乐 -> `GenBGM`
- Song / lyrics / 人声 / 歌曲 -> `sonic` or `GenSong`
- If unspecified -> default `sonic`
## Script Invocation
Set key first:
```bash
export IMA_API_KEY="ima_your_key_here"
```
```bash
python3 {baseDir}/scripts/ima_voice_create.py \
--model-id sonic \
--prompt "upbeat lo-fi hip hop, 90 BPM, no vocals" \
--output-json
```
List models:
```bash
python3 {baseDir}/scripts/ima_voice_create.py \
--list-models
```
## Error Handling Policy
- Return user-friendly error summaries in plain language.
- Include practical next step suggestions (retry, switch model, check API key/credits).
- Do not expose raw internal payloads unless debugging is explicitly requested.
## Expected Output
On success, return:
- task id
- result URL
- model id/model name
- credit used
If `--output-json` is enabled, parse JSON from script output.Related Skills
Gym & Fitness Studio Operations
Run a profitable gym or fitness studio. Covers membership pricing, retention, staffing, equipment lifecycle, class scheduling, and growth.
revenue-studio
A revenue-first solofounder studio that watches markets, finds monetizable pain, validates offers, ships narrow products, and compounds commercial memory across launches. Uses massive parallel agent orchestration with 8 layers: Signal Mesh, Extraction, Opportunity Graph, Cofounder Council, Revenue Lab, Build Studio, Launch Loop, and Portfolio Allocator.
webinar-repurpose-studio
把 webinar 内容拆成回放介绍、短文案、社媒提纲、FAQ 和后续邮件。;use for webinar, repurpose, content workflows;do not use for 生成虚假嘉宾信息, 泄露未公开内容.
openrouter-image-generation
Generate or edit images through OpenRouter's multimodal image generation endpoint (`/api/v1/chat/completions`) using OpenRouter-compatible image models. Use for text-to-image or image-to-image requests when the user wants OpenRouter, `OPENROUTER_API_KEY`, model overrides, or provider-specific `image_config` options.
PDF Generation Skill
**Purpose:** Generate professional PDFs from HTML/CSS without whitespace gaps or layout issues.
ohyesai-music
Generate custom music tracks (vocal or instrumental) via OhYesAI asynchronously.
🎵 Play Music Skill
**Controlled music player with pause/resume/stop support**
lead-generation
Lead Generation — Find high-intent buyers in live Twitter, Instagram, and Reddit conversations. Auto-researches your product, generates targeted search queries, and discovers people actively looking for solutions you offer. Social selling and prospecting powered by 1.5B+ indexed posts via Xpoz MCP.
home-music
Control whole-house music scenes combining Spotify playback with Airfoil speaker routing. Quick presets for morning, party, chill modes.
Music Playlist Generation (PlaylistGen)
LLM-powered playlist generation for your local music library that contains music audio files such as mp3, flac, m4a, etc. Integrate this skill with your Agents (e.g., OpenClaw) to enable natural language music discovery and playlist curation through conversation. Point it at your music folder, run the indexer once, and get a natural language playlist generator — accessible via web browser or API.
asmr-music
Provide professional ASMR music recommendations and playback services to help users achieve sleep aid, heart rate regulation, and focus enhancement through scientifically designed audio. Use when user wants to (1) relax, sleep, or fall asleep, (2) regulate heart rate or relieve anxiety, (3) improve focus and concentration for study or work, (4) find background white noise or ambient sounds, (5) practice meditation or deep breathing, (6) set up automatic daily reminders for ASMR listening. Keywords: sleep, relax, meditation, heart rate, anxiety, focus, concentration, white noise, ASMR, background music, calming, stress relief, daily reminder, nightly routine.
🎨 Pollinations Image Generation Skill
A clean, modular, and user-friendly image generation tool powered by [Pollinations.ai](https://pollinations.ai).