IMA Studio Music Generation

Generate music from text with IMA Open API. Supports Suno (sonic), DouBao BGM (GenBGM), and DouBao Song (GenSong). Uses only api.imastudio.com.

3,762 stars

Best use case

IMA Studio Music Generation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using IMA Studio Music Generation can expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

curl -o ~/.claude/skills/ima-voice-ai/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/allenfancy-gan/ima-voice-ai/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/ima-voice-ai/SKILL.md inside your project
  3. Restart your AI agent; it will auto-discover the skill

How IMA Studio Music Generation Compares

| Feature / Agent | IMA Studio Music Generation | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |

Frequently Asked Questions

What does this skill do?

Generate music from text with IMA Open API. Supports Suno (sonic), DouBao BGM (GenBGM), and DouBao Song (GenSong). Uses only api.imastudio.com.

Where can I find the source code?

The source is in the openclaw/skills repository on GitHub, under skills/allenfancy-gan/ima-voice-ai.

SKILL.md Source

# IMA Voice AI Creation

## Capability

This skill generates music/audio from text prompts (`text_to_music`) through IMA Open API.

Supported model IDs:
- `sonic` (Suno)
- `GenBGM` (DouBao BGM)
- `GenSong` (DouBao Song)

## Network and Credential Transparency

- API domain used: `https://api.imastudio.com`
- Required key: `IMA_API_KEY` (environment variable)
- The script does not call secondary upload domains.
- The script does not read other skills' files.
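
The two constraints above (one API domain, one environment variable) could be enforced in code roughly as follows. This is a minimal sketch: the helper names `load_api_key` and `is_allowed_url` are hypothetical and are not taken from the actual script.

```python
import os
from urllib.parse import urlparse

ALLOWED_HOST = "api.imastudio.com"  # the only API domain this skill documents


def load_api_key(env=None):
    """Return IMA_API_KEY from the environment, or raise a readable error."""
    env = os.environ if env is None else env
    key = env.get("IMA_API_KEY", "").strip()
    if not key:
        raise RuntimeError("IMA_API_KEY is not set; export it before running.")
    return key


def is_allowed_url(url):
    """True only for HTTPS URLs whose host is exactly the documented API domain."""
    parts = urlparse(url)
    return parts.scheme == "https" and parts.hostname == ALLOWED_HOST
```

Checking every outgoing URL with a guard like `is_allowed_url` is one way to keep the "no secondary domains" promise verifiable rather than implicit.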

## Runtime Rules

1. Always query `/open/v1/product/list` first.
2. Resolve `attribute_id`, `credit`, and latest `model_version` from product list.
3. Create task via `/open/v1/tasks/create`.
4. Poll `/open/v1/tasks/detail` until completed or timeout.
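
Steps 1 to 3 above might look like the sketch below. Only the endpoint paths and the names `attribute_id`, `credit`, and `model_version` come from this document. The transport function `call_api`, the request payloads, the response layout, and the `task_id` key are all assumptions made for illustration, and polling (step 4) is left out.

```python
def create_music_task(call_api, model_id, prompt):
    """Query the product list first, then create a text_to_music task.

    `call_api(path, payload)` is a hypothetical helper that sends a request
    to https://api.imastudio.com and returns the parsed JSON response.
    """
    # Step 1: always query the product list before anything else.
    # (Payload shape is assumed; the real API may expect other fields.)
    products = call_api("/open/v1/product/list", {"task_type": "text_to_music"})

    # Step 2: resolve attribute_id, credit, and model_version for the model.
    matches = [p for p in products if p.get("model_id") == model_id]
    if not matches:
        raise ValueError(f"model {model_id!r} not found in product list")
    # Assumption: entries are ordered oldest to newest, so the last is "latest".
    product = matches[-1]

    # Step 3: create the task with the resolved values.
    created = call_api("/open/v1/tasks/create", {
        "task_type": "text_to_music",
        "model_id": model_id,
        "model_version": product["model_version"],
        "attribute_id": product["attribute_id"],
        "credit": product["credit"],
        "prompt": prompt,
    })
    return created["task_id"]  # assumed response key
```

Passing the transport in as a parameter keeps the ordering logic testable without network access.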

## Defaults and Timeouts

- Task type is fixed to `text_to_music`.
- Poll interval: 5 seconds.
- Max poll wait: 8 minutes.
- If `--model-id` is omitted, default model is `sonic`.
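
The polling defaults above translate into a loop like the following. It is a sketch under stated assumptions: `fetch_detail` is a hypothetical wrapper around `/open/v1/tasks/detail`, and the `status` values `"completed"` and `"failed"` are guesses, since the response format is not documented here.

```python
import time

POLL_INTERVAL_S = 5      # documented poll interval
MAX_WAIT_S = 8 * 60      # documented maximum wait (8 minutes)


def poll_task(fetch_detail, task_id, sleep=time.sleep, clock=time.monotonic):
    """Poll until the task reaches a final state or the wait budget runs out."""
    deadline = clock() + MAX_WAIT_S
    while True:
        detail = fetch_detail(task_id)
        # Assumed terminal states; adjust to the real API's status values.
        if detail.get("status") in ("completed", "failed"):
            return detail
        # Stop if the next poll would fall after the deadline.
        if clock() + POLL_INTERVAL_S > deadline:
            raise TimeoutError(f"task {task_id} not finished after {MAX_WAIT_S} s")
        sleep(POLL_INTERVAL_S)
```

The `sleep` and `clock` parameters exist only so the loop can be exercised with a fake clock instead of waiting eight real minutes.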

## User Input Mapping

- BGM / instrumental / 背景音乐 (background music) / 纯音乐 (instrumental music) -> `GenBGM`
- Song / lyrics / 人声 (vocals) / 歌曲 (song) -> `sonic` or `GenSong`
- If unspecified -> default `sonic`
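
One way to express this mapping is a simple keyword lookup, sketched below. The function name, the substring matching, the choice to check BGM keywords first, and the `prefer_doubao_song` switch are all assumptions; the document does not say how to choose between `sonic` and `GenSong` for song requests.

```python
BGM_KEYWORDS = ("bgm", "instrumental", "背景音乐", "纯音乐")
SONG_KEYWORDS = ("song", "lyrics", "人声", "歌曲")


def pick_model_id(user_text, prefer_doubao_song=False):
    """Map a free-text request to one of the three supported model IDs."""
    text = user_text.lower()
    # Instrumental requests go to DouBao BGM.
    if any(keyword in text for keyword in BGM_KEYWORDS):
        return "GenBGM"
    # Vocal requests go to Suno unless the caller opts into DouBao Song.
    if any(keyword in text for keyword in SONG_KEYWORDS):
        return "GenSong" if prefer_doubao_song else "sonic"
    # Nothing recognizable: fall back to the documented default.
    return "sonic"
```

For example, `pick_model_id("need some BGM for a vlog")` returns `"GenBGM"`, while a request with no recognizable keyword falls back to `"sonic"`.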

## Script Invocation

Set key first:

```bash
export IMA_API_KEY="ima_your_key_here"
```

Then generate a track from a prompt:

```bash
python3 {baseDir}/scripts/ima_voice_create.py \
  --model-id sonic \
  --prompt "upbeat lo-fi hip hop, 90 BPM, no vocals" \
  --output-json
```

List models:

```bash
python3 {baseDir}/scripts/ima_voice_create.py \
  --list-models
```

## Error Handling Policy

- Return user-friendly error summaries in plain language.
- Include practical next step suggestions (retry, switch model, check API key/credits).
- Do not expose raw internal payloads unless debugging is explicitly requested.
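
The policy above could be implemented as a small lookup from error category to message, as in this sketch. The category names (`auth`, `credits`, `timeout`, `model`) and the function itself are hypothetical; the real script may classify failures differently.

```python
def summarize_error(kind, debug=False, raw=None):
    """Return a plain-language summary with a suggested next step.

    Raw payloads are appended only when debugging is explicitly requested.
    """
    messages = {
        "auth": "The API key was rejected. Check that IMA_API_KEY is set and valid.",
        "credits": "Not enough credits for this model. Top up or switch to another model.",
        "timeout": "The task did not finish in time. Please retry in a moment.",
        "model": "This model is unavailable right now. Try a different model ID.",
    }
    summary = messages.get(kind, "Music generation failed. Please retry.")
    if debug and raw is not None:
        summary += f"\n[debug] {raw}"
    return summary
```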

## Expected Output

On success, return:
- task id
- result URL
- model id/model name
- credit used

If `--output-json` is enabled, parse JSON from script output.
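
Parsing that output might look like the sketch below. It assumes the script prints the JSON object on a single line, possibly after other log lines; that layout and the key names in the sample are hypothetical, not confirmed by this document.

```python
import json


def parse_script_output(stdout):
    """Return the last stdout line that parses as a JSON object, else None."""
    for line in reversed(stdout.splitlines()):
        line = line.strip()
        if not line.startswith("{"):
            continue  # skip log lines and blanks
        try:
            parsed = json.loads(line)
        except json.JSONDecodeError:
            continue  # looked like JSON but was not valid
        if isinstance(parsed, dict):
            return parsed
    return None


# Hypothetical sample output; the real key names may differ.
sample = 'polling task...\n{"task_id": "t-1", "url": "https://example.com/a.mp3"}\n'
result = parse_script_output(sample)
```

Scanning from the end tolerates progress messages printed before the final result.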
