ai-music-generation

Generate AI music and songs with Diffrythm, Tencent Song Generation via inference.sh CLI. Models: Diffrythm (fast song generation), Tencent Song Generation (full songs with vocals). Capabilities: text-to-music, song generation, instrumental, lyrics to song, soundtrack creation. Use for: background music, social media content, game soundtracks, podcasts, royalty-free music. Triggers: music generation, ai music, generate song, ai composer, text to music, song generator, create music with ai, suno alternative, udio alternative, ai song, ai soundtrack, generate soundtrack, ai jingle, music ai, beat generator

25 stars

Best use case

ai-music-generation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generate AI music and songs with Diffrythm, Tencent Song Generation via inference.sh CLI. Models: Diffrythm (fast song generation), Tencent Song Generation (full songs with vocals). Capabilities: text-to-music, song generation, instrumental, lyrics to song, soundtrack creation. Use for: background music, social media content, game soundtracks, podcasts, royalty-free music. Triggers: music generation, ai music, generate song, ai composer, text to music, song generator, create music with ai, suno alternative, udio alternative, ai song, ai soundtrack, generate soundtrack, ai jingle, music ai, beat generator

Teams using ai-music-generation should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/ai-music-generation/SKILL.md --create-dirs "https://raw.githubusercontent.com/ComeOnOliver/skillshub/main/skills/Harmeet10000/skills/ai-music-generation/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/ai-music-generation/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How ai-music-generation Compares

Feature / Agentai-music-generationStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generate AI music and songs with Diffrythm, Tencent Song Generation via inference.sh CLI. Models: Diffrythm (fast song generation), Tencent Song Generation (full songs with vocals). Capabilities: text-to-music, song generation, instrumental, lyrics to song, soundtrack creation. Use for: background music, social media content, game soundtracks, podcasts, royalty-free music. Triggers: music generation, ai music, generate song, ai composer, text to music, song generator, create music with ai, suno alternative, udio alternative, ai song, ai soundtrack, generate soundtrack, ai jingle, music ai, beat generator

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# AI Music Generation

Generate music and songs via [inference.sh](https://inference.sh) CLI.

![AI Music Generation](https://cloud.inference.sh/u/4mg21r6ta37mpaz6ktzwtt8krr/01jz01qvx0gdcyvhvhpfjjb6s4.png)

## Quick Start

```bash
# Install CLI
curl -fsSL https://cli.inference.sh | sh && infsh login

# Generate a song
infsh app run infsh/diffrythm --input '{"prompt": "upbeat electronic dance track"}'
```

> **Install note:** The [install script](https://cli.inference.sh) only detects your OS/architecture, downloads the matching binary from `dist.inference.sh`, and verifies its SHA-256 checksum. No elevated permissions or background processes. [Manual install & verification](https://dist.inference.sh/cli/checksums.txt) available.

## Available Models

| Model | App ID | Best For |
|-------|--------|----------|
| Diffrythm | `infsh/diffrythm` | Fast song generation |
| Tencent Song | `infsh/tencent-song-generation` | Full songs with vocals |

## Browse Audio Apps

```bash
infsh app list --category audio
```

## Examples

### Instrumental Track

```bash
infsh app run infsh/diffrythm --input '{
  "prompt": "cinematic orchestral soundtrack, epic and dramatic"
}'
```

### Song with Vocals

```bash
infsh app sample infsh/tencent-song-generation --save input.json

# Edit input.json:
# {
#   "prompt": "pop song about summer love",
#   "lyrics": "Walking on the beach with you..."
# }

infsh app run infsh/tencent-song-generation --input input.json
```

### Background Music for Video

```bash
infsh app run infsh/diffrythm --input '{
  "prompt": "calm lo-fi hip hop beat, study music, relaxing"
}'
```

### Podcast Intro

```bash
infsh app run infsh/diffrythm --input '{
  "prompt": "short podcast intro jingle, professional, tech themed, 10 seconds"
}'
```

### Game Soundtrack

```bash
infsh app run infsh/diffrythm --input '{
  "prompt": "retro 8-bit video game music, adventure theme, chiptune"
}'
```

## Prompt Tips

**Genre keywords**: pop, rock, electronic, jazz, classical, hip-hop, lo-fi, ambient, orchestral

**Mood keywords**: happy, sad, energetic, calm, dramatic, epic, mysterious, uplifting

**Instrument keywords**: piano, guitar, synth, drums, strings, brass, choir

**Structure keywords**: intro, verse, chorus, bridge, outro, loop

## Use Cases

- **Social Media**: Background music for videos
- **Podcasts**: Intro/outro jingles
- **Games**: Soundtracks and effects
- **Videos**: Background scores
- **Ads**: Commercial jingles
- **Content Creation**: Royalty-free music

## Related Skills

```bash
# Full platform skill (all 150+ apps)
npx skills add inference-sh/skills@inference-sh

# Text-to-speech
npx skills add inference-sh/skills@text-to-speech

# Video generation (add music to videos)
npx skills add inference-sh/skills@ai-video-generation

# Speech-to-text
npx skills add inference-sh/skills@speech-to-text
```

Browse all apps: `infsh app list`

## Documentation

- [Running Apps](https://inference.sh/docs/apps/running) - How to run apps via CLI
- [Content Pipeline Example](https://inference.sh/docs/examples/content-pipeline) - Building media workflows
- [Apps Overview](https://inference.sh/docs/apps/overview) - Understanding the app ecosystem

Related Skills

youtube-music

25
from ComeOnOliver/skillshub

Search and play music tracks on YouTube Music through MCP integration. Use when user wants to search for songs, play music, or discover tracks on YouTube Music platform.

apify-lead-generation

25
from ComeOnOliver/skillshub

Generates B2B/B2C leads by scraping Google Maps, websites, Instagram, TikTok, Facebook, LinkedIn, YouTube, and Google Search. Use when user asks to find leads, prospects, businesses, build lead lists, enrich contacts, or scrape profiles for sales outreach.

skywork-music-maker

25
from ComeOnOliver/skillshub

Create professional music with Mureka AI API — songs, instrumentals, and lyrics from natural language descriptions in any language. Use when users want to generate a song, create a beat or instrumental, write lyrics, clone vocals, upload reference tracks, or do anything related to AI music creation, even casual requests like "make me a chill lo-fi beat".

podcast-generation

25
from ComeOnOliver/skillshub

Generate AI-powered podcast-style audio narratives using Azure OpenAI's GPT Realtime Mini model via WebSocket. Use when building text-to-speech features, audio narrative generation, podcast creation from content, or integrating with Azure OpenAI Realtime API for real audio output. Covers full-stack implementation from React frontend to Python FastAPI backend with WebSocket streaming.

documentation-generation-doc-generate

25
from ComeOnOliver/skillshub

You are a documentation expert specializing in creating comprehensive, maintainable documentation from code. Generate API docs, architecture diagrams, user guides, and technical references using AI-powered analysis and industry best practices.

elevenlabs-music

25
from ComeOnOliver/skillshub

ElevenLabs AI music generation - create original music from text prompts via inference.sh CLI. Capabilities: text-to-music, custom duration up to 10 minutes, genre/mood/instrument control, royalty-free commercial use. Use for: background music, soundtracks, jingles, podcasts, video scores, game audio. Triggers: elevenlabs music, eleven labs music, ai music, generate music, music generation, compose music, ai composer, create song, soundtrack, background music, jingle, elevenlabs compose, music ai

ai-video-generation

25
from ComeOnOliver/skillshub

Generate AI videos with Google Veo, Seedance, Wan, Grok and 40+ models via inference.sh CLI. Models: Veo 3.1, Veo 3, Seedance 1.5 Pro, Wan 2.5, Grok Imagine Video, OmniHuman, Fabric, HunyuanVideo. Capabilities: text-to-video, image-to-video, lipsync, avatar animation, video upscaling, foley sound. Use for: social media videos, marketing content, explainer videos, product demos, AI avatars. Triggers: video generation, ai video, text to video, image to video, veo, animate image, video from image, ai animation, video generator, generate video, t2v, i2v, ai video maker, create video with ai, runway alternative, pika alternative, sora alternative, kling alternative

ai-image-generation

25
from ComeOnOliver/skillshub

Generate AI images with FLUX, Gemini, Grok, Seedream, Reve and 50+ models via inference.sh CLI. Models: FLUX Dev LoRA, FLUX.2 Klein LoRA, Gemini 3 Pro Image, Grok Imagine, Seedream 4.5, Reve, ImagineArt. Capabilities: text-to-image, image-to-image, inpainting, LoRA, image editing, upscaling, text rendering. Use for: AI art, product mockups, concept art, social media graphics, marketing visuals, illustrations. Triggers: flux, image generation, ai image, text to image, stable diffusion, generate image, ai art, midjourney alternative, dall-e alternative, text2img, t2i, image generator, ai picture, create image with ai, generative ai, ai illustration, grok image, gemini image

when-creating-presentations-use-pptx-generation

25
from ComeOnOliver/skillshub

Enterprise-grade PowerPoint deck generation using evidence-based prompting, workflow enforcement, constraint-based design

pptx-generation

25
from ComeOnOliver/skillshub

Enterprise-grade PowerPoint deck generation system using evidence-based prompting techniques, workflow enforcement, and constraint-based design. Use when creating professional presentations (board decks, reports, analyses) requiring consistent visual quality, accessibility compliance, and integration of complex data from multiple sources. Implements html2pptx workflow with spatial layout optimization, validation gates, and multi-chat architecture for 30+ slide decks.

agent-generation

25
from ComeOnOliver/skillshub

This skill provides knowledge for generating effective Claude Code agents tailored to specific projects. It is used internally by the agent-team-creator plugin when analyzing codebases and creating specialized agent teams. Contains templates, best practices, and patterns for writing project-aware agents.

video-generation-skill

25
from ComeOnOliver/skillshub

Design video concepts, scripts, shotlists, transitions, and editing notes for VEO, Gemini, and Nano Banana-based pipelines. Use when turning a marketing idea into concrete video assets.