muapi-media-generation

Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5

3,891 stars

Best use case

muapi-media-generation is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5

Teams using muapi-media-generation should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/muapi-media/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/anil-matcha/muapi-media/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/muapi-media/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How muapi-media-generation Compares

Feature / Agentmuapi-media-generationStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generate AI images, videos, music, and audio from the terminal via muapi.ai — supports 100+ models including Flux, Midjourney v7, Kling 3.0, Veo3, and Suno V5

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# 🎨 MuAPI Media Generation

**Schema-driven generation primitives for images, videos, and audio.**

Generate professional-grade media directly from the terminal using 100+ state-of-the-art AI models. All scripts are powered by `schema_data.json` for dynamic model and endpoint resolution.

## Available Scripts

| Script | Description | Default Model |
| :--- | :--- | :--- |
| `generate-image.sh` | Text-to-image generation | `flux-dev` |
| `generate-video.sh` | Text-to-video generation | `minimax-pro` |
| `image-to-video.sh` | Animate a static image into video | `kling-pro` |
| `create-music.sh` | Music creation, remix, extend, text/video-to-audio | Suno V5 |
| `upload.sh` | Upload local files to CDN for use with other skills | — |

## Quick Start

```bash
# Generate an image
bash generate-image.sh --prompt "a sunset over mountains" --model flux-dev --view

# Generate a video
bash generate-video.sh --prompt "ocean waves at golden hour" --model minimax-pro --view

# Animate an image
bash image-to-video.sh --image-url "https://..." --prompt "camera slowly pans right" --model kling-pro

# Create music
bash create-music.sh --style "lo-fi hip hop" --prompt "chill beats for studying"

# Upload a local file
bash upload.sh --file ./my-image.jpg
```

## Common Flags

All scripts support: `--async`, `--view`, `--json`, `--timeout N`, `--help`

## Requirements

- `MUAPI_KEY` environment variable (set via `core/platform/setup.sh`)
- `curl`, `jq`, `python3`

Related Skills

openclaw-media-gen

3891
from openclaw/skills

Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.

Content & Documentation

media-compress

3891
from openclaw/skills

Compress and convert images and videos using ffmpeg. Use when the user wants to reduce file size, change format, resize, or optimize media files. Handles common formats like JPG, PNG, WebP, MP4, MOV, WebM. Triggers on phrases like "compress image", "compress video", "reduce file size", "convert to webp/mp4", "resize image", "make image smaller", "batch compress", "optimize media".

General Utilities

cliproxy-media

3891
from openclaw/skills

Analyze images (jpg, png, gif, webp) and PDFs via CLIProxyAPI — a Claude Max proxy that routes requests through your subscription at zero extra cost. Use this skill whenever you need to analyze, describe, or extract information from an image or photo ("analyze image", "describe photo", "what is in this picture"), read or summarize a PDF document ("read PDF", "summary of this document"), or process any media file via a CLIProxy-compatible endpoint ("process media via proxy", "cliproxy vision", "cliproxy media"). NEVER use the built-in `image` or `pdf` tools when using CLIProxyAPI — they fall back to direct Anthropic API which requires separate credits. Use this skill instead for all vision and document analysis tasks.

openrouter-image-generation

3891
from openclaw/skills

Generate or edit images through OpenRouter's multimodal image generation endpoint (`/api/v1/chat/completions`) using OpenRouter-compatible image models. Use for text-to-image or image-to-image requests when the user wants OpenRouter, `OPENROUTER_API_KEY`, model overrides, or provider-specific `image_config` options.

social-media-agent

3891
from openclaw/skills

Automated social media manager — plan, write, schedule, and analyze content across X/Twitter, LinkedIn, Instagram, TikTok, Facebook, and Pinterest. Integrates with Buffer (free) or Postiz (self-hosted) for scheduling.

PDF Generation Skill

3891
from openclaw/skills

**Purpose:** Generate professional PDFs from HTML/CSS without whitespace gaps or layout issues.

social-media-content-scraper-pro

3891
from openclaw/skills

Social Media Content Bulk Scraper, extract articles/posts from WeChat, Instagram, TikTok, YouTube, export to Markdown/HTML with full metadata. $0.005 USDT per use.

siliconflow-media

3891
from openclaw/skills

SiliconFlow 多模态服务,支持图片生成(FLUX/Qwen)、视频生成(Wan)、TTS语音合成、ASR语音识别。使用代金券支付。

lead-generation

3891
from openclaw/skills

Lead Generation — Find high-intent buyers in live Twitter, Instagram, and Reddit conversations. Auto-researches your product, generates targeted search queries, and discovers people actively looking for solutions you offer. Social selling and prospecting powered by 1.5B+ indexed posts via Xpoz MCP.

Music Playlist Generation (PlaylistGen)

3891
from openclaw/skills

LLM-powered playlist generation for your local music library that contains music audio files such as mp3, flac, m4a, etc. Integrate this skill with your Agents (e.g., OpenClaw) to enable natural language music discovery and playlist curation through conversation. Point it at your music folder, run the indexer once, and get a natural language playlist generator — accessible via web browser or API.

Macrocosmos SN13 API - Social Media Data Skill

3891
from openclaw/skills

Fetch real-time social media data from X (Twitter) and Reddit by keyword, username, date range, and filters with engagement metrics via Macrocosmos SN13 API on Bittensor.

🎨 Pollinations Image Generation Skill

3891
from openclaw/skills

A clean, modular, and user-friendly image generation tool powered by [Pollinations.ai](https://pollinations.ai).