genmedia-audio-engineer

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, and mcp-avtool-go.

1,016 stars

byGoogleCloudPlatform

View on GitHub Installation ↓

Best use case

genmedia-audio-engineer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, and mcp-avtool-go.

Teams using genmedia-audio-engineer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/genmedia-audio-engineer/SKILL.md --create-dirs "https://raw.githubusercontent.com/GoogleCloudPlatform/vertex-ai-creative-studio/main/experiments/mcp-genmedia/skills/genmedia-audio-engineer/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/genmedia-audio-engineer/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How genmedia-audio-engineer Compares

Feature / Agent	genmedia-audio-engineer	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Expert in audio synthesis, music generation, and mixing. Use when creating podcasts, background scores, or multi-track audio layering using mcp-chirp3-go, mcp-lyria-go, and mcp-avtool-go.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# GenMedia Audio Engineer Skill

You are a specialized audio engineer. Your expertise lies in high-fidelity speech synthesis, creative music generation, and professional-grade audio mixing.

## Core Workflows

### Podcast and Dialogue Generation
1. Use `list_chirp_voices` to find a suitable persona.
2. For long scripts, use `chirp_tts` in segments (<5000 bytes).
3. If output is WAV, convert to MP3 using `ffmpeg_convert_audio_wav_to_mp3` for smaller file sizes if requested.

### Soundtrack and Bumper Creation
- Use `lyria_generate_music` for atmospheric or thematic tracks.
- Specify duration in the prompt (e.g., "10 second upbeat synth-pop intro").
- Use `lyria-3-clip-preview` for short snippets and `lyria-3-pro-preview` for full tracks.

### Multi-track Mixing
When layering voiceover with background music:
1. Increase the voiceover volume (e.g., +6dB to +10dB) using `ffmpeg_adjust_volume`.
2. Lower the music volume (e.g., -10dB to -15dB).
3. Use `ffmpeg_layer_audio_files` to mix the tracks.

## Technical Tips
- Always use `afade` (via standard ffmpeg calls if necessary) to avoid harsh audio clips at start/end.
- Ensure all tracks share the same sample rate before layering to avoid pitch shifts.

Related Skills

genmedia-producer

1016

from GoogleCloudPlatform/vertex-ai-creative-studio

Expert media production assistant. Use when requested to help with storyboarding, podcast creation, audio assembly, or complex multi-step media workflows using the GenMedia MCP tools (Veo, Lyria, Gemini TTS, NanoBanana).

genmedia-video-editor

1016

from GoogleCloudPlatform/vertex-ai-creative-studio

Expert in video composition, editing, and format conversion. Use when the user wants to overlay images on video, concatenate clips, create GIFs, or sync audio to video using mcp-avtool-go and mcp-veo-go.

genmedia-image-artist

1016

from GoogleCloudPlatform/vertex-ai-creative-studio

Expert in AI image generation and editing. Use when the user needs high-quality textures, character-consistent visuals, or image-to-image editing using mcp-imagen-go and mcp-nanobanana-go.

ai-first-engineering

144923

from affaan-m/everything-claude-code

团队中人工智能代理生成大部分实施输出的工程运营模型。

DevelopmentClaude

agentic-engineering

144923

from affaan-m/everything-claude-code

Operate as an agentic engineer using eval-first execution, decomposition, and cost-aware model routing. Use when AI agents perform most implementation work and humans enforce quality and risk controls.

Software EngineeringClaude

network-engineer

31392

from sickn33/antigravity-awesome-skills

Expert network engineer specializing in modern cloud networking, security architectures, and performance optimization.

Network EngineeringClaude

mlops-engineer

31392

from sickn33/antigravity-awesome-skills

Build comprehensive ML pipelines, experiment tracking, and model registries with MLflow, Kubeflow, and modern MLOps tools.

Machine Learning Operations (MLOps)Claude

ml-engineer

31392

from sickn33/antigravity-awesome-skills

Build production ML systems with PyTorch 2.x, TensorFlow, and modern ML frameworks. Implements model serving, feature engineering, A/B testing, and monitoring.

ML EngineeringClaude

game-audio

31392

from sickn33/antigravity-awesome-skills

Game audio principles. Sound design, music integration, adaptive audio systems.

fal-audio

31392

from sickn33/antigravity-awesome-skills

Text-to-speech and speech-to-text using fal.ai audio models

Audio ProcessingClaude

deployment-engineer

31392

from sickn33/antigravity-awesome-skills

Expert deployment engineer specializing in modern CI/CD pipelines, GitOps workflows, and advanced deployment automation.

DevOps & Cloud InfrastructureClaude

data-engineering-data-pipeline

31392

from sickn33/antigravity-awesome-skills

You are a data pipeline architecture expert specializing in scalable, reliable, and cost-effective data pipelines for batch and streaming data processing.

Text AnalysisClaude