imagen
Generate images using Google Gemini's image generation capabilities. Use this skill when the user needs to create, generate, or produce images for any purpose including UI mockups, icons, illustrations, diagrams, concept art, placeholder images, or visual representations.
Best use case
imagen is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using imagen should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/imagen/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
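The manual installation steps can also be scripted. The following is a minimal sketch, assuming SKILL.md has already been downloaded somewhere on disk; `install_skill` is a hypothetical helper name, not part of the skill itself. Only the target path `.claude/skills/imagen/SKILL.md` comes from the steps above.

```python
import shutil
from pathlib import Path


def install_skill(downloaded, project_root):
    """Copy a downloaded SKILL.md into the project's skill directory.

    `install_skill` is a hypothetical helper for illustration; the target
    path is the one named in the manual installation steps.
    """
    target = Path(project_root) / ".claude" / "skills" / "imagen" / "SKILL.md"
    # Create .claude/skills/imagen/ if it does not exist yet.
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.copyfile(str(downloaded), str(target))
    return target
```

Calling `install_skill("Downloads/SKILL.md", ".")` from the project root would place the file where the agent looks for it; the agent still needs a restart afterwards.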
How imagen Compares
| Feature / Agent | imagen | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate images using Google Gemini's image generation capabilities. Use this skill when the user needs to create, generate, or produce images for any purpose including UI mockups, icons, illustrations, diagrams, concept art, placeholder images, or visual representations.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Imagen - AI Image Generation Skill

## Overview

This skill generates images using Google Gemini's image generation model (`gemini-3-pro-image-preview`). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.

**Cross-Platform**: Works on Windows, macOS, and Linux.

## When to Use This Skill

Automatically activate this skill when:

- User requests image generation (e.g., "generate an image of...", "create a picture...")
- Frontend development requires placeholder or actual images
- Documentation needs illustrations or diagrams
- Visualizing concepts, architectures, or ideas
- Creating icons, logos, or UI assets
- Any task where an AI-generated image would be helpful

## How It Works

1. Takes a text prompt describing the desired image
2. Calls Google Gemini API with image generation configuration
3. Saves the generated image to a specified location (defaults to current directory)
4. Returns the file path for use in your project

## Usage

### Python (Cross-Platform - Recommended)

```bash
# Basic usage
python scripts/generate_image.py "A futuristic city skyline at sunset"

# With custom output path
python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"

# With custom size
python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"
```

## Requirements

- `GEMINI_API_KEY` environment variable must be set
- Python 3.6+ (uses standard library only, no pip install needed)

## Output

Generated images are saved as PNG files. The script returns:

- Success: Path to the generated image
- Failure: Error message with details

## Examples

### Frontend Development

```
User: "I need a hero image for my landing page - something abstract and tech-focused"
-> Generates and saves image, provides path for use in HTML/CSS
```

### Documentation

```
User: "Create a diagram showing microservices architecture"
-> Generates visual representation, ready for README or docs
```

### UI Assets

```
User: "Generate a placeholder avatar image for the user profile component"
-> Creates image in appropriate size for component use
```
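The four steps listed under "How It Works" can be sketched with the Python standard library alone. This is an illustrative sketch, not the contents of the skill's `scripts/generate_image.py`. The model name and the `GEMINI_API_KEY` variable come from the SKILL.md text. The REST endpoint, the JSON field names (`responseModalities`, `imageConfig.imageSize`, `inlineData`), the default file name `generated_image.png`, and every function name are assumptions made for illustration and should be checked against the current Gemini API documentation.

```python
import base64
import json
import os
import urllib.request
from pathlib import Path

# Model name is taken from the skill's overview; the endpoint and the JSON
# field names used below are assumptions about the Gemini REST API.
MODEL = "gemini-3-pro-image-preview"
ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    f"{MODEL}:generateContent"
)


def build_request_body(prompt, size=None):
    """Steps 1-2: wrap the text prompt in an image-generation request."""
    config = {"responseModalities": ["TEXT", "IMAGE"]}
    if size:
        # Assumed field backing an option like --size 2K.
        config["imageConfig"] = {"imageSize": size}
    return {
        "contents": [{"parts": [{"text": prompt}]}],
        "generationConfig": config,
    }


def extract_image_bytes(response):
    """Return the first inline image in a response dict, base64-decoded."""
    for candidate in response.get("candidates", []):
        for part in candidate.get("content", {}).get("parts", []):
            inline = part.get("inlineData")
            if inline and "data" in inline:
                return base64.b64decode(inline["data"])
    raise ValueError("response contained no image data")


def save_image(image_bytes, output_path="generated_image.png"):
    """Steps 3-4: write the bytes to disk and return the resulting path."""
    path = Path(output_path)
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_bytes(image_bytes)
    return str(path)


def generate_image(prompt, output_path="generated_image.png", size=None):
    """Run all four steps; needs GEMINI_API_KEY and network access."""
    api_key = os.environ.get("GEMINI_API_KEY")
    if not api_key:
        raise RuntimeError("GEMINI_API_KEY environment variable must be set")
    request = urllib.request.Request(
        ENDPOINT,
        data=json.dumps(build_request_body(prompt, size)).encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )
    with urllib.request.urlopen(request) as reply:
        response = json.load(reply)
    return save_image(extract_image_bytes(response), output_path)
```

Keeping request construction, response parsing, and file writing in separate functions means everything except `generate_image` can be exercised without an API key or a network connection.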
Related Skills
cost-optimization
Optimize cloud costs through resource rightsizing, tagging strategies, reserved instances, and spending analysis. Use when reducing cloud expenses, analyzing infrastructure costs, or implementing c...
copilot-usage-metrics
Retrieve and display GitHub Copilot usage metrics for organizations and enterprises using the GitHub CLI and REST API.
copilot-sdk
Build agentic applications with GitHub Copilot SDK. Use when embedding AI agents in apps, creating custom tools, implementing streaming responses, managing sessions, connecting to MCP servers, or creating custom agents. Triggers on Copilot SDK, GitHub SDK, agentic app, embed Copilot, programmable agent, MCP server, custom agent.
copilot-instructions-blueprint-generator
Technology-agnostic blueprint generator for creating comprehensive copilot-instructions.md files that guide GitHub Copilot to produce code consistent with project standards, architecture patterns, and exact technology versions by analyzing existing codebase patterns and avoiding assumptions.
copilot-docs
Configure GitHub Copilot with custom instructions. Use when setting up .github/copilot-instructions.md, customizing Copilot behavior, or creating repository-specific AI guidance. Triggers on Copilot instructions, copilot-instructions.md, GitHub Copilot config.
conventional-commit
Prompt and workflow for generating conventional commit messages using a structured XML format. Guides users to create standardized, descriptive commit messages in line with the Conventional Commits specification, including instructions, examples, and validation.
contagious
Engineer word-of-mouth and virality using the STEPPS framework (Social Currency, Triggers, Emotion, Public, Practical Value, Stories). Use when the user mentions "go viral", "word of mouth", "shareable content", "social currency", or "why people share". Covers environmental triggers and high-arousal emotional content. For sticky messaging, see made-to-stick. For persuasion tactics, see influence-psychology.
constant-time-analysis
Detects timing side-channel vulnerabilities in cryptographic code. Use when implementing or reviewing crypto code, encountering division on secrets, secret-dependent branches, or constant-time programming questions in C, C++, Go, Rust, Swift, Java, Kotlin, C#, PHP, JavaScript, TypeScript, Python, or Ruby.
connect
Connect Claude to any app. Send emails, create issues, post messages, update databases - take real actions across Gmail, Slack, GitHub, Notion, and 1000+ services.
connect-apps
Connect Claude to external apps like Gmail, Slack, GitHub. Use this skill when the user wants to send emails, create issues, post messages, or take actions in external services.
conductor-revert
Git-aware undo by logical work unit (track, phase, or task)
git:compare-worktrees
Compare files and directories between git worktrees or worktree and current branch