imagen
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Best use case
imagen is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Teams using imagen should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/imagen/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How imagen Compares
| Feature / Agent | imagen | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Imagen - AI Image Generation Skill ## Overview This skill generates images using Google Gemini's image generation model (`gemini-3-pro-image-preview`). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts. **Cross-Platform**: Works on Windows, macOS, and Linux. ## When to Use This Skill Automatically activate this skill when: - User requests image generation (e.g., "generate an image of...", "create a picture...") - Frontend development requires placeholder or actual images - Documentation needs illustrations or diagrams - Visualizing concepts, architectures, or ideas - Creating icons, logos, or UI assets - Any task where an AI-generated image would be helpful ## How It Works 1. Takes a text prompt describing the desired image 2. Calls Google Gemini API with image generation configuration 3. Saves the generated image to a specified location (defaults to current directory) 4. Returns the file path for use in your project ## Usage ### Python (Cross-Platform - Recommended) ```bash # Basic usage python scripts/generate_image.py "A futuristic city skyline at sunset" # With custom output path python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png" # With custom size python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png" ``` ## Requirements - `GEMINI_API_KEY` environment variable must be set - Python 3.6+ (uses standard library only, no pip install needed) ## Output Generated images are saved as PNG files. The script returns: - Success: Path to the generated image - Failure: Error message with details ## Examples ### Frontend Development ``` User: "I need a hero image for my landing page - something abstract and tech-focused" -> Generates and saves image, provides path for use in HTML/CSS ``` ### Documentation ``` User: "Create a diagram showing microservices architecture" -> Generates visual representation, ready for README or docs ``` ### UI Assets ``` User: "Generate a placeholder avatar image for the user profile component" -> Creates image in appropriate size for component use ```
Related Skills
zustand-store-ts
Create Zustand stores with TypeScript, subscribeWithSelector middleware, and proper state/action separation. Use when building React state management, creating global stores, or implementing reacti...
zoom-automation
Automate Zoom meeting creation, management, recordings, webinars, and participant tracking via Rube MCP (Composio). Always search tools first for current schemas.
zoho-crm-automation
Automate Zoho CRM tasks via Rube MCP (Composio): create/update records, search contacts, manage leads, and convert leads. Always search tools first for current schemas.
zendesk-automation
Automate Zendesk tasks via Rube MCP (Composio): tickets, users, organizations, replies. Always search tools first for current schemas.
zapier-make-patterns
No-code automation democratizes workflow building. Zapier and Make (formerly Integromat) let non-developers automate business processes without writing code. But no-code doesn't mean no-complexity ...
youtube-summarizer
Extract transcripts from YouTube videos and generate comprehensive, detailed summaries using intelligent analysis frameworks
youtube-automation
Automate YouTube tasks via Rube MCP (Composio): upload videos, manage playlists, search content, get analytics, and handle comments. Always search tools first for current schemas.
xss-html-injection
This skill should be used when the user asks to "test for XSS vulnerabilities", "perform cross-site scripting attacks", "identify HTML injection flaws", "exploit client-side injection...
xlsx-official
Comprehensive spreadsheet creation, editing, and analysis with support for formulas, formatting, data analysis, and visualization. When Claude needs to work with spreadsheets (.xlsx, .xlsm, .csv, ....
x-twitter-scraper
X (Twitter) data platform skill — tweet search, user lookup, follower extraction, engagement metrics, giveaway draws, monitoring, webhooks, 19 extraction tools, MCP server.
x-article-publisher-skill
Publish articles to X/Twitter
writing-skills
Use when creating, updating, or improving agent skills.