imagen
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Best use case
imagen is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Teams using imagen should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/imagen/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How imagen Compares
| Feature / Agent | imagen | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Imagen - AI Image Generation Skill ## Overview This skill generates images using Google Gemini's image generation model (`gemini-3-pro-image-preview`). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts. **Cross-Platform**: Works on Windows, macOS, and Linux. ## When to Use This Skill Automatically activate this skill when: - User requests image generation (e.g., "generate an image of...", "create a picture...") - Frontend development requires placeholder or actual images - Documentation needs illustrations or diagrams - Visualizing concepts, architectures, or ideas - Creating icons, logos, or UI assets - Any task where an AI-generated image would be helpful ## How It Works 1. Takes a text prompt describing the desired image 2. Calls Google Gemini API with image generation configuration 3. Saves the generated image to a specified location (defaults to current directory) 4. Returns the file path for use in your project ## Usage ### Python (Cross-Platform - Recommended) ```bash # Basic usage python scripts/generate_image.py "A futuristic city skyline at sunset" # With custom output path python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png" # With custom size python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png" ``` ## Requirements - `GEMINI_API_KEY` environment variable must be set - Python 3.6+ (uses standard library only, no pip install needed) ## Output Generated images are saved as PNG files. The script returns: - Success: Path to the generated image - Failure: Error message with details ## Examples ### Frontend Development ``` User: "I need a hero image for my landing page - something abstract and tech-focused" -> Generates and saves image, provides path for use in HTML/CSS ``` ### Documentation ``` User: "Create a diagram showing microservices architecture" -> Generates visual representation, ready for README or docs ``` ### UI Assets ``` User: "Generate a placeholder avatar image for the user profile component" -> Creates image in appropriate size for component use ```
Related Skills
bgo
Automates the complete Blender build-go workflow, from building and packaging your extension/add-on to removing old versions, installing, enabling, and launching Blender for quick testing and iteration.
humanizer-ko
Detects and corrects Korean AI writing patterns to transform text into natural human writing. Based on scientific linguistic research (KatFishNet paper with 94.88% AUC accuracy). Analyzes 19 patterns including comma overuse, spacing rigidity, POS diversity, AI vocabulary overuse, and structural monotony. Use when humanizing Korean text from ChatGPT/Claude/Gemini or removing AI traces from Korean LLM output.
huggingface-accelerate
Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/FSDP/Megatron/DDP. Automatic device placement, mixed precision (FP16/BF16/FP8). Interactive config, single launch command. HuggingFace ecosystem standard.
hr-pro
Professional, ethical HR partner for hiring, onboarding/offboarding, PTO and leave, performance, compliant policies, and employee relations. Ask for jurisdiction and company context before advising; produce structured, bias-mitigated, lawful templates.
hive-mind-advanced
Advanced Hive Mind collective intelligence system for queen-led multi-agent coordination with consensus mechanisms and persistent memory
hire
Interactive hiring wizard to set up a new AI team member. Guides the user through role design via conversation, generates agent identity files, and optionally sets up performance reviews. Use when the user wants to hire, add, or set up a new AI agent, team member, or assistant. Triggers on phrases like "hire", "add an agent", "I need help with X" (implying a new role), or "/hire".
hic-tad-calling
This skill should be used when users need to identify topologically associating domains (TADs) from Hi-C data in .mcools (or .cool) files or when users want to visualize the TAD in target genome loci. It provides workflows for TAD calling and visualization.
helix-memory
Long-term memory system for Claude Code using HelixDB graph-vector database. Store and retrieve facts, preferences, context, and relationships across sessions using semantic search, reasoning chains, and time-window filtering.
heath-ledger
AI bookkeeping agent for Mercury bank accounts. Pulls transactions, categorizes them (rule-based + AI), and generates Excel workbooks with P&L, Balance Sheet, Cash Flow, and transaction detail. Use when the user wants to do bookkeeping, generate financial statements, categorize bank transactions, connect Mercury, or produce monthly/quarterly/annual books. Triggers on: bookkeeping, P&L, profit and loss, balance sheet, cash flow, financial statements, Mercury bank, categorize transactions, generate books, monthly close.
health-chat
Unified health conversation entry point - automatically loads all health data for each conversation, supports natural language queries, and intelligently routes to appropriate health data processing
hackernews
Comprehensive toolkit for fetching, searching, analyzing, and monitoring Hacker News content. Use when Claude needs to interact with Hacker News for (1) Fetching top/new/best/ask/show/job stories, (2) Searching for specific topics or keywords, (3) Monitoring specific users or tracking their activity, (4) Analyzing trending topics and patterns, (5) Getting story details, comments, or user profiles, or (6) Any other task involving Hacker News data retrieval or analysis.
GSTD A2A Network
Decentralized Agent-to-Agent Autonomous Economy. Connects hardware and agents for distributed compute, hive memory access, and economic settlement.