imagen
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
About this skill
This skill empowers Claude AI agents with robust image generation capabilities, leveraging Google Gemini's advanced `gemini-3-pro-image-preview` model. It enables seamless visual content creation directly within any Claude Code session, serving a wide array of purposes from generating UI placeholders and design assets to creating illustrative images for documentation and conceptual visualizations. The skill is designed for cross-platform compatibility (Windows, macOS, and Linux) and integrates into the "antigravity-awesome-skills" collection, known for enhancing AI agent functionalities with specialized tools and integrations. It automatically activates when a user requests image generation.
Best use case
Generating UI placeholders, mock-ups, and design assets for web and application development. Creating illustrative images and diagrams for technical documentation, presentations, and tutorials. Visualizing abstract concepts or ideas during brainstorming and design processes. Producing quick visual representations for marketing materials, social media, or blog posts. Supporting visual content needs within any Claude Code session requiring on-demand imagery.
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
The agent will return one or more AI-generated images that visually represent the user's prompt or the agent's interpreted need, along with a confirmation of image creation and potentially a URL or embeddable format of the image.
Practical example
Example input
Generate an image of a minimalist login screen with a dark theme, showing placeholders for username and password fields, and a 'Sign In' button.
Example output
Here is a minimalist login screen with a dark theme, including placeholders for username and password fields, and a 'Sign In' button. [Image URL or description of the generated image]
When to use this skill
- When the user explicitly requests an image (e.g., "generate an image of a futuristic city").
- When the agent identifies a need for visual content to complement text-based output (e.g., to illustrate a concept).
- During UI/UX design workflows that require placeholder images or visual mock-ups.
- When drafting documentation or educational materials that would benefit from visual aids.
When not to use this skill
- When high-fidelity, production-ready artwork is required that demands specific artistic style, brand adherence, or human oversight.
- When the image content is sensitive, requires strict factual accuracy, or involves specific copyrighted material (as AI models can sometimes produce unexpected or inaccurate visuals).
- For tasks requiring human artistic direction, subjective aesthetic judgments, or emotional nuance that generative AI may not fully capture.
- If the user explicitly states they do not wish for an AI-generated image or prefers to provide their own visuals.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/imagen/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How imagen Compares
| Feature / Agent | imagen | Standard Approach |
|---|---|---|
| Platform Support | Claude | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | easy | N/A |
Frequently Asked Questions
What does this skill do?
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.
Which AI agents support this skill?
This skill is designed for Claude.
How difficult is it to install?
The installation complexity is rated as easy. You can find the installation instructions above.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
ChatGPT vs Claude for Agent Skills
Compare ChatGPT and Claude for AI agent skills across coding, writing, research, and reusable workflow execution.
SKILL.md Source
# Imagen - AI Image Generation Skill ## Overview This skill generates images using Google Gemini's image generation model (`gemini-3-pro-image-preview`). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts. **Cross-Platform**: Works on Windows, macOS, and Linux. ## When to Use This Skill Automatically activate this skill when: - User requests image generation (e.g., "generate an image of...", "create a picture...") - Frontend development requires placeholder or actual images - Documentation needs illustrations or diagrams - Visualizing concepts, architectures, or ideas - Creating icons, logos, or UI assets - Any task where an AI-generated image would be helpful ## How It Works 1. Takes a text prompt describing the desired image 2. Calls Google Gemini API with image generation configuration 3. Saves the generated image to a specified location (defaults to current directory) 4. Returns the file path for use in your project ## Usage ### Python (Cross-Platform - Recommended) ```bash # Basic usage python scripts/generate_image.py "A futuristic city skyline at sunset" # With custom output path python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png" # With custom size python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png" ``` ## Requirements - `GEMINI_API_KEY` environment variable must be set - Python 3.6+ (uses standard library only, no pip install needed) ## Output Generated images are saved as PNG files. The script returns: - Success: Path to the generated image - Failure: Error message with details ## Examples ### Frontend Development ``` User: "I need a hero image for my landing page - something abstract and tech-focused" -> Generates and saves image, provides path for use in HTML/CSS ``` ### Documentation ``` User: "Create a diagram showing microservices architecture" -> Generates visual representation, ready for README or docs ``` ### UI Assets ``` User: "Generate a placeholder avatar image for the user profile component" -> Creates image in appropriate size for component use ```
Related Skills
image-studio
Studio de geracao de imagens inteligente — roteamento automatico entre ai-studio-image (fotos humanizadas/influencer) e stability-ai (arte/ ilustracao/edicao). Detecta o tipo de imagem solicitada e escolhe o modelo ideal automaticamente.
comfyui-gateway
REST API gateway for ComfyUI servers. Workflow management, job queuing, webhooks, caching, auth, rate limiting, and image delivery (URL + base64).
nft-standards
Master ERC-721 and ERC-1155 NFT standards, metadata best practices, and advanced NFT features.
nextjs-app-router-patterns
Comprehensive patterns for Next.js 14+ App Router architecture, Server Components, and modern full-stack React development.
new-rails-project
Create a new Rails project
networkx
NetworkX is a Python package for creating, manipulating, and analyzing complex networks and graphs.
network-engineer
Expert network engineer specializing in modern cloud networking, security architectures, and performance optimization.
nestjs-expert
You are an expert in Nest.js with deep knowledge of enterprise-grade Node.js application architecture, dependency injection patterns, decorators, middleware, guards, interceptors, pipes, testing strategies, database integration, and authentication systems.
nerdzao-elite
Senior Elite Software Engineer (15+) and Senior Product Designer. Full workflow with planning, architecture, TDD, clean code, and pixel-perfect UX validation.
nerdzao-elite-gemini-high
Modo Elite Coder + UX Pixel-Perfect otimizado especificamente para Gemini 3.1 Pro High. Workflow completo com foco em qualidade máxima e eficiência de tokens.
native-data-fetching
Use when implementing or debugging ANY network request, API call, or data fetching. Covers fetch API, React Query, SWR, error handling, caching, offline support, and Expo Router data loaders (useLoaderData).
n8n-workflow-patterns
Proven architectural patterns for building n8n workflows.