imagen

AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.

31,392 stars
Complexity: easy

About this skill

This skill empowers Claude AI agents with robust image generation capabilities, leveraging Google Gemini's advanced `gemini-3-pro-image-preview` model. It enables seamless visual content creation directly within any Claude Code session, serving a wide array of purposes from generating UI placeholders and design assets to creating illustrative images for documentation and conceptual visualizations. The skill is designed for cross-platform compatibility (Windows, macOS, and Linux) and integrates into the "antigravity-awesome-skills" collection, known for enhancing AI agent functionalities with specialized tools and integrations. It automatically activates when a user requests image generation.

Best use case

Generating UI placeholders, mock-ups, and design assets for web and application development. Creating illustrative images and diagrams for technical documentation, presentations, and tutorials. Visualizing abstract concepts or ideas during brainstorming and design processes. Producing quick visual representations for marketing materials, social media, or blog posts. Supporting visual content needs within any Claude Code session requiring on-demand imagery.

AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.

The agent will return one or more AI-generated images that visually represent the user's prompt or the agent's interpreted need, along with a confirmation of image creation and potentially a URL or embeddable format of the image.

Practical example

Example input

Generate an image of a minimalist login screen with a dark theme, showing placeholders for username and password fields, and a 'Sign In' button.

Example output

Here is a minimalist login screen with a dark theme, including placeholders for username and password fields, and a 'Sign In' button. [Image URL or description of the generated image]

When to use this skill

  • When the user explicitly requests an image (e.g., "generate an image of a futuristic city").
  • When the agent identifies a need for visual content to complement text-based output (e.g., to illustrate a concept).
  • During UI/UX design workflows that require placeholder images or visual mock-ups.
  • When drafting documentation or educational materials that would benefit from visual aids.

When not to use this skill

  • When high-fidelity, production-ready artwork is required that demands specific artistic style, brand adherence, or human oversight.
  • When the image content is sensitive, requires strict factual accuracy, or involves specific copyrighted material (as AI models can sometimes produce unexpected or inaccurate visuals).
  • For tasks requiring human artistic direction, subjective aesthetic judgments, or emotional nuance that generative AI may not fully capture.
  • If the user explicitly states they do not wish for an AI-generated image or prefers to provide their own visuals.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/imagen/SKILL.md --create-dirs "https://raw.githubusercontent.com/sickn33/antigravity-awesome-skills/main/plugins/antigravity-awesome-skills-claude/skills/imagen/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/imagen/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How imagen Compares

Feature / AgentimagenStandard Approach
Platform SupportClaudeLimited / Varies
Context Awareness High Baseline
Installation ComplexityeasyN/A

Frequently Asked Questions

What does this skill do?

AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.

Which AI agents support this skill?

This skill is designed for Claude.

How difficult is it to install?

The installation complexity is rated as easy. You can find the installation instructions above.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Imagen - AI Image Generation Skill

## Overview

This skill generates images using Google Gemini's image generation model (`gemini-3-pro-image-preview`). It enables seamless image creation during any Claude Code session - whether you're building frontend UIs, creating documentation, or need visual representations of concepts.

**Cross-Platform**: Works on Windows, macOS, and Linux.

## When to Use This Skill

Automatically activate this skill when:
- User requests image generation (e.g., "generate an image of...", "create a picture...")
- Frontend development requires placeholder or actual images
- Documentation needs illustrations or diagrams
- Visualizing concepts, architectures, or ideas
- Creating icons, logos, or UI assets
- Any task where an AI-generated image would be helpful

## How It Works

1. Takes a text prompt describing the desired image
2. Calls Google Gemini API with image generation configuration
3. Saves the generated image to a specified location (defaults to current directory)
4. Returns the file path for use in your project

## Usage

### Python (Cross-Platform - Recommended)

```bash
# Basic usage
python scripts/generate_image.py "A futuristic city skyline at sunset"

# With custom output path
python scripts/generate_image.py "A minimalist app icon for a music player" "./assets/icons/music-icon.png"

# With custom size
python scripts/generate_image.py --size 2K "High resolution landscape" "./wallpaper.png"
```

## Requirements

- `GEMINI_API_KEY` environment variable must be set
- Python 3.6+ (uses standard library only, no pip install needed)

## Output

Generated images are saved as PNG files. The script returns:
- Success: Path to the generated image
- Failure: Error message with details

## Examples

### Frontend Development
```
User: "I need a hero image for my landing page - something abstract and tech-focused"
-> Generates and saves image, provides path for use in HTML/CSS
```

### Documentation
```
User: "Create a diagram showing microservices architecture"
-> Generates visual representation, ready for README or docs
```

### UI Assets
```
User: "Generate a placeholder avatar image for the user profile component"
-> Creates image in appropriate size for component use
```

Related Skills

image-studio

31392
from sickn33/antigravity-awesome-skills

Studio de geracao de imagens inteligente — roteamento automatico entre ai-studio-image (fotos humanizadas/influencer) e stability-ai (arte/ ilustracao/edicao). Detecta o tipo de imagem solicitada e escolhe o modelo ideal automaticamente.

Image GenerationClaudeCursorGemini

comfyui-gateway

31392
from sickn33/antigravity-awesome-skills

REST API gateway for ComfyUI servers. Workflow management, job queuing, webhooks, caching, auth, rate limiting, and image delivery (URL + base64).

Image GenerationClaudeCursorGemini

nft-standards

31392
from sickn33/antigravity-awesome-skills

Master ERC-721 and ERC-1155 NFT standards, metadata best practices, and advanced NFT features.

Web3 & BlockchainClaude

nextjs-app-router-patterns

31392
from sickn33/antigravity-awesome-skills

Comprehensive patterns for Next.js 14+ App Router architecture, Server Components, and modern full-stack React development.

Web FrameworksClaude

new-rails-project

31392
from sickn33/antigravity-awesome-skills

Create a new Rails project

Code GenerationClaude

networkx

31392
from sickn33/antigravity-awesome-skills

NetworkX is a Python package for creating, manipulating, and analyzing complex networks and graphs.

Network AnalysisClaude

network-engineer

31392
from sickn33/antigravity-awesome-skills

Expert network engineer specializing in modern cloud networking, security architectures, and performance optimization.

Network EngineeringClaude

nestjs-expert

31392
from sickn33/antigravity-awesome-skills

You are an expert in Nest.js with deep knowledge of enterprise-grade Node.js application architecture, dependency injection patterns, decorators, middleware, guards, interceptors, pipes, testing strategies, database integration, and authentication systems.

Frameworks & LibrariesClaude

nerdzao-elite

31392
from sickn33/antigravity-awesome-skills

Senior Elite Software Engineer (15+) and Senior Product Designer. Full workflow with planning, architecture, TDD, clean code, and pixel-perfect UX validation.

Software DevelopmentClaude

nerdzao-elite-gemini-high

31392
from sickn33/antigravity-awesome-skills

Modo Elite Coder + UX Pixel-Perfect otimizado especificamente para Gemini 3.1 Pro High. Workflow completo com foco em qualidade máxima e eficiência de tokens.

Software DevelopmentClaudeGemini

native-data-fetching

31392
from sickn33/antigravity-awesome-skills

Use when implementing or debugging ANY network request, API call, or data fetching. Covers fetch API, React Query, SWR, error handling, caching, offline support, and Expo Router data loaders (useLoaderData).

API IntegrationClaude

n8n-workflow-patterns

31392
from sickn33/antigravity-awesome-skills

Proven architectural patterns for building n8n workflows.

Workflow AutomationClaude