nano-image-generate
Generate images using Nano Banana (Flash) or Nano Banana Pro. Use 'flash' for speed/efficiency and 'pro' for high quality, text rendering, and complex prompt adherence. Triggers include 'generate image', 'create logo', 'fast image', 'high quality image'.
Best use case
nano-image-generate is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate images using Nano Banana (Flash) or Nano Banana Pro. Use 'flash' for speed/efficiency and 'pro' for high quality, text rendering, and complex prompt adherence. Triggers include 'generate image', 'create logo', 'fast image', 'high quality image'.
Teams using nano-image-generate should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/nano-image-generate/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How nano-image-generate Compares
| Feature / Agent | nano-image-generate | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate images using Nano Banana (Flash) or Nano Banana Pro. Use 'flash' for speed/efficiency and 'pro' for high quality, text rendering, and complex prompt adherence. Triggers include 'generate image', 'create logo', 'fast image', 'high quality image'.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Nano Image Generator Generate images using Nano Banana Pro (Gemini 3 Pro Preview) for any visual asset needs. Supports **reference images** for style transfer and character consistency. ## Quick Start > [!IMPORTANT] > **Path Check**: Before running, localize the `scripts/generate_image.py` file. If running this skill from another project, use the absolute path to the script or copy it to your project. ```bash # Basic generation (saves to ./generated/) # Ensure you are running this from the directory containing 'scripts/' OR provide full path python scripts/generate_image.py "A friendly robot mascot waving" # With style reference (keep same visual style) python scripts/generate_image.py "Same style, new content" --ref ./reference.jpg --output ./new.png ``` ## Model Selection Choose the model based on user intent: | Model | ID | Use Case | Keywords | |-------|----|----------|----------| | **Pro** | `gemini-3-pro-image-preview` | **Quality & Detail**. Default choice. Best for text, complex prompts, logos, and high-fidelity assets. | "high quality", "logo", "text", "detail", "complex" | | **Flash** | `gemini-2.5-flash-image` | **Speed & Efficiency**. Best for rapid prototyping, large batches, or when low latency is critical. | "fast", "quick", "draft", "rapid" | ## Script Usage ```bash python scripts/generate_image.py <prompt> --output <path> [options] ``` **Required:** - `prompt` - Image description **Options:** - `--output, -o` - Output file path. Optional. Defaults to `./generated/<slug>.png`. - `--model, -m` - Model selection: `pro` (default) or `flash` - `--key, -k` - Gemini API Key (optional) - `--aspect, -a` - Aspect ratio (default: `1:1`) - `--size, -s` - Resolution: `1K`, `2K` (default), `4K` - `--ref, -r` - Reference image (max 14) ## Examples **High Quality Logo (Pro - Default):** ```bash python scripts/generate_image.py "Futuristic company logo with text 'NANO'" --output ./logo.png ``` **Quick Prototype (Flash):** ```bash python scripts/generate_image.py "Draft sketch of a cat" --model flash --output ./cat_draft.png ``` ## Reference Images Gemini supports up to **14 reference images** for: ### Style Transfer Keep the visual style (colors, textures, mood) from a reference: ```bash python scripts/generate_image.py "New scene with mountains, same visual style as reference" \ --ref ./style-reference.jpg --output ./styled-mountains.png ``` ### Character Consistency Maintain character appearance across multiple images: ```bash python scripts/generate_image.py "Same character now in a forest setting" \ --ref ./character.png --output ./character-forest.png ``` ### Multi-Image Fusion Combine elements from multiple references: ```bash python scripts/generate_image.py "Combine the style of first image with subject of second" \ --ref ./style.png --ref ./subject.png --output ./combined.png ``` ### Serial Image Generation (Batch Workflow) For generating a series with consistent style: 1. Generate first image 2. Use first image as `--ref` for subsequent images 3. Each new image inherits the established style ```bash # Generate cover python scripts/generate_image.py "Tech knowledge card cover" -o ./01-cover.png # Generate subsequent cards with style reference python scripts/generate_image.py "Card 2 content, same style" --ref ./01-cover.png -o ./02-card.png python scripts/generate_image.py "Card 3 content, same style" --ref ./01-cover.png -o ./03-card.png ``` ## Directory Structure To keep things organized, this skill uses a standard directory structure: - `./generated/` - **Default output location**. Images are automatically saved here if no path is provided. - `./assets/` - Recommended for storing reference images. ## Workflow 1. **Verify Output Directory & Script Path**: - Ensure the output directory exists (e.g., `mkdir -p generated`) or is writable. - Confirm the location of `generate_image.py`. 2. **Craft effective prompts** - Be specific and descriptive: - Include style: "flat design", "3D rendered", "watercolor", "minimalist" - Include context: "for a mobile app", "website hero image" - Include details: colors, mood, composition - For references: mention "same style as reference" or "keep the visual style" 3. **Choose appropriate settings:** - Icons/logos → `--aspect 1:1` - Banners/headers → `--aspect 16:9` or `21:9` - Mobile screens → `--aspect 9:16` - Xiaohongshu cards → `--aspect 3:4` - Photos → `--aspect 3:2` or `4:3` ## Examples **App icon:** ```bash python scripts/generate_image.py "Minimalist flat design app icon of a lightning bolt, purple gradient background, modern iOS style" \ --output ./assets/app-icon.png --aspect 1:1 ``` **Marketing banner:** ```bash python scripts/generate_image.py "Professional website hero banner for a productivity app, abstract geometric shapes, blue and white color scheme" \ --output ./public/images/hero-banner.png --aspect 16:9 ``` **Xiaohongshu knowledge card:** ```bash python scripts/generate_image.py "Tech knowledge card, dark blue purple gradient, neon cyan accents, code block style, Chinese text '标题'" \ --output ./xiaohongshu/card.png --aspect 3:4 ``` **Style transfer:** ```bash python scripts/generate_image.py "Transform this photo into watercolor painting style" \ --ref ./photo.jpg --output ./watercolor.png ``` **Character in new scene:** ```bash python scripts/generate_image.py "Same character from reference, now sitting in a cafe, warm lighting" \ --ref ./character.png --output ./character-cafe.png --aspect 3:2 ``` ## Prompt Tips - **Be specific** - "A red apple on a wooden table" vs "an apple" - **Include style** - "in the style of pixel art" or "photorealistic" - **Mention purpose** - "for a children's book" affects the output style - **Describe composition** - "centered", "rule of thirds", "close-up" - **Specify colors** - Explicit color palettes yield better results - **Reference prompts** - Use "same style as reference", "keep the visual aesthetic", "match the color palette" - **Avoid** - Don't ask for complex text in images (use overlays instead) ## Limitations - Maximum 14 reference images per request - Text rendering may be imperfect (better to overlay text separately) - Very specific brand logos may not reproduce exactly
Related Skills
generate-llms
Generate llms.txt and llms-full.txt files for AI agent consumption following the llmstxt.org standard. Use when updating site content that should be reflected in the llms files, or when building/deploying the site.
all-images-ai-automation
Automate All Images AI tasks via Rube MCP (Composio). Always search tools first for current schemas.
ai-image-generator
使用 ModelScope 等平台生成 AI 图像。当用户需要生成图像、设计图标、创建角色立绘,或需要帮助编写 AI 绘画提示词时使用此技能。支持直接生成图像和仅优化提示词两种模式。
xhs-images
Xiaohongshu (Little Red Book) infographic series generator with multiple style options. Breaks down content into 1-10 cartoon-style infographics. Use when user asks to create "小红书图片", "XHS images", or "RedNote infographics".
x-image-cards
Create X/Twitter cards that look like images, not marketing banners. Use when asked to "create OG images", "set up X cards", "make social cards", or "twitter card without text".
wiro-image-fill
Generate missing or placeholder images in a project by calling the Wiro image generation API, saving assets under public/assets generated folders, and producing a JSON mapping. Use when you see empty img src, placeholder.png, or other image gaps that need real assets.
seedream-image-generator
Generate images using the Doubao SeeDream API based on text prompts. Use this skill when users request AI-generated images, artwork, illustrations, or visual content creation. The skill handles API calls, downloads generated images to the project's /pic folder, and supports batch generation of up to 4 sequential images.
placeholder-images
Rule to use placekitten.com for placeholder images in seed data.
og-image-generator
Generate and optimize Open Graph meta images for social media sharing. Use this skill when building web applications that need dynamic OG image generation with support for Vercel's @vercel/og library, pre-generated image storage, and social media optimization (Twitter Cards, Facebook, LinkedIn). Handles dynamic routes, performance optimization, and includes best practices for crawler compatibility and testing.
nanobanana-ppt-skills
AI-powered PPT generation with document analysis and styled images
nanobanana-image
Nano Banana (Google Gemini API) を使って画像を生成・編集するスキル。「画像を生成して」「イラストを作って」「○○の絵を描いて」「画像を作成」「この画像を編集して」「この画像をもとに○○を作って」「generate an image」「create a picture」「edit this image」などの依頼があった場合に使用。テキストからの生成、参照画像からの生成、画像編集、Google検索グラウンディングによる最新情報を反映した画像生成に対応。「最新の○○」「トレンドを反映」「リアルタイム情報」といった依頼にも対応可能。
imagen
AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.