nano-image-generate

Generate images using Nano Banana (Flash) or Nano Banana Pro. Use 'flash' for speed/efficiency and 'pro' for high quality, text rendering, and complex prompt adherence. Triggers include 'generate image', 'create logo', 'fast image', 'high quality image'.

16 stars

bydiegosouzapw

View on GitHub Installation ↓

Best use case

nano-image-generate is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using nano-image-generate should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/nano-image-generate/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/content-media/nano-image-generate/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/nano-image-generate/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How nano-image-generate Compares

Feature / Agent	nano-image-generate	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Nano Image Generator

Generate images using Nano Banana Pro (Gemini 3 Pro Preview) for any visual asset needs. Supports **reference images** for style transfer and character consistency.

## Quick Start

> [!IMPORTANT]
> **Path Check**: Before running, localize the `scripts/generate_image.py` file. If running this skill from another project, use the absolute path to the script or copy it to your project.

```bash
# Basic generation (saves to ./generated/)
# Ensure you are running this from the directory containing 'scripts/' OR provide full path
python scripts/generate_image.py "A friendly robot mascot waving"

# With style reference (keep same visual style)
python scripts/generate_image.py "Same style, new content" --ref ./reference.jpg --output ./new.png
```

## Model Selection

Choose the model based on user intent:

| Model | ID | Use Case | Keywords |
|-------|----|----------|----------|
| **Pro** | `gemini-3-pro-image-preview` | **Quality & Detail**. Default choice. Best for text, complex prompts, logos, and high-fidelity assets. | "high quality", "logo", "text", "detail", "complex" |
| **Flash** | `gemini-2.5-flash-image` | **Speed & Efficiency**. Best for rapid prototyping, large batches, or when low latency is critical. | "fast", "quick", "draft", "rapid" |

## Script Usage

```bash
python scripts/generate_image.py <prompt> --output <path> [options]
```

**Required:**
- `prompt` - Image description

**Options:**
- `--output, -o` - Output file path. Optional. Defaults to `./generated/<slug>.png`.
- `--model, -m` - Model selection: `pro` (default) or `flash`
- `--key, -k` - Gemini API Key (optional)
- `--aspect, -a` - Aspect ratio (default: `1:1`)
- `--size, -s` - Resolution: `1K`, `2K` (default), `4K`
- `--ref, -r` - Reference image (max 14)

## Examples

**High Quality Logo (Pro - Default):**
```bash
python scripts/generate_image.py "Futuristic company logo with text 'NANO'" --output ./logo.png
```

**Quick Prototype (Flash):**
```bash
python scripts/generate_image.py "Draft sketch of a cat" --model flash --output ./cat_draft.png
```

## Reference Images

Gemini supports up to **14 reference images** for:

### Style Transfer
Keep the visual style (colors, textures, mood) from a reference:
```bash
python scripts/generate_image.py "New scene with mountains, same visual style as reference" \
  --ref ./style-reference.jpg --output ./styled-mountains.png
```

### Character Consistency
Maintain character appearance across multiple images:
```bash
python scripts/generate_image.py "Same character now in a forest setting" \
  --ref ./character.png --output ./character-forest.png
```

### Multi-Image Fusion
Combine elements from multiple references:
```bash
python scripts/generate_image.py "Combine the style of first image with subject of second" \
  --ref ./style.png --ref ./subject.png --output ./combined.png
```

### Serial Image Generation (Batch Workflow)
For generating a series with consistent style:
1. Generate first image
2. Use first image as `--ref` for subsequent images
3. Each new image inherits the established style

```bash
# Generate cover
python scripts/generate_image.py "Tech knowledge card cover" -o ./01-cover.png

# Generate subsequent cards with style reference
python scripts/generate_image.py "Card 2 content, same style" --ref ./01-cover.png -o ./02-card.png
python scripts/generate_image.py "Card 3 content, same style" --ref ./01-cover.png -o ./03-card.png
```

## Directory Structure

To keep things organized, this skill uses a standard directory structure:

- `./generated/` - **Default output location**. Images are automatically saved here if no path is provided.
- `./assets/` - Recommended for storing reference images.

## Workflow

1. **Verify Output Directory & Script Path**:
   - Ensure the output directory exists (e.g., `mkdir -p generated`) or is writable.
   - Confirm the location of `generate_image.py`.

2. **Craft effective prompts** - Be specific and descriptive:
   - Include style: "flat design", "3D rendered", "watercolor", "minimalist"
   - Include context: "for a mobile app", "website hero image"
   - Include details: colors, mood, composition
   - For references: mention "same style as reference" or "keep the visual style"

3. **Choose appropriate settings:**
   - Icons/logos → `--aspect 1:1`
   - Banners/headers → `--aspect 16:9` or `21:9`
   - Mobile screens → `--aspect 9:16`
   - Xiaohongshu cards → `--aspect 3:4`
   - Photos → `--aspect 3:2` or `4:3`

## Examples

**App icon:**
```bash
python scripts/generate_image.py "Minimalist flat design app icon of a lightning bolt, purple gradient background, modern iOS style" \
  --output ./assets/app-icon.png --aspect 1:1
```

**Marketing banner:**
```bash
python scripts/generate_image.py "Professional website hero banner for a productivity app, abstract geometric shapes, blue and white color scheme" \
  --output ./public/images/hero-banner.png --aspect 16:9
```

**Xiaohongshu knowledge card:**
```bash
python scripts/generate_image.py "Tech knowledge card, dark blue purple gradient, neon cyan accents, code block style, Chinese text '标题'" \
  --output ./xiaohongshu/card.png --aspect 3:4
```

**Style transfer:**
```bash
python scripts/generate_image.py "Transform this photo into watercolor painting style" \
  --ref ./photo.jpg --output ./watercolor.png
```

**Character in new scene:**
```bash
python scripts/generate_image.py "Same character from reference, now sitting in a cafe, warm lighting" \
  --ref ./character.png --output ./character-cafe.png --aspect 3:2
```

## Prompt Tips

- **Be specific** - "A red apple on a wooden table" vs "an apple"
- **Include style** - "in the style of pixel art" or "photorealistic"
- **Mention purpose** - "for a children's book" affects the output style
- **Describe composition** - "centered", "rule of thirds", "close-up"
- **Specify colors** - Explicit color palettes yield better results
- **Reference prompts** - Use "same style as reference", "keep the visual aesthetic", "match the color palette"
- **Avoid** - Don't ask for complex text in images (use overlays instead)

## Limitations

- Maximum 14 reference images per request
- Text rendering may be imperfect (better to overlay text separately)
- Very specific brand logos may not reproduce exactly

Related Skills

generate-llms

from diegosouzapw/awesome-omni-skill

Generate llms.txt and llms-full.txt files for AI agent consumption following the llmstxt.org standard. Use when updating site content that should be reflected in the llms files, or when building/deploying the site.

all-images-ai-automation

from diegosouzapw/awesome-omni-skill

Automate All Images AI tasks via Rube MCP (Composio). Always search tools first for current schemas.

ai-image-generator

from diegosouzapw/awesome-omni-skill

使用 ModelScope 等平台生成 AI 图像。当用户需要生成图像、设计图标、创建角色立绘，或需要帮助编写 AI 绘画提示词时使用此技能。支持直接生成图像和仅优化提示词两种模式。

xhs-images

from diegosouzapw/awesome-omni-skill

Xiaohongshu (Little Red Book) infographic series generator with multiple style options. Breaks down content into 1-10 cartoon-style infographics. Use when user asks to create "小红书图片", "XHS images", or "RedNote infographics".

x-image-cards

from diegosouzapw/awesome-omni-skill

Create X/Twitter cards that look like images, not marketing banners. Use when asked to "create OG images", "set up X cards", "make social cards", or "twitter card without text".

wiro-image-fill

from diegosouzapw/awesome-omni-skill

Generate missing or placeholder images in a project by calling the Wiro image generation API, saving assets under public/assets generated folders, and producing a JSON mapping. Use when you see empty img src, placeholder.png, or other image gaps that need real assets.

seedream-image-generator

from diegosouzapw/awesome-omni-skill

Generate images using the Doubao SeeDream API based on text prompts. Use this skill when users request AI-generated images, artwork, illustrations, or visual content creation. The skill handles API calls, downloads generated images to the project's /pic folder, and supports batch generation of up to 4 sequential images.

placeholder-images

from diegosouzapw/awesome-omni-skill

Rule to use placekitten.com for placeholder images in seed data.

og-image-generator

from diegosouzapw/awesome-omni-skill

Generate and optimize Open Graph meta images for social media sharing. Use this skill when building web applications that need dynamic OG image generation with support for Vercel's @vercel/og library, pre-generated image storage, and social media optimization (Twitter Cards, Facebook, LinkedIn). Handles dynamic routes, performance optimization, and includes best practices for crawler compatibility and testing.

nanobanana-ppt-skills

from diegosouzapw/awesome-omni-skill

AI-powered PPT generation with document analysis and styled images

nanobanana-image

from diegosouzapw/awesome-omni-skill

Nano Banana (Google Gemini API) を使って画像を生成・編集するスキル。「画像を生成して」「イラストを作って」「○○の絵を描いて」「画像を作成」「この画像を編集して」「この画像をもとに○○を作って」「generate an image」「create a picture」「edit this image」などの依頼があった場合に使用。テキストからの生成、参照画像からの生成、画像編集、Google検索グラウンディングによる最新情報を反映した画像生成に対応。「最新の○○」「トレンドを反映」「リアルタイム情報」といった依頼にも対応可能。

imagen

from diegosouzapw/awesome-omni-skill

AI image generation skill powered by Google Gemini, enabling seamless visual content creation for UI placeholders, documentation, and design assets.