nano-banana-pro

Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.

151 stars

Best use case

nano-banana-pro is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.

Teams using nano-banana-pro should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/nano-banana-pro/SKILL.md --create-dirs "https://raw.githubusercontent.com/nicepkg/ai-workflow/main/workflows/talk-to-slidev-workflow/.claude/skills/nano-banana-pro/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/nano-banana-pro/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How nano-banana-pro Compares

Feature / Agentnano-banana-proStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Nano Banana Pro Image Generator

Generate images using Google's advanced Nano Banana Pro model (`gemini-3-pro-image-preview`).

## Prerequisites

The user must have `GEMINI_API_KEY` environment variable set with a valid Google AI API key.

## Usage

The script is located in the same directory as this SKILL.md file. Run it with `uv run`:

```bash
uv run /path/to/skills/nano-banana-pro/generate_image.py "your prompt" -o output.png
```

When this skill is invoked, locate `generate_image.py` in the skill directory and run it.

### Parameters

| Parameter | Required | Description |
|-----------|----------|-------------|
| `prompt` | Yes | Text description of the image to generate or transformation to apply |
| `-o`, `--output` | Yes | Output filename (you decide the path based on context) |
| `-i`, `--image` | No | Input image(s) for editing/transformation or as context/reference (can be used multiple times) |
| `--aspect-ratio` | No | One of: `1:1`, `2:3`, `3:2`, `3:4`, `4:3`, `4:5`, `5:4`, `9:16`, `16:9`, `21:9` (default: `1:1`) |
| `--size` | No | Image size: `1K`, `2K`, `4K` (default: `1K`) |

### Examples

Basic image generation:
```bash
uv run generate_image.py "A sunset over mountains" -o sunset.png
```

Infographic with specific aspect ratio:
```bash
uv run generate_image.py "Infographic showing the water cycle with labeled stages" -o water_cycle.png --aspect-ratio 9:16
```

High-resolution ultrawide:
```bash
uv run generate_image.py "Professional photo of a modern office space" -o office.png --aspect-ratio 21:9 --size 4K
```

Edit an existing image:
```bash
uv run generate_image.py "Add a sunset sky to this image" -i photo.png -o edited.png
```

Transform with style:
```bash
uv run generate_image.py "Make this look like a watercolor painting" -i input.jpg -o watercolor.png
```

Combine multiple images:
```bash
uv run generate_image.py "Create a collage blending these images together" -i img1.png -i img2.png -o collage.png
```

Use image as context/reference:
```bash
uv run generate_image.py "Generate a new landscape in the same style as this reference" -i reference.png -o new_landscape.png
```

## Model Capabilities

Nano Banana Pro excels at:
- **Accurate infographics** with real data (uses Google Search grounding)
- **Text rendering** in images
- **Image editing and transformation** from input images
- **Context-aware generation** using reference images for style, composition, or subject
- **Cartographic visualizations** and maps
- **Detailed instruction following**
- **Chain-of-thought reasoning** for complex visual tasks

## Output

The script prints:
- Progress message while generating
- Path to saved image on success
- Any text response from the model
- Error message if no image was generated

Related Skills

nano-banana

151
from nicepkg/ai-workflow

REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.

youtube-to-markdown

151
from nicepkg/ai-workflow

Use when user asks YouTube video extraction, get, fetch, transcripts, subtitles, or captions. Writes video details and transcription into structured markdown file.

youtube-seo-optimizer

151
from nicepkg/ai-workflow

Optimize YouTube videos for search and discovery. Generates SEO-optimized titles, descriptions, tags, hashtags, and chapters. Includes keyword research and competitor analysis. Use when publishing videos, improving discoverability, or optimizing existing content.

webfluence

151
from nicepkg/ai-workflow

Content web architecture framework. Use when diagnosing offer doc usage, content-to-conversion pathways, or why someone isn't getting sales despite traffic.

video-to-gif

151
from nicepkg/ai-workflow

Convert video clips to optimized GIFs with speed control, cropping, text overlays, and file size optimization. Create perfect GIFs for social media, documentation, and presentations.

video-title-optimizer

151
from nicepkg/ai-workflow

Optimize video titles for maximum click-through rate (CTR) and YouTube/TikTok SEO. Generates multiple title variations balancing curiosity, keywords, and platform best practices. Use when naming videos, improving CTR, or A/B testing titles.

video-script-writer

151
from nicepkg/ai-workflow

Write engaging video scripts for YouTube, TikTok, and other platforms. Creates complete scripts with hooks, main content, and CTAs. Supports various formats including tutorials, vlogs, reviews, explainers, and storytelling. Use when creating video scripts, writing YouTube content, or planning video structure.

video-script-collaborial

151
from nicepkg/ai-workflow

将视频脚本转换为更适合实际录制的口语化表达,去除书面化语言,增加自然感和亲和力。当用户提到"视频脚本"、"录制"、"口语化"、"自然一点"、"像说话一样"、"太书面了"时使用此技能。

video-hook-generator

151
from nicepkg/ai-workflow

Generate attention-grabbing hooks for the first 3 seconds of videos. The hook determines if viewers stay or scroll. Creates multiple hook variations for A/B testing. Use when crafting video openings, improving retention, or creating scroll-stopping content for YouTube, TikTok, or Reels.

youtube-downloader

151
from nicepkg/ai-workflow

Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, save, or grab YouTube videos. Supports various quality settings (best, 1080p, 720p, 480p, 360p), multiple formats (mp4, webm, mkv), and audio-only downloads as MP3.

video-comparer

151
from nicepkg/ai-workflow

This skill should be used when comparing two videos to analyze compression results or quality differences. Generates interactive HTML reports with quality metrics (PSNR, SSIM) and frame-by-frame visual comparisons. Triggers when users mention "compare videos", "video quality", "compression analysis", "before/after compression", or request quality assessment of compressed videos.

video-analytics-interpreter

151
from nicepkg/ai-workflow

Interpret YouTube Analytics, TikTok Analytics, and video performance data. Identifies trends, explains metrics, and provides actionable recommendations for growth. Use when analyzing video performance, understanding metrics, or optimizing channel strategy.