nano-banana-pro
Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.
Best use case
nano-banana-pro is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using nano-banana-pro should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/nano-banana-pro/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
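The manual steps above can be sketched in Python as follows. This is a minimal illustration, not an official installer: `install_skill` and its arguments are hypothetical names, and it assumes SKILL.md has already been downloaded to a local path. The SKILL.md source says the script lives in the same directory as SKILL.md, so `generate_image.py` presumably needs to be placed alongside it as well.

```python
import shutil
from pathlib import Path

# Skill directory name taken from the installation path in this page.
SKILL_NAME = "nano-banana-pro"


def install_skill(downloaded_skill_md: Path, project_root: Path) -> Path:
    """Copy an already-downloaded SKILL.md into the project's skill folder.

    Hypothetical helper: the target layout
    <project>/.claude/skills/nano-banana-pro/SKILL.md comes from the
    installation steps; everything else here is an assumption.
    """
    target_dir = project_root / ".claude" / "skills" / SKILL_NAME
    # Create .claude/skills/nano-banana-pro/ if it does not exist yet.
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / "SKILL.md"
    shutil.copyfile(downloaded_skill_md, target)
    return target
```

Calling `install_skill(Path("SKILL.md"), Path("."))` from the project root would place the file where the agent is expected to discover it after a restart.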
How nano-banana-pro Compares
| Feature / Agent | nano-banana-pro | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Nano Banana Pro Image Generator

Generate images using Google's advanced Nano Banana Pro model (`gemini-3-pro-image-preview`).

## Prerequisites

The user must have `GEMINI_API_KEY` environment variable set with a valid Google AI API key.

## Usage

The script is located in the same directory as this SKILL.md file. Run it with `uv run`:

```bash
uv run /path/to/skills/nano-banana-pro/generate_image.py "your prompt" -o output.png
```

When this skill is invoked, locate `generate_image.py` in the skill directory and run it.

### Parameters

| Parameter | Required | Description |
|-----------|----------|-------------|
| `prompt` | Yes | Text description of the image to generate or transformation to apply |
| `-o`, `--output` | Yes | Output filename (you decide the path based on context) |
| `-i`, `--image` | No | Input image(s) for editing/transformation or as context/reference (can be used multiple times) |
| `--aspect-ratio` | No | One of: `1:1`, `2:3`, `3:2`, `3:4`, `4:3`, `4:5`, `5:4`, `9:16`, `16:9`, `21:9` (default: `1:1`) |
| `--size` | No | Image size: `1K`, `2K`, `4K` (default: `1K`) |

### Examples

Basic image generation:

```bash
uv run generate_image.py "A sunset over mountains" -o sunset.png
```

Infographic with specific aspect ratio:

```bash
uv run generate_image.py "Infographic showing the water cycle with labeled stages" -o water_cycle.png --aspect-ratio 9:16
```

High-resolution ultrawide:

```bash
uv run generate_image.py "Professional photo of a modern office space" -o office.png --aspect-ratio 21:9 --size 4K
```

Edit an existing image:

```bash
uv run generate_image.py "Add a sunset sky to this image" -i photo.png -o edited.png
```

Transform with style:

```bash
uv run generate_image.py "Make this look like a watercolor painting" -i input.jpg -o watercolor.png
```

Combine multiple images:

```bash
uv run generate_image.py "Create a collage blending these images together" -i img1.png -i img2.png -o collage.png
```

Use image as context/reference:

```bash
uv run generate_image.py "Generate a new landscape in the same style as this reference" -i reference.png -o new_landscape.png
```

## Model Capabilities

Nano Banana Pro excels at:

- **Accurate infographics** with real data (uses Google Search grounding)
- **Text rendering** in images
- **Image editing and transformation** from input images
- **Context-aware generation** using reference images for style, composition, or subject
- **Cartographic visualizations** and maps
- **Detailed instruction following**
- **Chain-of-thought reasoning** for complex visual tasks

## Output

The script prints:

- Progress message while generating
- Path to saved image on success
- Any text response from the model
- Error message if no image was generated
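As a rough illustration of how an agent might assemble the documented command line, the sketch below validates the aspect ratio and size against the values listed in the parameter table and checks for `GEMINI_API_KEY` before anything is run. The helper names `build_command` and `has_api_key` are hypothetical and not part of the skill; only the flags, allowed values, and defaults come from the SKILL.md source.

```python
import os
import shlex
from typing import List, Mapping, Optional, Sequence

# Allowed values copied from the parameter table in SKILL.md.
ASPECT_RATIOS = ("1:1", "2:3", "3:2", "3:4", "4:3", "4:5", "5:4", "9:16", "16:9", "21:9")
SIZES = ("1K", "2K", "4K")


def has_api_key(env: Optional[Mapping[str, str]] = None) -> bool:
    """Return True if GEMINI_API_KEY is set to a non-empty value."""
    env = os.environ if env is None else env
    return bool(env.get("GEMINI_API_KEY"))


def build_command(
    script: str,
    prompt: str,
    output: str,
    images: Sequence[str] = (),
    aspect_ratio: str = "1:1",
    size: str = "1K",
) -> List[str]:
    """Build the `uv run generate_image.py ...` argument list (hypothetical helper)."""
    if not prompt:
        raise ValueError("prompt is required")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError(f"unsupported aspect ratio: {aspect_ratio}")
    if size not in SIZES:
        raise ValueError(f"unsupported size: {size}")
    cmd = ["uv", "run", script, prompt, "-o", output]
    # -i may be repeated, once per input or reference image.
    for image in images:
        cmd += ["-i", image]
    # Passed explicitly even when equal to the documented defaults.
    cmd += ["--aspect-ratio", aspect_ratio, "--size", size]
    return cmd


if __name__ == "__main__":
    if not has_api_key():
        print("GEMINI_API_KEY is not set; the script is expected to need it.")
    cmd = build_command("generate_image.py", "A sunset over mountains", "sunset.png")
    # Print a shell-safe rendering; hand `cmd` to subprocess.run to execute it.
    print(shlex.join(cmd))
```

Building the command as a list rather than a string keeps prompts containing quotes or spaces intact when the list is later passed to `subprocess.run`.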
Related Skills
nano-banana
REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.
youtube-to-markdown
Use when the user asks to extract, get, or fetch YouTube video transcripts, subtitles, or captions. Writes video details and the transcription into a structured markdown file.
youtube-seo-optimizer
Optimize YouTube videos for search and discovery. Generates SEO-optimized titles, descriptions, tags, hashtags, and chapters. Includes keyword research and competitor analysis. Use when publishing videos, improving discoverability, or optimizing existing content.
webfluence
Content web architecture framework. Use when diagnosing offer doc usage, content-to-conversion pathways, or why someone isn't getting sales despite traffic.
video-to-gif
Convert video clips to optimized GIFs with speed control, cropping, text overlays, and file size optimization. Create perfect GIFs for social media, documentation, and presentations.
video-title-optimizer
Optimize video titles for maximum click-through rate (CTR) and YouTube/TikTok SEO. Generates multiple title variations balancing curiosity, keywords, and platform best practices. Use when naming videos, improving CTR, or A/B testing titles.
video-script-writer
Write engaging video scripts for YouTube, TikTok, and other platforms. Creates complete scripts with hooks, main content, and CTAs. Supports various formats including tutorials, vlogs, reviews, explainers, and storytelling. Use when creating video scripts, writing YouTube content, or planning video structure.
video-script-collaborial
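Converts video scripts into conversational wording better suited to actual recording, removing formal written language and adding naturalness and warmth. Use this skill when the user mentions "video script", "recording", "conversational", "more natural", "like speaking", or "too formal".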
video-hook-generator
Generate attention-grabbing hooks for the first 3 seconds of videos. The hook determines if viewers stay or scroll. Creates multiple hook variations for A/B testing. Use when crafting video openings, improving retention, or creating scroll-stopping content for YouTube, TikTok, or Reels.
youtube-downloader
Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, save, or grab YouTube videos. Supports various quality settings (best, 1080p, 720p, 480p, 360p), multiple formats (mp4, webm, mkv), and audio-only downloads as MP3.
video-comparer
This skill should be used when comparing two videos to analyze compression results or quality differences. Generates interactive HTML reports with quality metrics (PSNR, SSIM) and frame-by-frame visual comparisons. Triggers when users mention "compare videos", "video quality", "compression analysis", "before/after compression", or request quality assessment of compressed videos.
video-analytics-interpreter
Interpret YouTube Analytics, TikTok Analytics, and video performance data. Identifies trends, explains metrics, and provides actionable recommendations for growth. Use when analyzing video performance, understanding metrics, or optimizing channel strategy.