nano-banana
REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.
Best use case
nano-banana is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.
Teams using nano-banana should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/nano-banana/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How nano-banana Compares
| Feature / Agent | nano-banana | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
REQUIRED for all image generation requests. Generate and edit images using Nano Banana (Gemini CLI). Handles blog featured images, YouTube thumbnails, icons, diagrams, patterns, illustrations, photos, visual assets, graphics, artwork, pictures. Use this skill whenever the user asks to create, generate, make, draw, design, or edit any image or visual content.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Nano Banana Image Generation Generate professional images via the Gemini CLI's nanobanana extension. ## When to Use This Skill ALWAYS use this skill when the user: - Asks for any image, graphic, illustration, or visual - Wants a thumbnail, featured image, or banner - Requests icons, diagrams, or patterns - Asks to edit, modify, or restore a photo - Uses words like: generate, create, make, draw, design, visualize Do NOT attempt to generate images through any other method. ## Before First Use 1. Verify extension is installed: ```bash gemini extensions list | grep nanobanana ``` 2. If missing, install it: ```bash gemini extensions install https://github.com/gemini-cli-extensions/nanobanana ``` 3. Verify API key is set: ```bash [ -n "$GEMINI_API_KEY" ] && echo "API key configured" || echo "Missing GEMINI_API_KEY" ``` ## Command Selection | User Request | Command | |--------------|---------| | "make me a blog header" | `/generate` | | "create an app icon" | `/icon` | | "draw a flowchart of..." | `/diagram` | | "fix this old photo" | `/restore` | | "remove the background" | `/edit` | | "create a repeating texture" | `/pattern` | | "make a comic strip" | `/story` | ## Available Commands **Note:** Always use the `--yolo` flag to automatically approve all tool actions. | Command | Use Case | |---------|----------| | `gemini --yolo "/generate 'prompt'"` | Text-to-image generation | | `gemini --yolo "/edit file.png 'instruction'"` | Modify existing image | | `gemini --yolo "/restore old_photo.jpg 'fix scratches'"` | Repair damaged photos | | `gemini --yolo "/icon 'description'"` | App icons, favicons, UI elements | | `gemini --yolo "/diagram 'description'"` | Flowcharts, architecture diagrams | | `gemini --yolo "/pattern 'description'"` | Seamless textures and patterns | | `gemini --yolo "/story 'description'"` | Sequential/narrative images | | `gemini --yolo "/nanobanana prompt"` | Natural language interface | ## Common Options - `--yolo` - **Required.** Auto-approve all tool actions (no confirmation prompts) - `--count=N` - Generate N variations (1-8) - `--preview` - Auto-open generated images - `--styles="style1,style2"` - Apply artistic styles - `--format=grid|separate` - Output arrangement ## Common Sizes | Use Case | Dimensions | Notes | |----------|------------|-------| | YouTube thumbnail | 1280x720 | `--aspect=16:9` | | Blog featured image | 1200x630 | Social preview friendly | | Square social | 1080x1080 | Instagram, LinkedIn | | Twitter/X header | 1500x500 | Wide banner | | Vertical story | 1080x1920 | `--aspect=9:16` | ## Model Selection Default: `gemini-2.5-flash-image` (~$0.04/image) For higher quality (4K, better reasoning): ```bash export NANOBANANA_MODEL=gemini-3-pro-image-preview ``` ## Blog Featured Image Examples ```bash # Modern illustration style gemini --yolo "/generate 'modern flat illustration of developer coding at laptop, purple and blue gradient background, minimalist style, no text' --preview" # Professional photography style gemini --yolo "/generate 'professional editorial photo of coffee cup next to laptop on wooden desk, morning sunlight, shallow depth of field, no text' --count=3" # Tech/abstract gemini --yolo "/generate 'abstract visualization of neural network connections, dark background with glowing blue nodes, futuristic style' --preview" ``` ## Icon Generation ```bash gemini --yolo "/icon 'minimalist app logo for productivity tool' --sizes='64,128,256,512' --type='app-icon' --corners='rounded'" ``` ## Diagram Generation ```bash gemini --yolo "/diagram 'user authentication flow with OAuth' --type='flowchart' --style='modern'" ``` ## Output Location All generated images are saved to `./nanobanana-output/` in the current directory. ## Presenting Results After generation completes: 1. List contents of `./nanobanana-output/` to find generated files 2. Present the most recent image(s) to the user 3. Offer to regenerate with variations if needed ## Refinements and Iterations When the user asks for changes: - **"Try again" / "Give me options"**: Regenerate with `--count=3` - **"Make it more [adjective]"**: Adjust prompt and regenerate - **"Edit this one"**: Use `gemini --yolo "/edit nanobanana-output/filename.png 'adjustment'"` - **"Different style"**: Add `--styles="requested_style"` to the command ## Prompt Tips 1. **Be specific**: Include style, mood, colors, composition details 2. **Add "no text"**: If you don't want text rendered in the image 3. **Reference styles**: "editorial photography", "flat illustration", "3D render", "watercolor" 4. **Specify aspect ratio context**: "wide banner", "square thumbnail", "vertical story" ## Troubleshooting | Problem | Solution | |---------|----------| | `GEMINI_API_KEY` not set | `export GEMINI_API_KEY="your-key"` | | Extension not found | Run install command from setup section | | Quota exceeded | Wait for reset or switch to flash model | | Image generation failed | Check prompt for policy violations, simplify request | | Output directory missing | Will be created automatically on first run |
Related Skills
nano-banana-pro
Generate images using Google's Nano Banana Pro (gemini-3-pro-image-preview). Accepts text prompts and optionally images (for editing/transformation) as INPUT. Returns generated IMAGES as OUTPUT. Use when user asks to create, generate, edit, or draw images, infographics, visualizations, diagrams, charts, or illustrations. Excellent for data-accurate infographics and text rendering.
youtube-to-markdown
Use when user asks YouTube video extraction, get, fetch, transcripts, subtitles, or captions. Writes video details and transcription into structured markdown file.
youtube-seo-optimizer
Optimize YouTube videos for search and discovery. Generates SEO-optimized titles, descriptions, tags, hashtags, and chapters. Includes keyword research and competitor analysis. Use when publishing videos, improving discoverability, or optimizing existing content.
webfluence
Content web architecture framework. Use when diagnosing offer doc usage, content-to-conversion pathways, or why someone isn't getting sales despite traffic.
video-to-gif
Convert video clips to optimized GIFs with speed control, cropping, text overlays, and file size optimization. Create perfect GIFs for social media, documentation, and presentations.
video-title-optimizer
Optimize video titles for maximum click-through rate (CTR) and YouTube/TikTok SEO. Generates multiple title variations balancing curiosity, keywords, and platform best practices. Use when naming videos, improving CTR, or A/B testing titles.
video-script-writer
Write engaging video scripts for YouTube, TikTok, and other platforms. Creates complete scripts with hooks, main content, and CTAs. Supports various formats including tutorials, vlogs, reviews, explainers, and storytelling. Use when creating video scripts, writing YouTube content, or planning video structure.
video-script-collaborial
将视频脚本转换为更适合实际录制的口语化表达,去除书面化语言,增加自然感和亲和力。当用户提到"视频脚本"、"录制"、"口语化"、"自然一点"、"像说话一样"、"太书面了"时使用此技能。
video-hook-generator
Generate attention-grabbing hooks for the first 3 seconds of videos. The hook determines if viewers stay or scroll. Creates multiple hook variations for A/B testing. Use when crafting video openings, improving retention, or creating scroll-stopping content for YouTube, TikTok, or Reels.
youtube-downloader
Download YouTube videos with customizable quality and format options. Use this skill when the user asks to download, save, or grab YouTube videos. Supports various quality settings (best, 1080p, 720p, 480p, 360p), multiple formats (mp4, webm, mkv), and audio-only downloads as MP3.
video-comparer
This skill should be used when comparing two videos to analyze compression results or quality differences. Generates interactive HTML reports with quality metrics (PSNR, SSIM) and frame-by-frame visual comparisons. Triggers when users mention "compare videos", "video quality", "compression analysis", "before/after compression", or request quality assessment of compressed videos.
video-analytics-interpreter
Interpret YouTube Analytics, TikTok Analytics, and video performance data. Identifies trends, explains metrics, and provides actionable recommendations for growth. Use when analyzing video performance, understanding metrics, or optimizing channel strategy.