gemini-api
Google Gemini 3 Pro Image API reference. Covers text-to-image, editing, reference images, aspect ratios, and error handling.
Best use case
gemini-api is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Google Gemini 3 Pro Image API reference. Covers text-to-image, editing, reference images, aspect ratios, and error handling.
Teams using gemini-api should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/gemini-api/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How gemini-api Compares
| Feature / Agent | gemini-api | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Google Gemini 3 Pro Image API reference. Covers text-to-image, editing, reference images, aspect ratios, and error handling.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
plugin: nanobanana updated: 2026-01-20 # Gemini Image API Reference ## Quick Start ```bash # Set API key export GEMINI_API_KEY="your-key" # Generate image uv run python main.py output.png "A minimal 3D cube" ``` ## API Key Setup 1. Visit: https://makersuite.google.com/app/apikey 2. Create new API key 3. Set environment variable: ```bash export GEMINI_API_KEY="your-api-key" ``` ## Supported Models | Model | Resolution | Best For | |-------|------------|----------| | gemini-3-pro-image-preview | Up to 4K | High quality | | gemini-2.5-flash-image | Up to 1K | Quick iterations | ## Aspect Ratios | Ratio | Use Case | |-------|----------| | 1:1 | Social media, icons | | 3:4 | Portrait photos | | 4:3 | Traditional photos | | 4:5 | Instagram portrait | | 5:4 | Landscape photos | | 9:16 | Mobile, stories | | 16:9 | YouTube, desktop | | 21:9 | Cinematic, ultrawide | ## CLI Flags | Flag | Description | Example | |------|-------------|---------| | `--style` | Apply style template | `--style styles/glass.md` | | `--edit` | Edit existing image | `--edit photo.jpg` | | `--ref` | Reference image | `--ref style.png` | | `--aspect` | Aspect ratio | `--aspect 16:9` | | `--model` | Model ID | `--model gemini-2.5-flash-image` | | `--max-retries` | Retry attempts | `--max-retries 5` | ## Error Codes | Code | Meaning | Recovery | |------|---------|----------| | `SUCCESS` | Operation completed | N/A | | `API_KEY_MISSING` | GEMINI_API_KEY not set | Export the variable | | `FILE_NOT_FOUND` | Referenced file missing | Check path | | `INVALID_INPUT` | Bad prompt or argument | Fix input | | `RATE_LIMITED` | Too many requests | Wait, uses auto-retry | | `NETWORK_ERROR` | Connection failed | Check network, auto-retry | | `API_ERROR` | Gemini API error | Check logs | | `CONTENT_POLICY` | Blocked prompt | Adjust content | | `TIMEOUT` | Request timed out | Retry | | `PARTIAL_FAILURE` | Some batch items failed | Check individual results | ## Retry Behavior The script automatically retries on transient errors: - Rate limits (429) - Server errors (502, 503) - Connection timeouts - Network errors Retry uses exponential backoff: 1s, 2s, 4s, 8s, etc. Maximum retries configurable with `--max-retries` (default: 3) ## Best Practices 1. **Prompts**: Be specific about style, lighting, composition 2. **Styles**: Use markdown templates for consistent results 3. **References**: Provide visual examples for style matching 4. **Batch**: Generate variations to pick the best 5. **Iteration**: Edit results to refine 6. **Retries**: Increase `--max-retries` for unreliable connections
Related Skills
test-skill
A test skill for validation testing. Use when testing skill parsing and validation logic.
bad-skill
This skill has invalid YAML in frontmatter
release
Plugin release process for MAG Claude Plugins marketplace. Covers version bumping, marketplace.json updates, git tagging, and common mistakes. Use when releasing new plugin versions or troubleshooting update issues.
openrouter-trending-models
Fetch trending programming models from OpenRouter rankings. Use when selecting models for multi-model review, updating model recommendations, or researching current AI coding trends. Provides model IDs, context windows, pricing, and usage statistics from the most recent week.
Claudish Integration Skill
**Version:** 1.0.0
transcription
Audio/video transcription using OpenAI Whisper. Covers installation, model selection, transcript formats (SRT, VTT, JSON), timing synchronization, and speaker diarization. Use when transcribing media or generating subtitles.
final-cut-pro
Apple Final Cut Pro FCPXML format reference. Covers project structure, timeline creation, clip references, effects, and transitions. Use when generating FCP projects or understanding FCPXML structure.
ffmpeg-core
FFmpeg fundamentals for video/audio manipulation. Covers common operations (trim, concat, convert, extract), codec selection, filter chains, and performance optimization. Use when planning or executing video processing tasks.
statusline-customization
Configuration reference and troubleshooting for the statusline plugin — sections, themes, bar widths, and script architecture
technical-audit
Technical SEO audit methodology including crawlability, indexability, and Core Web Vitals analysis. Use when auditing pages or sites for technical SEO issues.
serp-analysis
SERP analysis techniques for intent classification, feature identification, and competitive intelligence. Use when analyzing search results for content strategy.
schema-markup
Schema.org markup implementation patterns for rich results. Use when adding structured data to content for enhanced SERP appearances.