gemini-image-simple
Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.
Best use case
gemini-image-simple is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.
Teams using gemini-image-simple should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/gemini-image-simple/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How gemini-image-simple Compares
| Feature / Agent | gemini-image-simple | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Gemini Image Simple
Generate and edit images using Google's **Nano Banana Pro** (Gemini 3 Pro Image) - the highest quality image generation model.
## Why This Skill
| Feature | This Skill | Others (nano-banana-pro, etc.) |
|---------|------------|-------------------------------|
| **Dependencies** | None (stdlib only) | google-genai, pillow, etc. |
| **Requires pip/uv** | ❌ No | ✅ Yes |
| **Works on Fly.io free** | ✅ Yes | ❌ Fails |
| **Works in containers** | ✅ Yes | ❌ Often fails |
| **Image generation** | ✅ Full | ✅ Full |
| **Image editing** | ✅ Yes | ✅ Yes |
| **Setup complexity** | Just set API key | Install packages first |
**Bottom line:** This skill works anywhere Python 3 exists. No package managers, no virtual environments, no permission issues.
## Quick Start
```bash
# Generate
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "A cat wearing a tiny hat" cat.png
# Edit existing image
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "Make it sunset lighting" edited.png --input original.png
```
## Usage
### Generate new image
```bash
python3 {baseDir}/scripts/generate.py "your prompt" output.png
```
### Edit existing image
```bash
python3 {baseDir}/scripts/generate.py "edit instructions" output.png --input source.png
```
Supported input formats: PNG, JPG, JPEG, GIF, WEBP
## Environment
Set `GEMINI_API_KEY` environment variable. Get one at https://aistudio.google.com/apikey
## How It Works
Uses **Nano Banana Pro** (`nano-banana-pro-preview`) - Google's highest quality image generation model:
- Pure `urllib.request` for HTTP (no requests library)
- Pure `json` for parsing (stdlib)
- Pure `base64` for encoding (stdlib)
That's it. No external packages. Works on any Python 3.10+ installation.
## Model
Currently using: `nano-banana-pro-preview` (also known as Gemini 3 Pro Image)
Other available models (can be changed in generate.py if needed):
- `gemini-3-pro-image-preview` - Same as Nano Banana Pro
- `imagen-4.0-ultra-generate-001` - Imagen 4.0 Ultra
- `imagen-4.0-generate-001` - Imagen 4.0
- `gemini-2.5-flash-image` - Gemini 2.5 Flash with image gen
## Examples
```bash
# Landscape
python3 {baseDir}/scripts/generate.py "Misty mountains at sunrise, photorealistic" mountains.png
# Product shot
python3 {baseDir}/scripts/generate.py "Minimalist product photo of a coffee cup, white background" coffee.png
# Edit: change style
python3 {baseDir}/scripts/generate.py "Convert to watercolor painting style" watercolor.png --input photo.jpg
# Edit: add element
python3 {baseDir}/scripts/generate.py "Add a rainbow in the sky" rainbow.png --input landscape.png
```Related Skills
ImageMagick Moltbot Skill
Comprehensive ImageMagick operations for image manipulation in Moltbot.
simple-proposal-template-v2
Free basic proposal template for freelancers.
table-image
Generate images from tables for better readability in messaging apps like Telegram. Use when displaying tabular data.
ms-foundry-image-gen
Azure Foundry image generation skill for OpenClaw; generates images via a Foundry deployment and returns image.
antigravity-image-gen
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
gemini-image-gen
Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero dependencies — pure Python stdlib.
antigravity-image
Generate images using the internal Antigravity Sandbox API (Gemini 3 Pro Image). Supports text-to-image generation via internal Google endpoints.
zhipu-cogview-image
Generate images using Zhipu AI's CogView model.
doubao-image-gen
Use Zhipu (智谱) web search API for searching the internet.
10-of-my-most-popular-text-to-image-series-prompts-78b0897e
generate a bunch of images, then you curate the results to handpick the best ones
Seasonal Product Image
**Version**: 1.0.0
google-imagen-3-portrait-photography
Generate professional portrait photography using Google Imagen 3. Use when creating realistic portraits, headshots, or artistic character photography with professional lighting and composition.