gemini-image-simple

Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.

7 stars

Best use case

gemini-image-simple is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.

Teams using gemini-image-simple should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/gemini-image-simple/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/cluka-399/gemini-image-simple/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/gemini-image-simple/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How gemini-image-simple Compares

Feature / Agentgemini-image-simpleStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Gemini Image Simple

Generate and edit images using Google's **Nano Banana Pro** (Gemini 3 Pro Image) - the highest quality image generation model.

## Why This Skill

| Feature | This Skill | Others (nano-banana-pro, etc.) |
|---------|------------|-------------------------------|
| **Dependencies** | None (stdlib only) | google-genai, pillow, etc. |
| **Requires pip/uv** | ❌ No | ✅ Yes |
| **Works on Fly.io free** | ✅ Yes | ❌ Fails |
| **Works in containers** | ✅ Yes | ❌ Often fails |
| **Image generation** | ✅ Full | ✅ Full |
| **Image editing** | ✅ Yes | ✅ Yes |
| **Setup complexity** | Just set API key | Install packages first |

**Bottom line:** This skill works anywhere Python 3 exists. No package managers, no virtual environments, no permission issues.

## Quick Start

```bash
# Generate
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "A cat wearing a tiny hat" cat.png

# Edit existing image  
python3 /data/clawd/skills/gemini-image-simple/scripts/generate.py "Make it sunset lighting" edited.png --input original.png
```

## Usage

### Generate new image

```bash
python3 {baseDir}/scripts/generate.py "your prompt" output.png
```

### Edit existing image

```bash
python3 {baseDir}/scripts/generate.py "edit instructions" output.png --input source.png
```

Supported input formats: PNG, JPG, JPEG, GIF, WEBP

## Environment

Set `GEMINI_API_KEY` environment variable. Get one at https://aistudio.google.com/apikey

## How It Works

Uses **Nano Banana Pro** (`nano-banana-pro-preview`) - Google's highest quality image generation model:
- Pure `urllib.request` for HTTP (no requests library)
- Pure `json` for parsing (stdlib)
- Pure `base64` for encoding (stdlib)

That's it. No external packages. Works on any Python 3.10+ installation.

## Model

Currently using: `nano-banana-pro-preview` (also known as Gemini 3 Pro Image)

Other available models (can be changed in generate.py if needed):
- `gemini-3-pro-image-preview` - Same as Nano Banana Pro
- `imagen-4.0-ultra-generate-001` - Imagen 4.0 Ultra
- `imagen-4.0-generate-001` - Imagen 4.0
- `gemini-2.5-flash-image` - Gemini 2.5 Flash with image gen

## Examples

```bash
# Landscape
python3 {baseDir}/scripts/generate.py "Misty mountains at sunrise, photorealistic" mountains.png

# Product shot
python3 {baseDir}/scripts/generate.py "Minimalist product photo of a coffee cup, white background" coffee.png

# Edit: change style
python3 {baseDir}/scripts/generate.py "Convert to watercolor painting style" watercolor.png --input photo.jpg

# Edit: add element
python3 {baseDir}/scripts/generate.py "Add a rainbow in the sky" rainbow.png --input landscape.png
```

Related Skills

ImageMagick Moltbot Skill

7
from Demerzels-lab/elsamultiskillagent

Comprehensive ImageMagick operations for image manipulation in Moltbot.

simple-proposal-template-v2

7
from Demerzels-lab/elsamultiskillagent

Free basic proposal template for freelancers.

table-image

7
from Demerzels-lab/elsamultiskillagent

Generate images from tables for better readability in messaging apps like Telegram. Use when displaying tabular data.

ms-foundry-image-gen

7
from Demerzels-lab/elsamultiskillagent

Azure Foundry image generation skill for OpenClaw; generates images via a Foundry deployment and returns image.

antigravity-image-gen

7
from Demerzels-lab/elsamultiskillagent

Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.

gemini-image-gen

7
from Demerzels-lab/elsamultiskillagent

Generate and edit images via Google Gemini API. Supports Gemini native generation, Imagen 3, style presets, and batch generation with HTML gallery. Zero dependencies — pure Python stdlib.

antigravity-image

7
from Demerzels-lab/elsamultiskillagent

Generate images using the internal Antigravity Sandbox API (Gemini 3 Pro Image). Supports text-to-image generation via internal Google endpoints.

zhipu-cogview-image

7
from Demerzels-lab/elsamultiskillagent

Generate images using Zhipu AI's CogView model.

doubao-image-gen

7
from Demerzels-lab/elsamultiskillagent

Use Zhipu (智谱) web search API for searching the internet.

10-of-my-most-popular-text-to-image-series-prompts-78b0897e

7
from Demerzels-lab/elsamultiskillagent

generate a bunch of images, then you curate the results to handpick the best ones

Seasonal Product Image

7
from Demerzels-lab/elsamultiskillagent

**Version**: 1.0.0

google-imagen-3-portrait-photography

7
from Demerzels-lab/elsamultiskillagent

Generate professional portrait photography using Google Imagen 3. Use when creating realistic portraits, headshots, or artistic character photography with professional lighting and composition.