gemini-image

当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

242 stars

Best use case

gemini-image is best used when you need a repeatable AI agent workflow instead of a one-off prompt. It is especially useful for teams working in multi. 当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

Users should expect a more consistent workflow output, faster repeated execution, and less time spent rewriting prompts from scratch.

Practical example

Example input

Use the "gemini-image" skill to help with this workflow task. Context: 当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

Example output

A structured workflow result with clearer steps, more consistent formatting, and an output that is easier to reuse in the next run.

When to use this skill

Use this skill when you want a reusable workflow rather than writing the same prompt again and again.

When not to use this skill

Do not use this when you only need a one-off answer and do not need a reusable workflow.
Do not use it if you cannot install or maintain the related files, repository context, or supporting tools.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/gemini-image/SKILL.md --create-dirs "https://raw.githubusercontent.com/aiskillstore/marketplace/main/skills/ceeon/gemini-image/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/gemini-image/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How gemini-image Compares

Feature / Agent	gemini-image	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

当用户想要生成图片、画图、绘画、创建图像、AI作画时使用此 Skill。支持文生图和图生图。

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Gemini 图像生成

当用户表达画图意图时（如"画一个..."、"生成图片..."、"帮我创作..."），使用此 Skill。

## 调用步骤

### 1. 读取配置
- 读取 `config/secrets.md` 获取 API Key

### 2. 构造 prompt

| 模式 | prompt 格式 | 示例 |
|-----|------------|------|
| 文生图 | `描述文字` | `一只可爱的橘猫` |
| 图生图 | `图片URL 描述文字` | `https://xxx.jpg 画类似风格` |
| 多图参考 | `URL1 URL2 描述文字` | `https://a.jpg https://b.jpg 融合两张图` |

图生图需先上传图片，参考 `tips/image-upload.md`。

### 3. 调用 API

```bash
curl -s -X POST "https://api.apicore.ai/v1/images/generations" \
  -H "Authorization: Bearer API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "模型名称",
    "prompt": "提示词",
    "size": "尺寸比例",
    "n": 1
  }'
```

### 4. 返回结果

从响应中提取 `data[0].url` 返回给用户。

## 参考文档

- `tips/image-upload.md` - 图片上传方法
- `tips/chinese-text.md` - 中文文字处理技巧

Related Skills

image-assistant

242

from aiskillstore/marketplace

配图助手 - 把文章/模块内容转成统一风格、少字高可读的 16:9 信息图提示词；先定“需要几张图+每张讲什么”，再压缩文案与隐喻，最后输出可直接复制的生图提示词并迭代。

zimage-skill

242

from aiskillstore/marketplace

Generate images using ModelScope Z-Image-Turbo API. Use when user asks to generate, create, or make images, pictures, or illustrations.

qwen-image

242

from aiskillstore/marketplace

Generate and edit images with Alibaba Qwen-Image-2.0 models via inference.sh CLI. Models: Qwen-Image-2.0 (fast), Qwen-Image-2.0-Pro (professional text rendering). Capabilities: text-to-image, multi-image editing, complex text rendering. Triggers: qwen image, qwen-image, alibaba image, dashscope image, qwen image 2, qwen image pro

qwen-image-pro

242

from aiskillstore/marketplace

Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI. Professional text rendering, fine-grained realism, enhanced semantic adherence. Ideal for posters, banners, and text-heavy designs. Triggers: qwen image pro, qwen-image-pro, qwen 2 pro, alibaba image pro, dashscope pro, professional text rendering

gemini

242

from aiskillstore/marketplace

Use when the user asks to run Gemini CLI for code review, plan review, or big context (>200k) processing. Ideal for comprehensive analysis requiring large context windows. Uses Gemini 3 Pro by default for state-of-the-art reasoning and coding.

nerdzao-elite-gemini-high

242

from aiskillstore/marketplace

Modo Elite Coder + UX Pixel-Perfect otimizado especificamente para Gemini 3.1 Pro High. Workflow completo com foco em qualidade máxima e eficiência de tokens.

imagen

242

from aiskillstore/marketplace

Generates images using Google Gemini's image generation model for frontend UIs, documentation, and visual representations.

gemini-api-dev

242

from aiskillstore/marketplace

Use this skill when building applications with Gemini models, Gemini API, working with multimodal content (text, images, audio, video), implementing function calling, using structured outputs, or needing current model specifications. Covers SDK usage (google-genai for Python, @google/genai for JavaScript/TypeScript), model selection, and API capabilities.

fal-image-edit

242

from aiskillstore/marketplace

AI-powered image editing with style transfer and object removal

azure-ai-vision-imageanalysis-py

242

from aiskillstore/marketplace

Azure AI Vision Image Analysis SDK for captions, tags, objects, OCR, people detection, and smart cropping. Use for computer vision and image understanding tasks. Triggers: "image analysis", "computer vision", "OCR", "object detection", "ImageAnalysisClient", "image caption".

azure-ai-vision-imageanalysis-java

242

from aiskillstore/marketplace

Build image analysis applications with Azure AI Vision SDK for Java. Use when implementing image captioning, OCR text extraction, object detection, tagging, or smart cropping.

nano-image-generator

242

from aiskillstore/marketplace

Generate images using Nano Banana Pro (Gemini 3 Pro Preview). Use when creating app icons, logos, UI graphics, marketing banners, social media images, illustrations, diagrams, or any visual assets. Supports reference images for style transfer and character consistency. Triggers include phrases like 'generate an image', 'create a graphic', 'make an icon', 'design a logo', 'create a banner', 'same style as', 'keep the style', or any request needing visual content.