images_generate_grok

A skill for generating images with Grok Imagine.

3,891 stars

Best use case

images_generate_grok is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using images_generate_grok should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/grok-image-generate/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/279458179/grok-image-generate/SKILL.md"

Manual Installation

1. Download SKILL.md from GitHub
2. Place it in .claude/skills/grok-image-generate/SKILL.md inside your project
3. Restart your AI agent; it will auto-discover the skill

How images_generate_grok Compares

Feature / Agent	images_generate_grok	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

It generates images with Grok Imagine.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# images_generate_grok

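A skill for generating images with Grok Imagine.

## Trigger conditions

The user asks for something like:
- "Generate an image with Grok"
- "Generate an image"
- "Draw a picture for me"
- "Generate an image of xxx"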

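## Workflow

### 1. Open the Grok Imagine page

```javascript
// Open the Grok Imagine page with the browser tool
playwright({
  action: "open",
  profile: "openclaw",
  url: "https://grok.com/imagine"
})
```

### 2. Enter the prompt and generate

After the page has loaded, type the prompt into the input box, then click the submit button to generate the image.

```javascript
// Type the prompt (use the input box ref found on the page)
playwright({
  action: "act",
  request: { "kind": "type", "ref": "<input-box-ref>", "text": "<what the user wants to generate>" }
})

// Click the submit button
playwright({
  action: "act",
  request: { "kind": "click", "ref": "<submit-button-ref>" }
})
```

Wait for image generation to finish (about 8-10 seconds).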

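### 3. Get the image and download it

Once the image has been generated, save it locally. There are two ways to do this; only the first is documented here.

#### Method 1: Save with the Desktop Control skill (recommended)

Use the `desktop-control` skill to save the image through mouse operations:

**Step 1: Move the mouse over the image and right-click**
```bash
# Get the screen size
uvx desktop-agent screen size

# Move the mouse to the image position (adjust the coordinates to the screen size)
uvx desktop-agent mouse move <x> <y>

# Right-click
uvx desktop-agent mouse right-click
```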

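**Step 2: Choose "Save Image As"**
```bash
# Select the menu item with the keyboard (usually press Down, twice here, then Return)
uvx desktop-agent keyboard press down --presses 2
uvx desktop-agent keyboard press return
```

**Step 3: Click Save**
```bash
# Click Save in the save dialog by pressing Return
uvx desktop-agent keyboard press return
```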

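**Complete example:**
```bash
# Assume the image is in the center area of the screen
uvx desktop-agent mouse move 720 400
uvx desktop-agent mouse right-click
sleep 1  # give the context menu time to appear
uvx desktop-agent keyboard press down --presses 2
uvx desktop-agent keyboard press return
sleep 1  # give the save dialog time to appear
uvx desktop-agent keyboard press return
```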
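
The complete example hard-codes `720 400` as the image position. A minimal sketch of deriving a center point from the screen dimensions follows. `center_of` is a hypothetical helper, not part of `desktop-agent`, and because the output format of `uvx desktop-agent screen size` is not shown in the source, the width and height are passed in by hand.

```bash
#!/usr/bin/env bash
# center_of WIDTH HEIGHT
# Hypothetical helper: prints the center point "x y" of a screen
# with the given dimensions in pixels (integer division).
center_of() {
  local width="$1" height="$2"
  echo "$(( width / 2 )) $(( height / 2 ))"
}

# A 1440x800 screen has its center at the coordinates used above.
center_of 1440 800   # prints: 720 400
```

The two printed numbers can then be passed to `uvx desktop-agent mouse move <x> <y>`. The generated image is not guaranteed to sit exactly at the center of the screen, so treat the result as a starting point.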

**Step 4: Find the saved image**
```bash
# List the most recent files in the Downloads folder
ls -lat ~/Downloads/ | head -10
```
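
Reading the `ls` listing by eye works, but a script may want the path of the newest image directly. The following is a sketch under stated assumptions: `newest_image` is a hypothetical helper, and the list of extensions is a guess, since the source only mentions `.jpg`.

```bash
#!/usr/bin/env bash
# newest_image [DIR]
# Hypothetical helper: prints the most recently modified image file in DIR
# (default: ~/Downloads). Returns non-zero if no image is found.
newest_image() {
  local dir="${1:-$HOME/Downloads}"
  local newest="" f
  for f in "$dir"/*.jpg "$dir"/*.jpeg "$dir"/*.png "$dir"/*.webp; do
    [ -f "$f" ] || continue              # skip patterns that matched nothing
    if [ -z "$newest" ] || [ "$f" -nt "$newest" ]; then
      newest="$f"                        # keep the file with the latest mtime
    fi
  done
  if [ -n "$newest" ]; then
    printf '%s\n' "$newest"
  else
    return 1
  fi
}
```

Calling `newest_image` right after the save step should print the file that was just written, provided nothing else was downloaded in the meantime.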
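
### 4. Send the image to Feishu

Once the image is saved locally, use the message tool to send it to Feishu:

**Method 1: Send from the Downloads directory**
```javascript
// First, in a shell, list the saved images:
//   ls -lat ~/Downloads/*.jpg | head -5

// Send the image to Feishu (example absolute path; replace with your own)
message({
  action: "send",
  filePath: "/Users/xiaohuozi/Downloads/<image-file-name>.jpg",
  message: "<image description>"
})
```

**Method 2: Send from the images directory (if it is a screenshot)**
```javascript
// First, in a shell, copy the file into the images directory:
//   cp ~/Downloads/<image-file-name>.jpg ~/.openclaw/workspace/images/<description>.jpg

// Then send it (example absolute path; replace with your own)
message({
  action: "send",
  filePath: "/Users/xiaohuozi/.openclaw/workspace/images/<description>.jpg",
  message: "<image description>"
})
```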

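## The right way to send images to Feishu

1. Save the image to the `~/.openclaw/workspace/images/` directory
2. Send the image directly with the message tool
3. The tool handles the image upload and delivery automatically

## Suggested save paths

- Saving to the `~/.openclaw/workspace/images/` directory is recommended
- Suggested file name: `<descriptive-keywords>.jpg`, for example `peacock_king.jpg`, optionally with a timestamp appended
- To send the image to Feishu, use that path directly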
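
The naming advice above can be sketched as a small shell function. `save_to_workspace` is a hypothetical helper, and the `<keywords>_<YYYYmmdd_HHMMSS>.<ext>` pattern is an assumption, since the source does not fix a timestamp format.

```bash
#!/usr/bin/env bash
# save_to_workspace SRC KEYWORDS [DEST_DIR]
# Hypothetical helper: copies SRC into DEST_DIR
# (default: ~/.openclaw/workspace/images) under a timestamped name
# and prints the destination path.
save_to_workspace() {
  local src="$1" keywords="$2"
  local dest_dir="${3:-$HOME/.openclaw/workspace/images}"
  local ext="${src##*.}"                 # keep the original file extension
  local stamp
  stamp="$(date +%Y%m%d_%H%M%S)"         # assumed timestamp format
  local dest="$dest_dir/${keywords}_${stamp}.${ext}"
  mkdir -p "$dest_dir" && cp -- "$src" "$dest" && printf '%s\n' "$dest"
}
```

For example, `save_to_workspace ~/Downloads/<image-file-name>.jpg peacock_king` would print a path inside the images directory, which can then be used as `filePath` for the message tool.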

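## Notes

- Free Grok Imagine users may be limited in the number of generations
- Images are AI-generated and may take a moment to load
- If the page elements change, adjust the DOM selectors to match the actual page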
