zhipu-free-image-video

智谱免费图片与视频生成技能。适用于用户想用智谱生成图片、批量出图、生成短视频、查询视频任务结果、等待视频完成，或优先使用免费/低成本模型快速产出创意内容时。

3,891 stars

Best use case

zhipu-free-image-video is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using zhipu-free-image-video should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/zhipu-free-image-video/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/156554395/zhipu-free-image-video/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/zhipu-free-image-video/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How zhipu-free-image-video Compares

Feature / Agent	zhipu-free-image-video	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agent for YouTube Script Writing

Find AI agent skills for YouTube script writing, video research, content outlining, and repeatable channel production workflows.

AI Agents for Marketing

Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.

AI Agents for Startups

Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.

SKILL.md Source

# 智谱免费图片与视频生成

把智谱的图片生成和视频生成能力整理成一套适合 OpenClaw 直接调用的创作工作流，重点突出免费或低成本模型的可用性。

适合这些任务：
- 根据一句话快速生成图片
- 一次性批量生成多张不同风格或不同主题的图片
- 根据提示词生成短视频
- 查询视频生成进度并等待任务完成
- 在创意验证阶段优先使用免费模型，降低试错成本

## 适用场景

当用户出现这些意图时启用：
- “帮我生成一张图”
- “批量出几版海报 / 封面 / 配图”
- “用智谱免费模型先做几张看看”
- “生成一个短视频”
- “查一下这个视频任务好了没”
- “等视频生成完成再告诉我”

## 核心定位

这个技能主打两件事：
- 智谱图片生成
- 智谱视频生成

默认优先强调免费或低成本模型：
- 图片优先：`cogview-3-flash`
- 视频优先：`cogvideox-flash`

如果用户明确追求更高质量，再切到更高规格模型。

## 脚本资源

优先使用 `scripts/` 里的可执行脚本来完成图片与视频任务。

可直接复用的脚本：
- `scripts/generate_image.js` - 生成单张图片
- `scripts/batch_generate_images.js` - 批量生成图片
- `scripts/generate_video.js` - 提交视频生成任务
- `scripts/query_video_result.js` - 查询视频任务结果
- `scripts/wait_for_video.js` - 等待视频生成完成
- `scripts/configure_models.js` - 校验任务希望使用的默认图片/视频模型

调用方式统一为：

```bash
node projects/skills/zhipu-free-image-video/scripts/<script>.js '<json>'
```

环境配置默认读取：
- `IMAGE_VIDEO_GENERATION_API_KEY`
- 或 `ZHIPU_API_KEY`

## 默认执行策略

### 1. 先确认目标产物

先判断用户到底要的是：
- 单张图片
- 批量图片
- 视频
- 已提交视频任务的进度或结果

### 2. 默认优先免费模型

如果用户没特别指定：
- 图片默认用免费或低成本模型快速出图
- 视频默认用免费模型快速出结果

这样更适合做灵感探索、风格试错和第一版草稿。

### 3. 批量任务优先控制节奏

批量出图时要注意：
- 分批处理
- 控制并发
- 汇总成功和失败结果
- 避免一次性堆太多请求

### 4. 视频任务默认按异步流程处理

视频生成通常不是即时返回最终结果，因此要按两段处理：
- 先提交任务，拿到任务 ID
- 再查询状态或等待完成

## 常用工作流

### 生成单张图片

适用：海报、封面、配图、头像、概念图。

默认做法：
- 先润色提示词，必要时补主体、风格、镜头、光线、背景
- 优先走免费图像模型
- 返回图片地址、提示词和模型信息

### 批量生成图片

适用：一次要多版候选图。

建议做法：
- 把多条提示词拆成批次
- 明确每一张图的主题差异
- 最终按“提示词 - 结果”方式汇总

适合场景：
- 多版封面
- 多个 IP 角色草图
- 多张文章配图
- 批量创意探索

### 生成视频

适用：短视频创意、动态概念演示、简单分镜验证。

建议做法：
- 提示词尽量包含主体、动作、场景、镜头感
- 默认优先快速模型
- 先返回任务提交结果，再继续查最终结果

### 查询视频结果

适用：用户给了任务 ID，让你看看视频好了没有。

返回时优先说明：
- 当前状态
- 是否完成
- 如果完成，给出视频地址
- 如果失败，给出失败原因或建议重试

### 等待视频完成

适用：用户希望“你等它出完再告诉我”。

处理方式：
- 设置合理的最大等待时间
- 定期查询状态
- 完成后返回最终结果
- 超时则明确告诉用户仍在处理中

## 质量与成本取舍

默认原则：
- 先免费，再高配
- 先快速验证，再精修
- 先出结果，再做多轮迭代

当用户说“先随便来几版看看”“先用免费的”“先低成本试试”时，优先免费模型。

当用户说“质量更高一点”“商业图”“正式发布素材”时，再考虑切换到更高质量模型。

## 风险与边界

- 不要承诺绝对免费永久可用，应该表述为优先使用免费或低成本模型
- 不泄露 API Key 或账户配置
- 批量生成时注意请求规模，避免过度并发
- 如果生成结果涉及敏感、违规或明显侵权内容，要及时收敛
- 视频生成耗时可能较长，要提前告知用户这是异步任务

## 故障排查

常见问题与处理：
- 生成失败：检查账号额度、模型可用性、提示词是否异常
- 视频一直未完成：延长等待时间，或改为稍后查询
- 批量任务部分失败：保留成功结果，单独重试失败项
- 结果不理想：优化提示词，增加风格、镜头、材质、动作细节
- 免费模型效果不够：明确告知用户可以切换更高质量模型

Related Skills

alphashop-image

3891

from openclaw/skills

AlphaShop（遨虾）图像处理 API 工具集。支持11个接口：图片翻译、图片翻译PRO、图片高清放大、图片主题抠图、图片元素识别、图片元素智能消除、图像裁剪、虚拟试衣（创建+查询）、模特换肤（创建+查询）。触发场景：图片翻译、翻译图片文字、放大图片、高清放大、抠图、去背景、检测水印/Logo/文字、消除水印、去牛皮癣、裁剪图片、虚拟试衣、AI试衣、模特换肤、换模特、AlphaShop图像、遨虾图片处理。

Image Processing & Analysis

exa-web-search-free

3891

from openclaw/skills

Free AI search via Exa MCP. Web search for news/info, code search for docs/examples from GitHub/StackOverflow, company research for business intel. No API key needed.

Data & Research

demo-video

3891

from openclaw/skills

Create product demo videos by automating browser interactions and capturing frames. Use when the user wants to record a demo, walkthrough, product showcase, or interactive video of a web application. Supports Playwright CDP screencast for high-quality capture and FFmpeg for video encoding.

Video Production

image-gen

3891

from openclaw/skills

Generate AI images from text prompts. Triggers on: "生成图片", "画一张", "AI图", "generate image", "配图", "create picture", "draw", "visualize", "generate an image".

Content & Documentation

bing-keyword-image-downloader

3891

from openclaw/skills

当用户需要按关键词从 Bing 公开图片搜索结果中批量下载图片时使用。遇到类似“帮我从 Bing 按关键词下载 10 张图片”“批量抓取 Bing 图片”“按关键词保存 Bing 图片到本地”这类请求时，应主动使用这个 skill。它专门处理基于关键词的 Bing 图片搜索、分页收集候选链接、跳过失败源站并保存到本地目录的工作流。

video-summarizer

3891

from openclaw/skills

将 B 站/YouTube/小红书/抖音视频转换为结构化 Notion 总结文档，自动上传截图，一键推送 Notion

video-script-creator

3891

from openclaw/skills

Short video script generator. 短视频脚本生成器、视频脚本、抖音文案、抖音脚本、快手脚本、口播稿、视频拍摄脚本、YouTube脚本、YouTube Shorts脚本、B站脚本、bilibili脚本、分镜脚本、视频大纲、视频文案、短视频创作、Reels脚本、TikTok脚本、vlog脚本、带货脚本、种草视频脚本、系列视频规划、视频数据复盘、完播率分析、前3秒钩子。Generate complete video scripts with hooks, outlines, titles, tags, CTA, storyboards, series planning, and data review. Use when: (1) creating short video scripts for any platform, (2) writing口播稿/talking-head scripts, (3) generating viral video titles, (4) planning video outlines and storyboards, (5) writing opening hooks (first 3 seconds), (6) generating CTA/ending prompts, (7) planning video series, (8) reviewing video performance data. 适用场景：写短视频脚本、拍摄脚本、口播文案、视频策划、爆款标题、开场钩子、结尾引导、完整分镜、系列规划、数据复盘。 Triggers on: video script creator.

doubao-image-video

3891

from openclaw/skills

豆包图片与视频生成原生技能。适用于用户提到豆包、文生图、图生图、文生视频、图生视频、查询视频生成任务、等待任务完成或下载最终视频时，直接调用火山引擎 Ark 接口，不依赖外部 MCP 服务。

IMA AI Video Generator

3891

from openclaw/skills

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images, multi-shot production, and knowledge base guidance via ima-knowledge-ai. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.

free-mission-control

3891

from openclaw/skills

JARVIS Mission Control v2 — free, self-hosted command center for OpenClaw AI agents. Kanban board, real-time chat, Claude Code session tracking, GitHub Issues sync, webhook delivery monitoring, CLI console, agent SOUL editor, and a full Matrix-themed dashboard.

Freelancer Business Autopilot Lite

3891

from openclaw/skills

Free version — generate invoices and weekly client updates from plain-language descriptions.

IMA Seedance 2.0 Video Generator

3891

from openclaw/skills

Seedance 2.0 AI video generator — two models in one skill: Seedance 2.0 (ima-pro) for cinema-grade quality with high frame-rate temporal consistency, precise camera language control, and 2K output; Seedance 2.0 Fast (ima-pro-fast) for faster iteration. Supports text-to-video, image-to-video, first-last-frame, and reference-media video generation with image, video, and audio references. Works for cinematic prompting, storyboard-driven clips, consistent-character workflows, product demos, and short-form content generation. Requires IMA_API_KEY.