openclaw-media-gen
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Best use case
openclaw-media-gen is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using openclaw-media-gen should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/aisa-media-gen/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How openclaw-media-gen Compares
| Feature / Agent | openclaw-media-gen | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Generate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# OpenClaw Media Gen 🎬
Generate **images** and **videos** with a single AIsa API key:
- **Images**: `gemini-3-pro-image-preview` (Gemini GenerateContent)
- **Video**: `wan2.6-t2v` (Tongyi Wanxiang / Qwen Wan 2.6, asynchronous task)
For the API documentation index, see the [AIsa API Reference](https://aisa.mintlify.app/api-reference/introduction) (all pages are listed at `https://aisa.mintlify.app/llms.txt`).
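## 🔥 What You Can Do
### Image Generation (Gemini)
```
"Generate a cyberpunk city nightscape: neon lights, rainy night, cinematic feel"
```
### Video Generation (Wan 2.6)
```
"Generate a 5-second shot from a reference image: slow camera push-in, wind blowing through hair, cinematic feel, shallow depth of field"
```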
## Quick Start
```bash
export AISA_API_KEY="your-key"
```
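In a Python script, the exported key can be read from the environment and turned into request headers. The following is a minimal sketch: the helper name `auth_headers` is hypothetical, and the `Authorization: Bearer` scheme is taken from the curl examples in this document.

```python
import os


def auth_headers(env=None):
    """Build request headers from AISA_API_KEY (hypothetical helper name)."""
    env = os.environ if env is None else env
    key = env.get("AISA_API_KEY", "").strip()
    if not key:
        # Fail early with a hint instead of sending an unauthenticated request.
        raise RuntimeError("AISA_API_KEY is not set; export it before running")
    return {
        "Authorization": f"Bearer {key}",
        "Content-Type": "application/json",
    }
```

Passing a plain dict as `env` makes the helper easy to exercise without touching the real environment.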
---
## 🖼️ Image Generation (Gemini)
### Endpoint
- Base URL: `https://api.aisa.one/v1`
- `POST /models/{model}:generateContent`
Docs: `google-gemini-chat` (GenerateContent), see `https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md`.
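The endpoint pattern above can be assembled programmatically. This sketch only builds the URL and JSON body and sends nothing; the function name `build_generate_content_request` is hypothetical, and the payload shape mirrors the curl example in this section.

```python
import json
from urllib.parse import quote

BASE_URL = "https://api.aisa.one/v1"  # base URL from the Endpoint section


def build_generate_content_request(model, prompt):
    """Return (url, body_bytes) for POST /models/{model}:generateContent."""
    # Escape the model name so it is safe to place in the URL path.
    url = f"{BASE_URL}/models/{quote(model, safe='')}:generateContent"
    payload = {"contents": [{"role": "user", "parts": [{"text": prompt}]}]}
    return url, json.dumps(payload).encode("utf-8")


url, body = build_generate_content_request(
    "gemini-3-pro-image-preview", "A cute red panda, cinematic lighting"
)
print(url)
```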
### curl example (an image is returned as `inline_data`)
```bash
curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
-H "Authorization: Bearer $AISA_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"contents":[
{"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
]
}'
```
> Note: the response may contain `candidates[].parts[].inline_data` (typically base64 data plus a MIME type); the client script parses it and saves the file automatically.
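The parsing step described in the note can be sketched as follows. This is not the bundled client script: `extract_images` is a hypothetical helper, and the fallbacks to `candidate["content"]["parts"]` and the camelCase keys `inlineData` / `mimeType` are assumptions based on how Google's Gemini REST API commonly shapes responses.

```python
import base64


def extract_images(response):
    """Return a list of (mime_type, raw_bytes) for every inline image part."""
    images = []
    for candidate in response.get("candidates", []):
        # Assumption: parts may sit directly on the candidate (as in the note
        # above) or under candidate["content"], as in Google's Gemini API.
        parts = candidate.get("parts") or candidate.get("content", {}).get("parts", [])
        for part in parts:
            blob = part.get("inline_data") or part.get("inlineData")
            if not blob or "data" not in blob:
                continue  # text-only part, nothing to decode
            mime = (
                blob.get("mime_type")
                or blob.get("mimeType")
                or "application/octet-stream"
            )
            images.append((mime, base64.b64decode(blob["data"])))
    return images
```

Each returned byte string can then be written to disk in binary mode, for example with `open("out.png", "wb")`.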
---
## 🎞️ Video Generation (Qwen Wan 2.6 / Tongyi Wanxiang)
### Create task
- Base URL: `https://api.aisa.one/apis/v1`
- `POST /services/aigc/video-generation/video-synthesis`
- Header: `X-DashScope-Async: enable` (required; the task runs asynchronously)
Docs: `video-generation`, see `https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md`.
```bash
curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
-H "Authorization: Bearer $AISA_API_KEY" \
-H "Content-Type: application/json" \
-H "X-DashScope-Async: enable" \
-d '{
"model":"wan2.6-t2v",
"input":{
"prompt":"cinematic close-up, slow push-in, shallow depth of field",
"img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
},
"parameters":{
"resolution":"720P",
"duration":5,
"shot_type":"single",
"watermark":false
}
}'
```
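The same request can be prepared with Python's standard library. This sketch builds a `urllib.request.Request` without sending it; the function name `build_video_task_request` is hypothetical, and the body fields are copied from the curl example above.

```python
import json
import urllib.request

VIDEO_BASE_URL = "https://api.aisa.one/apis/v1"  # base URL from "Create task"


def build_video_task_request(api_key, prompt, img_url, duration=5):
    """Prepare (but do not send) the asynchronous video-synthesis request."""
    body = {
        "model": "wan2.6-t2v",
        "input": {"prompt": prompt, "img_url": img_url},
        "parameters": {
            "resolution": "720P",
            "duration": duration,
            "shot_type": "single",
            "watermark": False,
        },
    }
    return urllib.request.Request(
        f"{VIDEO_BASE_URL}/services/aigc/video-generation/video-synthesis",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
            "X-DashScope-Async": "enable",  # required: the task is asynchronous
        },
        method="POST",
    )
```

Passing the returned object to `urllib.request.urlopen` would submit the task; the response is expected to carry the task ID used in the polling step.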
### Poll task
- `GET /services/aigc/tasks?task_id=...`
Docs: `task`, see `https://aisa.mintlify.app/api-reference/aliyun/video/task.md`.
```bash
curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
-H "Authorization: Bearer $AISA_API_KEY"
```
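Because the task is asynchronous, a caller typically repeats this request until the task finishes. The loop below is a sketch under stated assumptions: `wait_for_task` and the injected `fetch_status` callable are hypothetical, and the `output.task_status` field with values such as `SUCCEEDED` and `FAILED` follows DashScope's usual convention rather than anything confirmed in this document. The defaults mirror the `--poll 10 --timeout 600` flags shown for the Python client.

```python
import time


def wait_for_task(fetch_status, task_id, poll=10.0, timeout=600.0, sleep=time.sleep):
    """Poll until the task succeeds, fails, or the timeout elapses.

    fetch_status(task_id) must return the decoded JSON of the task endpoint.
    """
    deadline = time.monotonic() + timeout
    while True:
        output = fetch_status(task_id).get("output", {})
        status = output.get("task_status")  # assumed DashScope-style field
        if status == "SUCCEEDED":
            return output
        if status in ("FAILED", "CANCELED", "UNKNOWN"):
            raise RuntimeError(f"task {task_id} ended with status {status}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"task {task_id} not finished after {timeout}s")
        sleep(poll)
```

Injecting `fetch_status` and `sleep` keeps the loop independent of any HTTP library and lets it be tested with canned responses.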
---
## Python Client
```bash
# Generate an image (saved to a local file)
python3 {baseDir}/scripts/media_gen_client.py image \
--prompt "A cute red panda, cinematic lighting" \
--out "out.png"
# Create a video task (requires img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
--prompt "cinematic close-up, slow push-in" \
--img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
--duration 5
# Poll the task status
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID
# Wait until the task succeeds (optionally prints video_url on success)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600
# Wait until the task succeeds and download the mp4 automatically
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4
```