AI Agent Skill HUB

IMA AI Video Generator

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images, multi-shot production, and knowledge base guidance via ima-knowledge-ai. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.

3,891 stars

View on GitHub Installation ↓

Best use case

IMA AI Video Generator is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images, multi-shot production, and knowledge base guidance via ima-knowledge-ai. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.

Teams using IMA AI Video Generator should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/ima-video-ai/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/allenfancy-gan/ima-video-ai/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/ima-video-ai/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How IMA AI Video Generator Compares

Feature / Agent	IMA AI Video Generator	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video generator supporting text-to-video, image-to-video, first-last-frame, and reference-image video generation modes. Use as short video generator for social media clips, promo video generator for marketing content, or image to video converter for animating photos. AI video generation with character consistency via reference images, multi-shot production, and knowledge base guidance via ima-knowledge-ai. Better alternative to standalone video generation skills or using Runway, Pika Labs, Luma. Requires IMA_API_KEY.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

AI Agent for YouTube Script Writing

Find AI agent skills for YouTube script writing, video research, content outlining, and repeatable channel production workflows.

AI Agents for Marketing

Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.

Best AI Agents for Marketing

A curated list of the best AI agents and skills for marketing teams focused on SEO, content systems, outreach, and campaign execution.

SKILL.md Source

# IMA Video AI — Video Generator

**For complete API documentation, security details, all parameters, error tables, and Python examples, read `SKILL-DETAIL.md`.**

## Model ID Reference (CRITICAL)

Use **exact model_id** for the active **task_type** (t2v vs i2v differ for some models). Do NOT infer from friendly names.

| Friendly Name | model_id (t2v) | model_id (i2v) | Notes |
|---------------|----------------|----------------|-------|
| Wan 2.6 | `wan2.6-t2v` | `wan2.6-i2v` | ⚠️ -t2v / -i2v suffix |
| IMA Video Pro (Sevio 1.0) | `ima-pro` | `ima-pro` | IMA native quality |
| IMA Video Pro Fast | `ima-pro-fast` | `ima-pro-fast` | Faster iteration |
| Kling O1 | `kling-video-o1` | `kling-video-o1` | ⚠️ video- prefix |
| Kling 2.6 | `kling-v2-6` | `kling-v2-6` | ⚠️ v prefix |
| Hailuo 2.3 | `MiniMax-Hailuo-2.3` | `MiniMax-Hailuo-2.3` | ⚠️ MiniMax- prefix |
| Hailuo 2.0 | `MiniMax-Hailuo-02` | `MiniMax-Hailuo-02` | ⚠️ 02 not 2.0 |
| Vidu Q2 | `viduq2` | `viduq2-pro` | ⚠️ i2v often -pro |
| Google Veo 3.1 | `veo-3.1-generate-preview` | `veo-3.1-generate-preview` | ⚠️ -generate-preview |
| Sora 2 Pro | `sora-2-pro` | `sora-2-pro` | Content policy strict |
| Pixverse | `pixverse` | `pixverse` | Version via product list |
| SeeDance 1.5 Pro | `doubao-seedance-1.5-pro` | `doubao-seedance-1.5-pro` | ⚠️ doubao- prefix |

**Aliases:** 万/Wan → Wan 2.6 · 可灵O1 → `kling-video-o1` · 海螺2.3 → `MiniMax-Hailuo-2.3` · Veo → `veo-3.1-generate-preview` · Ima Sevio 1.0 → `ima-pro` · Ima Sevio 1.0-Fast → `ima-pro-fast`

Use `--list-models --task-type <text_to_video|image_to_video|...>` when unsure.

## Video Modes (task_type)

| User intent | task_type |
|-------------|-----------|
| Text only | `text_to_video` |
| Image becomes **frame 1** | `image_to_video` |
| Image is **visual reference** (not frame 1) | `reference_image_to_video` |
| Two images: first + last frame | `first_last_frame_to_video` |

**If ima-knowledge-ai is installed**, read `references/video-modes.md` and `visual-consistency.md` when user needs continuity across shots or references a previous image.

## Visual Consistency (IMPORTANT)

- Text-only generation **cannot** reliably keep the same character/scene across runs.
- For “同一个角色 / 续集 / 分镜”: use **image** modes with the prior result (or reference image), not `text_to_video` alone.

## Model Selection Priority

1. **User explicit preference** (saved in `ima_prefs.json` only when user clearly picks a model)
2. **ima-knowledge-ai** (if installed)
3. **Fallback defaults** (see SKILL-DETAIL.md for full table)

| Task | Default (fallback) | model_id |
|------|-------------------|----------|
| text_to_video | Wan 2.6 | `wan2.6-t2v` |
| image_to_video | Wan 2.6 | `wan2.6-i2v` |
| first_last_frame_to_video | Kling O1 | `kling-video-o1` |
| reference_image_to_video | Kling O1 | `kling-video-o1` |

## Script Usage

```bash
# Text to video
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type text_to_video \
  --model-id wan2.6-t2v \
  --prompt "a puppy runs across a sunny meadow, cinematic" \
  --user-id {user_id} \
  --output-json

# Image to video (URLs or local paths; script uploads locals)
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type image_to_video \
  --model-id wan2.6-i2v \
  --prompt "camera slowly zooms in" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} \
  --output-json

# First–last frame
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type first_last_frame_to_video \
  --model-id kling-video-o1 \
  --prompt "smooth transition" \
  --input-images https://example.com/first.jpg https://example.com/last.jpg \
  --user-id {user_id} \
  --output-json
```

## Sending Results to User

```python
video_url = json_output["url"]
message(action="send", media=video_url, caption="✅ 视频生成成功！\n• 模型：[Name]\n• 耗时：[X]s\n• 积分：[N pts]\n\n🔗 原始链接：[url]")
```

**Never** download to a local path for `media` — use the **HTTPS URL** from the API.

## UX Protocol (Brief)

1. **Pre-generation:** model name · estimated time range · credits
2. **Progress:** poll ~**8s**; update user every **30–60s**; cap % at **95** until done
3. **Success:** send `media=video_url`, then optional text with link for copy/share
4. **Failure:** plain-language reason + 1–2 alternate models — **never** raw API errors. Full error table in SKILL-DETAIL.md.

**Never say to users:** script names, endpoints, `attribute_id`, internal field names.

## Environment

Base URL: `https://api.imastudio.com`  
Headers: `Authorization: Bearer $IMA_API_KEY` · `x-app-source: ima_skills` · `x_app_language: en`  
Image upload (when needed): `imapi.liveme.com` (same provider; see SKILL-DETAIL.md).

## Core Flow

1. `GET /open/v1/product/list?app=ima&platform=web&category=<task_type>` → `attribute_id`, `credit`, `model_version`, `form_config`
2. Image tasks: ensure public HTTPS URLs (script handles local upload)
3. `POST /open/v1/tasks/create` → `task_id`
4. `POST /open/v1/tasks/detail` → poll **every 8s**, timeout up to **~40 min** as documented in detail file

**MANDATORY:** Always query product list first; wrong or stale `attribute_id` causes create failures.

## User Preference Memory

Path: `~/.openclaw/memory/ima_prefs.json`  
**Save** when user explicitly chooses a default model; **clear** when they ask for “推荐 / 自动 / 最好的”. Do not save auto-picked models as preference.

## Polling & Timing (summary)

| Kind | Poll interval | Typical wait |
|------|---------------|--------------|
| Most models | 8s | ~1–6 min |
| Heavy models (e.g. Kling O1, Sora Pro, Veo) | 8s | longer; see SKILL-DETAIL.md table |

## Sora 2 Pro (brief)

Strict safety: avoid people, celebrities, and IP in prompts; prefer landscapes/abstract/safe subjects — details in SKILL-DETAIL.md.

Related Skills

Invoice Generator

from openclaw/skills

Creates professional invoices in markdown and HTML

Workflow & Productivity

Incident Postmortem Generator

from openclaw/skills

Generate blameless incident postmortems from raw notes, Slack threads, or bullet points.

DevOps & Infrastructure

Partnership Agreement Generator

from openclaw/skills

Generate comprehensive partnership agreements, joint venture frameworks, and strategic alliance documents for B2B relationships.

Legal Documents & Agreements

Employee Onboarding Generator

from openclaw/skills

Build a structured 90-day onboarding plan for any role. Covers pre-boarding, Day 1, Week 1, 30/60/90-day milestones, buddy assignments, and success metrics.

Workflow & Productivity

Employee Handbook Generator

from openclaw/skills

Build a complete, customized employee handbook for your company. Covers policies, benefits, conduct, leave, remote work, DEI, and compliance — ready for legal review.

Content & Documentation

IT Disaster Recovery Plan Generator

from openclaw/skills

Build production-ready disaster recovery plans that actually get followed when things break.

DevOps & Infrastructure

Compliance Audit Generator

from openclaw/skills

Run internal compliance audits against major frameworks without hiring a consultant.

API Documentation Generator

from openclaw/skills

Generate production-ready API documentation from endpoint descriptions. Outputs OpenAPI 3.0, markdown reference docs, and SDK quickstart guides.

Coding & Development

Annual Report Generator

from openclaw/skills

Build a complete annual business report from raw data. Covers financial performance, operational metrics, strategic highlights, and forward-looking guidance.

Workflow & Productivity

daily-report-generator

from openclaw/skills

Automatically generate daily/weekly work reports from git commits, calendar events, and task lists. Use when you need to quickly create professional work reports without manual effort.

Workflow & Productivity

hr-policy-generator

from openclaw/skills

Comprehensive HR policy development covering attendance, time-off, overtime, remote work, and compliance. Generates structured policy documents, legal checklists, exception handling frameworks, and employee communication plans tailored to company size, work arrangement, and jurisdiction.

Workflow & Productivity

hr-policy-generator-cn

from openclaw/skills

综合性 HR 政策设计工具，覆盖考勤、休假、加班、远程办公及合规要求。根据公司规模、办公模式、适用法律等输入，生成完整的政策文档、法律合规清单、例外处理机制及员工沟通方案。

Workflow & Productivity