vidu-video

使用 Vidu Q3 Pro 模型生成视频。当用户想要文生视频、生成带音频的视频，或提到 vidu 时使用此 skill。

16 stars

bydiegosouzapw

View on GitHub Installation ↓

Best use case

vidu-video is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

使用 Vidu Q3 Pro 模型生成视频。当用户想要文生视频、生成带音频的视频，或提到 vidu 时使用此 skill。

Teams using vidu-video should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/vidu-video/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/content-media/vidu-video/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/vidu-video/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How vidu-video Compares

Feature / Agent	vidu-video	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

使用 Vidu Q3 Pro 模型生成视频。当用户想要文生视频、生成带音频的视频，或提到 vidu 时使用此 skill。

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Vidu Q3 视频生成

Vidu 最新 Q3 Pro 文生视频模型，支持多种分辨率（360p~1080p），可自定义时长和宽高比，支持直接生成带音频的视频。

## 可用模型

| 模型 ID | 功能 | 说明 |
|--------|------|------|
| `fal-ai/vidu/q3/text-to-video` | 文生视频 | 从文字描述生成视频，支持带音频输出 |

## 定价

按 **视频秒数 × 分辨率** 计费：

| 分辨率 | 价格（积分/秒） | 5秒视频 | 10秒视频 |
|--------|---------------|---------|---------|
| 360p / 540p | 28 积分/秒 | 140 积分 | 280 积分 |
| 720p / 1080p | 62 积分/秒 | 310 积分 | 620 积分 |

## 工作流

### 1. 调用 submit_task

使用 MCP 工具 `submit_task` 提交任务：

```json
{
  "model_id": "fal-ai/vidu/q3/text-to-video",
  "parameters": {
    "prompt": "In an ultra-realistic fashion photography style featuring light blue and pale amber tones, an astronaut in a spacesuit walks through the fog.",
    "duration": 5,
    "resolution": "720p",
    "aspect_ratio": "16:9",
    "audio": true
  }
}
```

### 参数说明

| 参数 | 类型 | 必填 | 默认值 | 说明 |
|-----|------|-----|-------|------|
| prompt | string | **是** | - | 视频生成提示词（最大 2000 字符） |
| duration | integer | 否 | 5 | 视频时长（秒），1-16 |
| resolution | string | 否 | "720p" | 分辨率：360p / 540p / 720p / 1080p |
| aspect_ratio | string | 否 | "16:9" | 宽高比：16:9 / 9:16 / 4:3 / 3:4 / 1:1 |
| audio | boolean | 否 | true | 是否生成带音频的视频 |
| seed | integer | 否 | - | 随机种子，用于复现结果 |

## 查询任务状态

提交任务后会返回 `task_id`，使用 `get_task` 查询结果：

```json
{
  "task_id": "返回的任务ID"
}
```

任务状态：
- `pending` - 排队中
- `processing` - 处理中
- `completed` - 完成，结果在 `result` 中
- `failed` - 失败，查看 `error` 字段

## 完整示例

**用户请求**：生成一段宇航员在雾中行走的视频

**执行步骤**：

1. 调用 `submit_task`：
```json
{
  "model_id": "fal-ai/vidu/q3/text-to-video",
  "parameters": {
    "prompt": "In an ultra-realistic fashion photography style featuring light blue and pale amber tones, an astronaut in a spacesuit walks through the fog.",
    "duration": 5,
    "resolution": "720p",
    "aspect_ratio": "16:9"
  }
}
```

2. 获取 `task_id` 后调用 `get_task` 查询结果

## 提示词技巧

1. **描述清晰**：详细描述场景、动作、光线和色调
2. **英文优先**：使用英文提示词通常效果更好
3. **风格关键词**：加入 "cinematic"、"ultra-realistic"、"photography style" 等风格词
4. **分辨率选择**：360p/540p 成本低适合测试，720p/1080p 适合正式输出
5. **音频功能**：默认开启 audio=true，生成的视频会自带声音

## 注意事项

- 720p 和 1080p 分辨率价格是 360p/540p 的 2.2 倍
- 视频时长支持 1-16 秒灵活选择
- 支持多种宽高比，适合不同平台的内容需求

Related Skills

videodb-skills

from diegosouzapw/awesome-omni-skill

Upload, stream, search, edit, transcribe, and generate AI video and audio using the VideoDB SDK.

videocut:安装

from diegosouzapw/awesome-omni-skill

环境准备。安装依赖、下载模型。触发词：安装、环境准备、初始化

video

from diegosouzapw/awesome-omni-skill

Generate videos using fal.ai (Wan, Kling) or Sora. Text-to-video and image-to-video.

video-toolkit

from diegosouzapw/awesome-omni-skill

Intelligent video processor for downloading media and extracting transcripts from YouTube and 1000+ supported sites. Automatically handles format selection, subtitle extraction, and post-processing.

video-processing-editing

from diegosouzapw/awesome-omni-skill

FFmpeg automation for cutting, trimming, concatenating videos. Audio mixing, timeline editing, transitions, effects. Export optimization for YouTube, social media. Subtitle handling, color grading, batch processing. Use for videogen projects, content creation, automated video production. Activate on "video editing", "FFmpeg", "trim video", "concatenate", "transitions", "export optimization". NOT for real-time video editing UI, 3D compositing, or motion graphics.

video-commercial

from diegosouzapw/awesome-omni-skill

Generate 30-second video commercials from a concept. Creates storyboard, generates scene images, adds narration via ElevenLabs, assembles final video. Use when asked to create commercials, promo videos, video ads, or short marketing videos.

video-analyzer

from diegosouzapw/awesome-omni-skill

鏅鸿兘鍒嗘瀽 Bilibili/YouTube/鏈湴瑙嗛锛岀敓鎴愯浆鍐欍€佽瘎浼板拰鎬荤粨銆傛敮鎸佸叧閿抚鎴浘鑷姩宓屽叆銆?

Media Uploader - R2/S3 with video download

from diegosouzapw/awesome-omni-skill

Upload files or download videos from popular platforms (YouTube, Vimeo, Bilibili, etc.) and upload to Cloudflare R2, AWS S3, or any S3-compatible storage with secure presigned download links.

ltxv2-video

from diegosouzapw/awesome-omni-skill

Build LTX-V2 19B video workflows — text-to-video, image-to-video, distilled model, camera control LoRAs, and two-stage upscaling

edu-video-analyzer

from diegosouzapw/awesome-omni-skill

Analyze educational YouTube channels for classroom adoption potential, curriculum alignment, and pedagogical effectiveness. Use when comparing educational video content (like MRU vs Crash Course), evaluating teaching methodologies, identifying content gaps for course design, or developing educational video strategy focused on student learning outcomes rather than monetization.

Automate YouTube Top-Ten Video Creation with OpenAI and Safe Image Search

from diegosouzapw/awesome-omni-skill

Integrates OpenAI API for content generation, Bing Image Search API for safe image retrieval, and Pexels API for video footage. Handles authentication via Bearer token, enforces safe search, formats ChatGPT responses into a top-ten list, and includes error handling for API failures.

apex-video-generator

from diegosouzapw/awesome-omni-skill

Generate real estate marketing videos from property data. Use when creating property showcases, social media content, market reports, or neighborhood tours. Integrates Firecrawl scraped data with Remotion rendering.