videocut-install

环境准备。安装依赖、下载模型、验证环境。触发词：安装、环境准备、初始化

154 stars

byzrt-ai-lab

View on GitHub Installation ↓

Best use case

videocut-install is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

环境准备。安装依赖、下载模型、验证环境。触发词：安装、环境准备、初始化

Teams using videocut-install should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/videocut-install/SKILL.md --create-dirs "https://raw.githubusercontent.com/zrt-ai-lab/opencode-skills/main/videocut-install/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/videocut-install/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How videocut-install Compares

Feature / Agent	videocut-install	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

环境准备。安装依赖、下载模型、验证环境。触发词：安装、环境准备、初始化

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

<!--
input: 无
output: 环境就绪
pos: 前置 skill，首次使用前运行

架构守护者：一旦我被修改，请同步更新：
1. ../README.md 的 Skill 清单
2. /CLAUDE.md 路由表
-->

# 安装

> 首次使用前的环境准备

## 快速使用

```
用户: 安装环境
用户: 初始化
用户: 下载模型
```

## 依赖清单

| 依赖 | 用途 | 安装命令 |
|------|------|----------|
| funasr | 口误识别 | `pip install funasr` |
| modelscope | 模型下载 | `pip install modelscope` |
| openai-whisper | 字幕生成 | `pip install openai-whisper` |
| ffmpeg | 视频剪辑 | `brew install ffmpeg` |

## 模型清单

### FunASR 模型（口误识别用）

首次运行自动下载到 `~/.cache/modelscope/`：

| 模型 | 大小 | 用途 |
|------|------|------|
| paraformer-zh | 953MB | 语音识别（带时间戳） |
| punc_ct | 1.1GB | 标点预测 |
| fsmn-vad | 4MB | 语音活动检测 |
| **小计** | **~2GB** | |

### Whisper 模型（字幕生成用）

首次运行自动下载到 `~/.cache/whisper/`：

| 模型 | 大小 | 用途 |
|------|------|------|
| large-v3 | 2.9GB | 字幕转录（质量最好） |

### 总计

约 **5GB** 模型文件

## 安装流程

```
1. 安装 Python 依赖
       ↓
2. 安装 FFmpeg
       ↓
3. 下载 FunASR 模型（口误识别）
       ↓
4. 下载 Whisper 模型（字幕生成）
       ↓
5. 验证环境
```

## 执行步骤

### 1. 安装 Python 依赖

```bash
pip install funasr modelscope openai-whisper
```

### 2. 安装 FFmpeg

```bash
# macOS
brew install ffmpeg

# Ubuntu
sudo apt install ffmpeg

# 验证
ffmpeg -version
```

### 3. 下载 FunASR 模型（约2GB）

```python
from funasr import AutoModel

model = AutoModel(
    model="paraformer-zh",
    vad_model="fsmn-vad",
    punc_model="ct-punc",
)
print("FunASR 模型下载完成")
```

### 4. 下载 Whisper 模型（约3GB）

```python
import whisper

model = whisper.load_model("large-v3")
print("Whisper 模型下载完成")
```

### 5. 验证环境

```python
from funasr import AutoModel

model = AutoModel(
    model="paraformer-zh",
    vad_model="fsmn-vad",
    punc_model="ct-punc",
    disable_update=True
)

# 测试转录（用任意音频/视频）
result = model.generate(input="test.mp4")
print("文本:", result[0]['text'][:50])
print("时间戳数量:", len(result[0]['timestamp']))
print("✅ 环境就绪")
```

## 常见问题

### Q1: 模型下载慢

**解决**：使用国内镜像或手动下载

### Q2: ffmpeg 命令找不到

**解决**：确认已安装并添加到 PATH

```bash
which ffmpeg  # 应该输出路径
```

### Q3: funasr 导入报错

**解决**：检查 Python 版本（需要 3.8+）

```bash
python3 --version
```