hair-cam-anno
安防摄像头视频 VL 模型微调数据集标注工具。用于从安防摄像头视频中提取关键帧、分析视频内容、生成结构化标注(含环境/人物/行为/风险描述),并输出符合 dataset.jsonl 格式的微调训练数据。Use when 用户需要对安防摄像头视频进行数据标注、生成 VL 模型训练数据集、处理 /root/hair-cam 目录下的视频数据,或提及 "hair-cam"、"数据标注"、"视频标注"、"VL模型微调"。
Best use case
hair-cam-anno is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
安防摄像头视频 VL 模型微调数据集标注工具。用于从安防摄像头视频中提取关键帧、分析视频内容、生成结构化标注(含环境/人物/行为/风险描述),并输出符合 dataset.jsonl 格式的微调训练数据。Use when 用户需要对安防摄像头视频进行数据标注、生成 VL 模型训练数据集、处理 /root/hair-cam 目录下的视频数据,或提及 "hair-cam"、"数据标注"、"视频标注"、"VL模型微调"。
Teams using hair-cam-anno should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/sjht-cam-anno/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How hair-cam-anno Compares
| Feature / Agent | hair-cam-anno | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
安防摄像头视频 VL 模型微调数据集标注工具。用于从安防摄像头视频中提取关键帧、分析视频内容、生成结构化标注(含环境/人物/行为/风险描述),并输出符合 dataset.jsonl 格式的微调训练数据。Use when 用户需要对安防摄像头视频进行数据标注、生成 VL 模型训练数据集、处理 /root/hair-cam 目录下的视频数据,或提及 "hair-cam"、"数据标注"、"视频标注"、"VL模型微调"。
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
AI Agents for Startups
Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
SKILL.md Source
# hair-cam-anno — 安防摄像头视频标注
对安防摄像头拍摄的视频进行帧提取、视觉分析、结构化标注,输出 `dataset.jsonl` 格式的 VL 模型微调数据集。
## 工作流程
### 第1步:提取视频帧
```bash
python3 <skill>/scripts/extract_frames.py \
--data-dir <视频目录> \
--output-dir <帧输出目录> \
--fps 0.5 \
--max-frames 4
```
- 从每个视频均匀提取 4 帧(每2秒一帧)
- 生成 `manifest.json` 记录每个视频的元信息和帧路径
### 第2步:逐视频分析标注
对每个视频:
1. **查看提取的帧**:用 `read` 工具读取帧图片(支持 jpg/png)
2. **从文件名推断信息**:文件名包含关键信息(如 `海尔摄像头-1男1女-坐-2` → 品牌=海尔摄像头, 1男1女, 行为=坐)
3. **生成标注 JSON**:根据帧画面内容 + 文件名信息,生成结构化标注
标注 JSON 结构:
```json
{
"title": "场景标题",
"subtitle": "场景副标题",
"description": "详细描述(≥50字,含环境、人物外貌、行为姿态)",
"labels": ["system_suggest_X", ...],
"risk": {
"level": "none|low|medium|high",
"description": "风险描述"
},
"simple_description": "简练描述(≤20汉字)"
}
```
### 第3步:汇总生成 dataset.jsonl
1. 将所有标注结果收集到 `annotations.json`,格式:
```json
[
{"video": "文件名.mp4", "annotation": { ...标注JSON... }},
...
]
```
2. 运行构建脚本:
```bash
python3 <skill>/scripts/build_jsonl.py \
--annotations annotations.json \
--video-dir <视频目录> \
--output dataset.jsonl
```
3. 脚本会自动验证标注数据并生成 `dataset.jsonl`
## 关键参考
- **System prompt 模板**: `references/system-prompt.md`
- **标签范围**: `references/labels-reference.md`
## 标签选择规则
- 根据视频实际内容选择匹配标签
- 可多选,但不要选不匹配的标签
- 如果视频中有危险行为(儿童攀爬窗户、摔倒等),risk.level 应为 medium 或 high
- 文件名中的信息(人数、行为)必须与标注一致Related Skills
data-annotation
通用数据标注处理工具。当用户提到需要数据标注、有标注任务、数据处理、数据集生成、 标注查看/编辑时使用此 skill。支持图像、视频、文本等多种数据类型,调用模型进行内容理解 和标注,生成结构化标注数据,提供 Web 查看编辑界面。 触发短语:「标注」「annotation」「数据集」「label」「tag data」「数据处理」。
variant-annotation
Query and annotate gene variants from ClinVar and dbSNP databases. Trigger when: - User provides a variant identifier (rsID, HGVS notation, genomic coordinates) and asks about clinical significance - User mentions "ClinVar", "dbSNP", "variant annotation", "pathogenicity", "clinical significance" - User wants to know if a mutation is pathogenic, benign, or of uncertain significance - User provides VCF content or variant data requiring interpretation - Input: variant ID (rs12345), HGVS notation (NM_007294.3:c.5096G>A), or genomic coordinates (chr17:43094692:G>A) - Output: clinical significance, ACMG classification, allele frequency, disease associations
shannon-ai-pentester
Autonomous white-box AI pentester for web applications and APIs using source code analysis and live exploit execution
---
name: article-factory-wechat
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
tavily-search
Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.
baidu-search
Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.
agent-autonomy-kit
Stop waiting for prompts. Keep working.
Meeting Prep
Never walk into a meeting unprepared again. Your agent researches all attendees before calendar events—pulling LinkedIn profiles, recent company news, mutual connections, and conversation starters. Generates a briefing doc with talking points, icebreakers, and context so you show up informed and confident. Triggered automatically before meetings or on-demand. Configure research depth, advance timing, and output format. Walking into meetings blind is amateur hour—missed connections, generic small talk, zero leverage. Use when setting up meeting intelligence, researching specific attendees, generating pre-meeting briefs, or automating your prep workflow.
self-improvement
Captures learnings, errors, and corrections to enable continuous improvement. Use when: (1) A command or operation fails unexpectedly, (2) User corrects Claude ('No, that's wrong...', 'Actually...'), (3) User requests a capability that doesn't exist, (4) An external API or tool fails, (5) Claude realizes its knowledge is outdated or incorrect, (6) A better approach is discovered for a recurring task. Also review learnings before major tasks.
botlearn-healthcheck
botlearn-healthcheck — BotLearn autonomous health inspector for OpenClaw instances across 5 domains (hardware, config, security, skills, autonomy); triggers on system check, health report, diagnostics, or scheduled heartbeat inspection.