arch-video-cut
Automatic Architecture Video Editing Workflow with Self-Learning Preferences
Best use case
arch-video-cut is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using arch-video-cut should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in `.claude/skills/arch-video-cut/SKILL.md` inside your project
- Restart your AI agent; it will auto-discover the skill
How arch-video-cut Compares
| Feature / Agent | arch-video-cut | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
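It automates an architecture video editing workflow (multi-video merging, speech-to-text subtitles, background music mixing, and landscape plus portrait output) and remembers your editing preferences between runs.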
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# arch-video-cut
Automatic Architecture Video Editing Workflow with Self-Learning Preferences
---
## Description
Automatically completes the full architecture video editing workflow: multi-video merging, speech-to-text subtitles, background music mixing, and dual output (landscape + portrait). A built-in self-learning system remembers your editing preferences.
**Core Features:**
- 🎬 Auto merge multiple videos + duration compression
- 🎙️ Speech transcription or custom subtitles
- 🎵 Smart background music generation + mixing
- 📱 Dual output: landscape (16:9) + portrait (3:4)
- 🧠 Self-evolving preference system
---
## Usage
### Quick Start
```bash
cd ~/.openclaw/workspace/skills/arch-video-cut
python3 scripts/full_workflow.py
```
### Prerequisites
1. **Install dependencies:**
```bash
brew install ffmpeg-full # Required for libass subtitle support
pip3 install faster-whisper # Optional: for speech transcription
```
2. **Prepare materials:**
- Audio file: `~/Desktop/新录音 XX.m4a` (narration voiceover; `新录音` means "new recording" and `XX` stands for the recording number)
- Video folder: `data/m1/` (architecture video clips to merge)
3. **Configure preferences (optional):**
```bash
python3 scripts/manage_preferences.py set
```
---
## Commands
| Command | Description |
|------|------|
| `python3 scripts/full_workflow.py` | Execute full editing workflow |
| `python3 scripts/manage_preferences.py show` | View current preferences |
| `python3 scripts/manage_preferences.py set` | Interactive preference editor |
| `python3 scripts/manage_preferences.py reset` | Reset to defaults |
---
## Configuration
### Preferences
Edit `config/user_preferences.json` or run `manage_preferences.py set`:
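```json
{
  "video": {
    "target_duration": 20.0,
    "vertical_format": "3:4",
    "vertical_resolution": "1080x1440"
  },
  "subtitles": {
    "horizontal_font_size": 14,
    "vertical_font_size": 10,
    "font_name": "STHeiti",
    "auto_wrap": true,
    "margin_v": 30
  },
  "audio": {
    "background_music_volume": 0.15,
    "fade_in_duration": 2,
    "fade_out_duration": 2
  }
}
```
| Key | Description |
|------|------|
| `video.target_duration` | Target duration in seconds |
| `video.vertical_format` | Portrait aspect ratio |
| `video.vertical_resolution` | Portrait output resolution (width x height) |
| `subtitles.horizontal_font_size` | Landscape font size (px) |
| `subtitles.vertical_font_size` | Portrait font size (px) |
| `subtitles.font_name` | Font family |
| `subtitles.auto_wrap` | Auto word wrap |
| `subtitles.margin_v` | Bottom margin (px) |
| `audio.background_music_volume` | BGM volume (0-1) |
| `audio.fade_in_duration` | Fade-in duration (sec) |
| `audio.fade_out_duration` | Fade-out duration (sec) |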
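How the scripts read this file is not documented here. As a minimal sketch, assuming a loader that overlays the user's file on the documented defaults, the following shows one way a partial `user_preferences.json` could still yield a complete configuration. The names `DEFAULTS`, `merge`, and `load_preferences` are hypothetical and do not come from the skill's code.

```python
import copy
import json
from pathlib import Path

# Defaults copied from the documented example values.
DEFAULTS = {
    "video": {
        "target_duration": 20.0,
        "vertical_format": "3:4",
        "vertical_resolution": "1080x1440",
    },
    "subtitles": {
        "horizontal_font_size": 14,
        "vertical_font_size": 10,
        "font_name": "STHeiti",
        "auto_wrap": True,
        "margin_v": 30,
    },
    "audio": {
        "background_music_volume": 0.15,
        "fade_in_duration": 2,
        "fade_out_duration": 2,
    },
}


def merge(defaults, overrides):
    """Recursively overlay overrides on a deep copy of defaults."""
    merged = copy.deepcopy(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


def load_preferences(path="config/user_preferences.json"):
    """Hypothetical loader: use the defaults when the file is missing."""
    file_path = Path(path)
    if not file_path.exists():
        return merge(DEFAULTS, {})
    with file_path.open(encoding="utf-8") as handle:
        return merge(DEFAULTS, json.load(handle))
```

With this approach a file containing only `{"subtitles": {"horizontal_font_size": 18}}` changes that one value and leaves every other setting at its default.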
### Custom Subtitles
Edit the `subtitles_text` list in the `transcribe_audio()` function:
```python
subtitles_text = [
"These six renovation projects were transformed from abandoned schools",
"Historic buildings, red brick houses, tile-roof homes, single-story factories, and rural self-built houses",
"Through minimalist design approaches and low-cost renovation strategies",
"Giving old buildings new life",
"While balancing contemporary aesthetics and market demands",
]
```
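The workflow allocates subtitle timing from the audio duration, but the exact rule is not documented. The sketch below assumes one plausible rule: each line gets a share of the audio proportional to its character count, and the result is written in SRT form. `allocate_timeline`, `srt_timestamp`, and `to_srt` are hypothetical names, not functions from the skill.

```python
def allocate_timeline(lines, audio_duration):
    """Split audio_duration across lines in proportion to their length."""
    total_chars = sum(len(line) for line in lines)
    if total_chars == 0 or audio_duration <= 0:
        return []
    cues = []
    start = 0.0
    for line in lines:
        end = start + audio_duration * len(line) / total_chars
        cues.append((start, end, line))
        start = end
    return cues


def srt_timestamp(seconds):
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(cues):
    """Render (start, end, text) cues as SRT text."""
    blocks = []
    for index, (start, end, text) in enumerate(cues, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    return "\n".join(blocks)
```

An equal split per line would be a simpler alternative; proportional allocation tends to keep long lines on screen longer.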
---
## Output
**Output location:** `data/` folder
| File | Description |
|------|------|
| `edited_video_final_with_subtitles.mp4` | Landscape version (16:9) |
| `edited_video_final_with_subtitles_3x4.mp4` | Portrait version (3:4) |
**Example output:**
```
✅ All done!
📁 Output: data/edited_video_final_with_subtitles.mp4
📊 Size: 16.0MB
🎬 Duration: 20.04 seconds
```
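How the portrait version is derived from the landscape footage is not described. As a sketch, assuming a centered crop followed by scaling (the script may pad or blur instead), the geometry for the FFmpeg `crop` and `scale` filters could be computed like this. `center_crop` and `portrait_filter` are hypothetical helpers.

```python
def center_crop(src_w, src_h, ratio_w, ratio_h):
    """Return (width, height, x, y) of the largest centered crop with the given ratio."""
    if src_w * ratio_h > src_h * ratio_w:
        # Source is wider than the target ratio: keep full height, trim the sides.
        crop_h = src_h
        crop_w = src_h * ratio_w // ratio_h
    else:
        # Source is narrower or equal: keep full width, trim top and bottom.
        crop_w = src_w
        crop_h = src_w * ratio_h // ratio_w
    # Round down to even dimensions, which common video codecs expect.
    crop_w -= crop_w % 2
    crop_h -= crop_h % 2
    return crop_w, crop_h, (src_w - crop_w) // 2, (src_h - crop_h) // 2


def portrait_filter(src_w, src_h, vertical_format="3:4",
                    vertical_resolution="1080x1440"):
    """Build an FFmpeg filter string from the documented preference values."""
    ratio_w, ratio_h = (int(part) for part in vertical_format.split(":"))
    out_w, out_h = (int(part) for part in vertical_resolution.split("x"))
    crop_w, crop_h, x, y = center_crop(src_w, src_h, ratio_w, ratio_h)
    return f"crop={crop_w}:{crop_h}:{x}:{y},scale={out_w}:{out_h}"
```

For a 1920x1080 source this keeps the middle 810 pixels of width and scales the result up to 1080x1440.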
---
## Workflow
```
1. Merge videos → Compress to target duration
2. Generate subtitles → Allocate timeline based on audio duration
3. Generate BGM → Piano chords + fade in/out
4. Mix audio → Voiceover + background music
5. Burn subtitles → Landscape + Portrait versions
```
**Total processing time:** ~2-3 minutes (depends on video count and duration)
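Steps 3 and 4 can be pictured as an FFmpeg filter graph. The sketch below assumes the voiceover is input 0 and the background music is input 1, and uses the documented volume and fade settings. The function names and file names are hypothetical, and the skill's own script may build its command differently.

```python
def bgm_mix_filter(total_duration, volume=0.15, fade_in=2, fade_out=2):
    """Build a filter graph: attenuate and fade the BGM, then mix it under the voiceover."""
    fade_out_start = round(max(total_duration - fade_out, 0), 3)
    bgm = (
        f"[1:a]volume={volume},"
        f"afade=t=in:st=0:d={fade_in},"
        f"afade=t=out:st={fade_out_start}:d={fade_out}[bgm]"
    )
    # duration=first ends the mix when the voiceover (first input) ends.
    mix = "[0:a][bgm]amix=inputs=2:duration=first[mixed]"
    return f"{bgm};{mix}"


def build_mix_command(voiceover, bgm, output, total_duration):
    """Assemble an ffmpeg argument list; all paths are supplied by the caller."""
    return [
        "ffmpeg", "-y",
        "-i", voiceover,
        "-i", bgm,
        "-filter_complex", bgm_mix_filter(total_duration),
        "-map", "[mixed]",
        output,
    ]
```

Passing the arguments as a list to `subprocess.run` avoids shell quoting problems with file names that contain spaces, such as the voiceover recordings used in the examples.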
---
## Self-Learning
The built-in preference learning system automatically records your editing habits:
- 📝 Saves configuration after each edit
- 📊 Keeps last 20 adjustment records
- 🔄 Auto-applies preferences on next run
- 🎛️ Modify anytime via `manage_preferences.py`
**View learning history:**
```bash
python3 scripts/manage_preferences.py show
```
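The storage format of the learning history is not documented. A minimal sketch of the "keep the last 20 records" behavior, assuming a JSON file holding a list of timestamped snapshots, might look like this. `record_adjustment`, `MAX_RECORDS`, and the record layout are hypothetical.

```python
import json
from datetime import datetime, timezone
from pathlib import Path

MAX_RECORDS = 20  # matches the documented history length


def record_adjustment(history_path, preferences):
    """Append a preference snapshot and keep only the newest MAX_RECORDS entries."""
    path = Path(history_path)
    if path.exists():
        history = json.loads(path.read_text(encoding="utf-8"))
    else:
        history = []
    history.append({
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "preferences": preferences,
    })
    history = history[-MAX_RECORDS:]  # drop the oldest records beyond the limit
    path.write_text(
        json.dumps(history, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return history
```

Trimming on every write keeps the file small, and the most recent entry can be reapplied as the starting configuration for the next run.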
---
## Examples
### Example 1: Quick Edit
```bash
# Place 5 video clips in data/m1/
# Place voiceover audio at ~/Desktop/新录音 74.m4a
cd ~/.openclaw/workspace/skills/arch-video-cut
python3 scripts/full_workflow.py
```
### Example 2: Adjust Font Size
```bash
# Interactive modification
python3 scripts/manage_preferences.py set
# Input: horizontal font size 18px
# Re-edit with new font automatically applied
python3 scripts/full_workflow.py
```
### Example 3: Create 30-Second Version
```bash
# Modify preference
python3 scripts/manage_preferences.py set
# Input: target duration 30 seconds
# Edit
python3 scripts/full_workflow.py
```
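How the merged footage is compressed to the target duration is not specified. One plausible reading is a uniform speed change; the script may instead trim clips. Under that assumption, the sketch below derives the speed factor and the matching FFmpeg `setpts` expression. `speed_plan` is a hypothetical helper.

```python
def speed_plan(clip_durations, target_duration):
    """Return (speed, setpts_filter) that fits the merged clips into target_duration."""
    total = sum(clip_durations)
    if total <= 0 or target_duration <= 0:
        raise ValueError("durations must be positive")
    speed = total / target_duration       # 2.0 means twice as fast
    pts_factor = target_duration / total  # setpts multiplies each timestamp by this
    return speed, f"setpts={pts_factor:.6f}*PTS"
```

For example, 60 seconds of clips and a 30-second target give a speed of 2.0 and the filter `setpts=0.500000*PTS`. A speed below 1.0 would slow the footage down, which may or may not be what the script does when the clips are shorter than the target.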
---
## Troubleshooting
### ❌ ffmpeg-full not found
```bash
brew install ffmpeg-full # Required for libass subtitle support
```
### ❌ Subtitles not showing
Check that `ffmpeg-full` is installed. Subtitle burning requires libass, which the standard ffmpeg build may not include.
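To confirm libass support before running the workflow, one option is to inspect the build flags that `ffmpeg -version` prints. This is a sketch rather than part of the skill; `has_libass` and `check_ffmpeg` are hypothetical names.

```python
import shutil
import subprocess


def has_libass(version_output):
    """True if the ffmpeg build configuration lists --enable-libass."""
    return "--enable-libass" in version_output


def check_ffmpeg(binary="ffmpeg"):
    """Return True only if the binary is found and was built with libass."""
    path = shutil.which(binary)
    if path is None:
        return False
    result = subprocess.run(
        [path, "-version"], capture_output=True, text=True, check=False
    )
    return has_libass(result.stdout)


if __name__ == "__main__":
    print("libass available:", check_ffmpeg())
```

If the check prints `False` even though `ffmpeg-full` is installed, a different `ffmpeg` binary may come first on `PATH`; passing the full path of the intended binary to `check_ffmpeg` rules that out.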
### ❌ Transcription failed
```bash
pip3 install faster-whisper
# Or skip transcription and edit the subtitle text directly in the script
```
### ❌ Wrong video aspect ratio
Modify `vertical_format` in `config/user_preferences.json`, and keep `vertical_resolution` consistent with the new ratio.
---
## Files
```
arch-video-cut/
├── SKILL.md # This file
├── SELF_LEARNING_GUIDE.md # Self-learning detailed guide
├── README.md # Quick start guide
├── config/
│ └── user_preferences.json # User preferences
├── scripts/
│ ├── full_workflow.py # Main editing script
│ ├── preference_learner.py # Preference learner
│ └── manage_preferences.py # Preference manager
└── data/
├── m1/ # Input video folder
├── temp_edit/ # Temporary files
└── *.mp4 # Output videos
```
---
## Version
**v1.0.0** - 2026-03-18
- ✅ Multi-video merge + duration compression
- ✅ Custom subtitle text
- ✅ Background music generation + mixing
- ✅ Landscape + Portrait dual output
- ✅ Self-evolving preference system
---
## Author
**WildUrban Architect - Linwangming**
**Website:** http://www.ual-studio.com/
---
_Make tools adapt to you, not you to tools._ 🧠