windows-ui-controller
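Windows software automation skill pack: uses pywinauto to control WeChat, QQ, NetEase Cloud Music, or any other Windows application. Includes a complete tutorial, dependency packages, and best practices.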
Best use case
windows-ui-controller is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Teams using windows-ui-controller should expect more consistent output, faster repeated execution, and less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in .claude/skills/windows-ui-controller/SKILL.md inside your project
- Restart your AI agent; it will auto-discover the skill
How windows-ui-controller Compares
| Feature / Agent | windows-ui-controller | Standard Approach |
|---|---|---|
| Platform Support | Windows 10/11 | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
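It is a Windows software automation skill pack that uses pywinauto to control WeChat, QQ, NetEase Cloud Music, or any other Windows application. It includes a complete tutorial, dependency packages, and best practices.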
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Windows UI Controller
🖱️ **Let AI control any Windows software: WeChat, QQ, NetEase Cloud Music, Baidu Netdisk, and more**
---
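## 📦 What is this?
**Windows UI Controller** is a skill pack that lets an AI:
- ✅ Scan a Windows application's interface (buttons, input boxes, menus, and so on)
- ✅ Click any button
- ✅ Type text
- ✅ Tell similar controls apart (for example, a search box vs. a chat input box)
- ✅ Verify success after every step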
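The capabilities listed above map onto a handful of pywinauto calls. The following is a minimal sketch under stated assumptions, not the skill pack's own code: it assumes the `uia` backend, and the helper names (`connect_to_window`, `summarize_controls`, `click_button`, `type_into_edit`) are hypothetical names chosen for illustration.

```python
def connect_to_window(title_re):
    """Attach to an already running application by window title (Windows only)."""
    # Imported lazily so the other helpers can be imported on any platform.
    from pywinauto import Application

    app = Application(backend="uia").connect(title_re=title_re)
    return app.window(title_re=title_re)


def summarize_controls(window, control_types=("Button", "Edit", "MenuItem")):
    """Scan a window and return (control_type, text) pairs for the listed types."""
    found = []
    for ctrl in window.descendants():
        ctype = ctrl.element_info.control_type
        if ctype in control_types:
            found.append((ctype, ctrl.window_text()))
    return found


def click_button(window, title):
    """Click the button whose visible text equals `title`."""
    window.child_window(title=title, control_type="Button").click_input()


def type_into_edit(window, text, index=0):
    """Type into the index-th Edit control; with_spaces keeps literal spaces."""
    edit = window.child_window(control_type="Edit", found_index=index)
    edit.type_keys(text, with_spaces=True)
```

A typical session might call `connect_to_window(".*Notepad.*")` (a placeholder title pattern), print the result of `summarize_controls(...)` to see what is on screen, and only then click or type. Note that `type_keys` treats characters such as `+`, `^`, `%`, and braces as key modifiers, so literal text containing them needs escaping.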
---
## 🛠️ Install dependencies
### Offline install (using the dependencies bundled with this skill pack)
```bash
cd dependencies
pip install --no-index --find-links=. pywinauto
```
### Online install
```bash
pip install pywinauto
```
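After either install path, it can help to confirm that the packages are importable before attempting any automation. This is a small sketch using only the standard library; the function name `missing_modules` is hypothetical, and `win32api` is used here as the importable module that ships with pywin32.

```python
import importlib.util


def missing_modules(names=("pywinauto", "comtypes", "six", "win32api")):
    """Return the top-level module names that cannot be found on this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]
```

An empty list from `missing_modules()` suggests the install succeeded; any names returned point to the wheel that still needs installing.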
---
## 🚀 Quick start
### 1. Install dependencies
```bash
cd dependencies
pip install --no-index --find-links=. pywinauto
```
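### 2. Learn how to use it
**Detailed tutorial**: `README.md`
It covers:
- What pywinauto is
- Complete usage examples
- ⚠️ Key warning (verify every step)
- 💡 Smart caching suggestions
- 📸 Visual-assisted error correction
- Encoding settings
- Best practices
- API reference
- FAQ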
---
## ⚠️ Key warning
**Every step must be followed by a scan to verify it! Do not execute several steps at once!**
See `README.md` for details.
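One way to enforce the one-step-then-verify rule is to pair every action with a check and stop at the first check that fails. The sketch below is an illustration rather than the README's implementation; `run_step` and `run_steps` are hypothetical helpers, and the retry count and delay are arbitrary defaults.

```python
import time


def run_step(action, verify, retries=3, delay=0.5):
    """Run one UI action, then poll `verify` until it reports success."""
    action()
    for _ in range(retries):
        if verify():
            return True
        time.sleep(delay)
    return False


def run_steps(steps, retries=3, delay=0.5):
    """Run (name, action, verify) steps in order.

    Returns None if every step verified, otherwise the name of the first
    step that failed. Later steps are not executed after a failure.
    """
    for name, action, verify in steps:
        if not run_step(action, verify, retries=retries, delay=delay):
            return name
    return None
```

With pywinauto, a `verify` callable would typically re-scan the window, for example by checking that an expected control is present with `window.child_window(...).exists(timeout=1)`.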
---
## 📁 File structure
```
windows-ui-controller/
├── SKILL.md                    # This file
├── README.md                   # Full tutorial (20 KB)
└── dependencies/               # Python dependency packages
    ├── pywinauto-0.6.9-py2.py3-none-any.whl
    ├── six-1.17.0-py2.py3-none-any.whl
    ├── comtypes-1.4.16-py3-none-any.whl
    └── pywin32-311-cp312-cp312-win_amd64.whl
```
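The wheel filenames carry compatibility tags: the bundled pywin32 wheel is tagged `cp312` and `win_amd64`, so it targets CPython 3.12 on 64-bit Windows, while the other three wheels are pure Python. The following sketch checks the Python tag of a wheel against an interpreter version. It is deliberately simplified (it ignores the ABI and platform tags and assumes CPython), and both function names are hypothetical.

```python
import sys


def wheel_python_tags(filename):
    """Return the Python tags from a wheel filename (name-ver-pytag-abi-plat.whl)."""
    stem = filename[:-len(".whl")] if filename.endswith(".whl") else filename
    parts = stem.split("-")
    # The last three dash-separated fields are python tag, ABI tag, platform tag.
    return parts[-3].split(".")


def wheel_fits_interpreter(filename, version_info=sys.version_info):
    """Rough check: does any Python tag of the wheel match this CPython version?"""
    major, minor = version_info[0], version_info[1]
    accepted = {f"py{major}", f"py{major}{minor}", f"cp{major}{minor}"}
    return any(tag in accepted for tag in wheel_python_tags(filename))
```

Under this check, `wheel_fits_interpreter("pywin32-311-cp312-cp312-win_amd64.whl", (3, 8))` is false, which suggests that on interpreters other than 3.12 the online install is the safer route.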
---
## 💡 Core features
### 1. Verify every step
After each operation, scan the interface again to confirm it succeeded.
### 2. Smart caching
Remember where buttons are and what they do, so the skill gets more proficient with use.
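A cache like this could be as simple as a JSON file keyed by application and control label. The sketch below is one possible shape, not the format the README prescribes; the class name `ControlCache`, the file layout, and the `purpose` field are assumptions made for illustration.

```python
import json
from pathlib import Path


class ControlCache:
    """Persist known control positions and purposes in a JSON file."""

    def __init__(self, path):
        self.path = Path(path)
        if self.path.exists():
            self.entries = json.loads(self.path.read_text(encoding="utf-8"))
        else:
            self.entries = {}

    def remember(self, app, label, rect, purpose=""):
        """Store (left, top, right, bottom) and a note for one control, then save."""
        self.entries.setdefault(app, {})[label] = {
            "rect": list(rect),
            "purpose": purpose,
        }
        self.path.write_text(
            json.dumps(self.entries, ensure_ascii=False, indent=2),
            encoding="utf-8",
        )

    def lookup(self, app, label):
        """Return the cached entry, or None if this control has not been seen."""
        return self.entries.get(app, {}).get(label)
```

In pywinauto, the rectangle could come from `ctrl.rectangle()`, whose `left`, `top`, `right`, and `bottom` attributes supply the four values. Cached positions go stale when a window is moved or resized, so a cache hit is best treated as a hint that still gets verified by a fresh scan.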
### 3. Visual assistance
On failure, take a screenshot and analyze the cause.
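Capturing the evidence can be sketched with pywinauto's `capture_as_image()` wrapper method, which relies on Pillow being installed. The helper name `save_failure_screenshot`, the output directory, and the filename pattern below are hypothetical; analyzing the saved image is left to whatever vision tooling is available.

```python
from datetime import datetime
from pathlib import Path


def save_failure_screenshot(window, step_name, out_dir="failures"):
    """Capture `window` to a PNG; return its path, or None if capture is unavailable."""
    image = window.capture_as_image()  # pywinauto wrapper method; needs Pillow
    if image is None:
        return None
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    stamp = datetime.now().strftime("%Y%m%d-%H%M%S")
    path = out / f"{step_name}-{stamp}.png"
    image.save(path)
    return path
```

A natural place to call this is right after a verification check fails, passing the name of the failed step so the screenshot can be matched to it later.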
---
## 📖 Learning resources
- **Full tutorial**: `README.md`
- **Official pywinauto documentation**: https://pywinauto.readthedocs.io/
- **GitHub**: https://github.com/pywinauto/pywinauto
---
**Version**: 1.0.0
**Created**: 2026-03-30
**System**: Windows 10/11
**Python**: 3.8+
**License**: MIT-0