windows-ui-controller

Windows 软件自动化控制技能包 - 使用 pywinauto 控制微信/QQ/网易云等任何 Windows 应用。包含完整教程、依赖包、最佳实践。

3,891 stars

Best use case

windows-ui-controller is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Windows 软件自动化控制技能包 - 使用 pywinauto 控制微信/QQ/网易云等任何 Windows 应用。包含完整教程、依赖包、最佳实践。

Teams using windows-ui-controller should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/windows-ui-controller/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/alimjan8800/windows-ui-controller/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/windows-ui-controller/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How windows-ui-controller Compares

Feature / Agentwindows-ui-controllerStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Windows 软件自动化控制技能包 - 使用 pywinauto 控制微信/QQ/网易云等任何 Windows 应用。包含完整教程、依赖包、最佳实践。

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Windows UI Controller

🖱️ **让 AI 控制任何 Windows 软件 - 微信/QQ/网易云/百度网盘...**

---

## 📦 这是什么?

**Windows UI Controller** 是一个技能包,让 AI 能够:
- ✅ 扫描 Windows 软件界面(按钮、输入框、菜单等)
- ✅ 点击任意按钮
- ✅ 输入文字
- ✅ 智能识别控件(搜索框 vs 聊天输入框)
- ✅ 每步操作后验证成功

---

## 🛠️ 安装依赖

### 离线安装(使用本技能包自带的依赖)

```bash
cd dependencies
pip install --no-index --find-links=. pywinauto
```

### 在线安装

```bash
pip install pywinauto
```

---

## 🚀 快速开始

### 1. 安装依赖

```bash
cd dependencies
pip install --no-index --find-links=. pywinauto
```

### 2. 学习使用

**详细教程请查看**: `README.md`

包含:
- pywinauto 是什么
- 完整使用示例
- ⚠️ 关键警告(每步验证)
- 💡 智能缓存建议
- 📸 视觉辅助纠错
- 编码设置
- 最佳实践
- API 参考
- 常见问题

---

## ⚠️ 关键警告

**每步操作必须扫描验证!不要一次执行多步!**

详见 `README.md` 中的详细说明。

---

## 📁 文件结构

```
windows-ui-controller/
├── SKILL.md              # 本文件
├── README.md             # 完整教程(20 KB)
└── dependencies/         # Python 依赖包
    ├── pywinauto-0.6.9-py2.py3-none-any.whl
    ├── six-1.17.0-py2.py3-none-any.whl
    ├── comtypes-1.4.16-py3-none-any.whl
    └── pywin32-311-cp312-cp312-win_amd64.whl
```

---

## 💡 核心功能

### 1. 每步验证
操作后必须扫描确认成功

### 2. 智能缓存
记住按钮位置和功能,越用越熟练

### 3. 视觉辅助
失败时截图分析原因

---

## 📖 学习资源

- **完整教程**: `README.md`
- **pywinauto 官方文档**: https://pywinauto.readthedocs.io/
- **GitHub**: https://github.com/pywinauto/pywinauto

---

**版本**: 1.0.0  
**创建时间**: 2026-03-30  
**系统**: Windows 10/11  
**Python**: 3.8+  
**许可证**: MIT-0

Related Skills

windows-screenshot-ocr

3891
from openclaw/skills

Windows全屏截图(自动标记鼠标位置)+ 原生OCR文字识别。完全本地运行,无需联网,无需API Key。适用于需要截图分析屏幕内容、自动化OCR识别的场景。

OpenClaw Install Guide (WSL2 Windows)

3891
from openclaw/skills

Complete step-by-step installation guide for OpenClaw on Windows 10/11 with WSL2, includes common pitfalls and solutions from real installation experience.

wsl-chrome-cdp - WSL2 访问 Windows Chrome 浏览器

3891
from openclaw/skills

**版本:** 1.0.0

windows-healing-gateway

3891
from openclaw/skills

OpenClaw Gateway Self-Healing System for Windows

slack-controller

3891
from openclaw/skills

Control Slack via Browser Automation to send messages, manage huddles, screen share, set status, and react as the logged-in user.

windows-tts

3891
from openclaw/skills

在 Windows 11 上"直接发声"的 TTS(从 WSL2/TUI 调用 powershell.exe + System.Speech)。适用于用户说"说出来/读出来/语音播报/用TTS",或反馈"没声音/tts 生成的 mp3 是空的/播不出来",以及需要中文语音但 OpenClaw 内置 tts 不可用时。

evomap-node-controller

3891
from openclaw/skills

管理 EvoMap 节点的启动、配置和监控。用于在服务器上启动/停止 EvoMap evolver 循环、配置节点 ID,处理节点绑定等。使用场景:用户要求启动/停止 EvoMap 节点、查看节点状态、配置节点 ID、解决节点连接问题。⚠️ 使用前必须配置环境变量。

---

3891
from openclaw/skills

name: article-factory-wechat

Content & Documentation

humanizer

3891
from openclaw/skills

Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.

Content & Documentation

find-skills

3891
from openclaw/skills

Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.

General Utilities

tavily-search

3891
from openclaw/skills

Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.

Data & Research

baidu-search

3891
from openclaw/skills

Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.

Data & Research