Best use case
slide-sniper is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
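It monitors a full-screen video or livestream in the background, uses a vision model to detect slide changes, and automatically takes screenshots, extracts the text, and formats it into a note-taking app.
Teams using slide-sniper should expect more consistent output, faster repeated execution, and less prompt rewriting.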
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in `.claude/skills/slide-sniper/SKILL.md` inside your project
- Restart your AI agent; it will auto-discover the skill
How slide-sniper Compares
| Feature / Agent | slide-sniper | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
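It monitors a full-screen video or livestream in the background, uses a vision model to detect slide changes, and automatically takes screenshots, extracts the text, and formats it into a note-taking app.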
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
AI Agents for Startups
Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
SKILL.md Source
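# The Slide Sniper (vision-based slide capture)

## 🎯 Core goal

Your task is to act as a tireless "in-class teaching assistant". When the user is watching a non-downloadable online course, livestream, or seminar, monitor the screen in the background, capture every slide change, extract the key information, and automatically organize it into notes that combine images and text.

## 💡 Trigger conditions

Triggered when the user gives one of the following instructions from the video playback interface:

* "Keep an eye on this livestream for me and take good notes."
* "Turn on Slide Sniper mode."

## 📋 Execution steps

### Step 1: Initialize the monitoring state

1. Confirm that a video window is currently playing on screen (prefer full-screen or large windows).
2. Capture the first frame of the current video and hold it in temporary memory as the "baseline frame".
3. Automatically create a Markdown or Word file, named after the current time, in the local `~/Documents/Notes/SlideSniper` directory.

### Step 2: Intelligent page-turn detection (core loop)

Every 5 seconds (or at a user-defined interval), use the screen vision capability (Computer Use - Vision) to look at the current screen:

1. **Frame comparison:** Visually compare the current frame with the "baseline frame". Ignore the presenter's minor movements and mouse movement; focus on whether the **structure, title, large color blocks, or core text that dominate the frame** have fundamentally changed.
2. **Page-turn confirmation:** If the frame is judged "page turned", go to Step 3; otherwise, continue monitoring silently.

### Step 3: Screenshot and content extraction

Once a page turn is confirmed, immediately do the following:

1. **Clean screenshot:** Capture a high-resolution image of the current video area (as far as possible, avoid capturing the progress bar while it is still visible).
2. **OCR extraction:** Read the text in the screenshot. Strip footers, page numbers, and other irrelevant information, and extract the slide's core title and key points.

### Step 4: Note formatting and update

1. Save the screenshot to the local directory.
2. Append the following to the newly created note file:
   * `### [extracted slide title]`
   * `Insert image: [screenshot path]`
   * `[extracted slide key points / body text]`
3. Set the current frame as the new "baseline frame" and return to Step 2 to continue monitoring.

## ⚠️ Safety and operational red lines

1. **Silent operation:** Never move the user's mouse or interfere with the playback state of the current video.
2. **Privacy filtering:** Extract only the PPT/slide content that dominates the frame. Ignore bullet comments (danmaku), private chat windows, and any other screen regions unrelated to the course.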
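The detection loop and note layout described in SKILL.md can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the skill's implementation: the skill delegates the comparison to a vision model, whereas this sketch swaps in a simple pixel-difference heuristic on grayscale frames. The function names, the `pixel_tol` and `area_threshold` values, the timestamp filename pattern, and the Markdown image syntax are all hypothetical. Screen capture and OCR are left out.

```python
from datetime import datetime
from pathlib import Path
from typing import List, Sequence

# Assumption: a frame is a grid of grayscale values (0-255), one list per row.
Frame = Sequence[Sequence[int]]

# Directory named in SKILL.md; call .expanduser() on it before writing files.
NOTES_DIR = Path("~/Documents/Notes/SlideSniper")


def note_path(now: datetime, root: Path = NOTES_DIR) -> Path:
    """Build a timestamp-named note file path (filename pattern is an assumption)."""
    return root / f"{now:%Y-%m-%d_%H-%M-%S}.md"


def changed_fraction(baseline: Frame, current: Frame, pixel_tol: int = 25) -> float:
    """Fraction of pixels whose brightness differs by more than pixel_tol."""
    if len(baseline) != len(current) or any(
        len(a) != len(b) for a, b in zip(baseline, current)
    ):
        raise ValueError("frames must have the same dimensions")
    total = sum(len(row) for row in baseline)
    if total == 0:
        return 0.0
    changed = sum(
        1
        for row_a, row_b in zip(baseline, current)
        for a, b in zip(row_a, row_b)
        if abs(a - b) > pixel_tol
    )
    return changed / total


def detect_turns(frames: Sequence[Frame], area_threshold: float = 0.3) -> List[int]:
    """Return indices of frames judged to start a new slide.

    Small changes (presenter movement, cursor) stay below area_threshold
    and are ignored. After each detected turn the baseline is replaced,
    mirroring item 3 of Step 4.
    """
    if not frames:
        return []
    baseline = frames[0]
    turns: List[int] = []
    for index, frame in enumerate(frames[1:], start=1):
        if changed_fraction(baseline, frame) >= area_threshold:
            turns.append(index)
            baseline = frame
    return turns


def format_note_entry(title: str, image_path: str, body: str) -> str:
    """Render one slide as a Markdown section: heading, image, extracted text."""
    return f"### {title}\n\n![{title}]({image_path})\n\n{body}\n\n"
```

In a real run, each sampled frame would come from a screenshot taken every 5 seconds, and the string returned by `format_note_entry` would be appended to the file at `note_path(datetime.now()).expanduser()`. A fixed area threshold is a crude stand-in for the "fundamental change" judgment the skill asks the vision model to make, so expect to tune it or replace it.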
Related Skills
Presentation Mastery — Complete Slide Design & Delivery System
You are a Presentation Architect. You help build presentations that persuade, inform, and move people to action. You cover the full lifecycle: audience analysis → narrative structure → slide design → delivery coaching → post-presentation follow-up.
image-to-editable-ppt-slide
Rebuild one or more reference images as visually matching editable PowerPoint slides using native shapes, text, fills, and layout instead of a flat screenshot. Use when the user wants an image, flowchart, infographic, dashboard, process diagram, or designed slide converted into an editable PPT/PPTX deck that stays editable and closely matches the source.
tiktok-slideshow
Creates TikTok image carousels (slideshows with text overlays on photos) via the ViralBaby API. Use when the user wants to: create TikTok slideshows or carousels, find/search for background images for social media content, post or upload slideshow content to TikTok, edit slide text, or manage image collections for content creation. Do NOT use for: general TikTok account management, TikTok analytics or metrics, video editing or video creation (this is for photo slideshows only), non-TikTok social media platforms, or any task unrelated to creating visual slideshow content for TikTok.
md-slider
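Creative director for tech product launches. Converts structured Markdown into visually striking, "big-character poster" style HTML slides. Focuses on cinematic dark gradients, Morandi-palette text, and "breathing" motion effects, so that each slide conveys a single core, minimal message.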
polymarket-signal-sniper
Snipe Polymarket opportunities from your own signal sources. Monitors RSS feeds with Trading Agent-grade safeguards.
polymarket-mert-sniper
Near-expiry conviction trading on Polymarket. Snipe markets about to resolve when the odds are heavily skewed. Filter by topic, cap your bets, and only trade strong splits close to deadline.
article-factory-wechat
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
tavily-search
Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.
baidu-search
Search the web using Baidu AI Search Engine (BDSE). Use for live information, documentation, or research topics.
agent-autonomy-kit
Stop waiting for prompts. Keep working.