install-then-update-trap-detector
Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly, then silently introduces malicious behavior through an automatic update that bypasses re-audit. v1.1 adds cryptographic chain-of-custody verification for update sequences.
Best use case
install-then-update-trap-detector is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly, then silently introduces malicious behavior through an automatic update that bypasses re-audit. v1.1 adds cryptographic chain-of-custody verification for update sequences.
Teams using install-then-update-trap-detector should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/install-then-update-trap-detector/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How install-then-update-trap-detector Compares
| Feature / Agent | install-then-update-trap-detector | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Helps detect the install-then-update attack pattern — where a skill passes initial security review cleanly, then silently introduces malicious behavior through an automatic update that bypasses re-audit. v1.1 adds cryptographic chain-of-custody verification for update sequences.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
Best AI Skills for Claude
Explore the best AI skills for Claude and Claude Code across coding, research, workflow automation, documentation, and agent operations.
Cursor vs Codex for AI Workflows
Compare Cursor and Codex for AI coding workflows, repository assistance, debugging, refactoring, and reusable developer skills.
SKILL.md Source
# The Skill Passed Audit. Then It Updated Itself. > Helps identify skills that use the post-install update window as an attack > vector — the gap between "passed initial review" and "continuously safe." ## Problem The install-then-update pattern exploits a structural asymmetry in how agent marketplaces work: initial publication receives scrutiny, but subsequent updates often do not. A skill that passes a thorough security review at v1.0 can introduce a backdoor at v1.1 — and agents that installed v1.0 may automatically update without any re-review occurring. This asymmetry is not a bug in any particular marketplace. It reflects a fundamental tension between two legitimate goals: fast iteration (which requires low-friction updates) and continuous security (which requires re-audit on every change). Most marketplaces resolve this tension in favor of iteration speed, leaving the post-install update window unguarded. The attack surface is large. An installed skill with automatic updates enabled can receive arbitrary code changes at the next update check. If the update introduces network exfiltration, credential harvesting, or permission scope expansion, the agent operator may not learn about it until after the damage is done — if they learn at all. ## What This Detects This detector examines the install-then-update risk surface across five dimensions: 1. **Update policy transparency** — Does the skill declare its update policy? Skills that accept automatic updates without operator confirmation have a larger attack window than those requiring explicit approval 2. **Behavioral delta on update** — When a new version is installed, does the skill's observable behavior change in ways not declared in the changelog? Undeclared behavioral changes after update are the primary signal of install-then-update exploitation 3. **Permission scope expansion on update** — Does the skill request additional permissions after an update that it did not request at install time? Scope creep across update boundaries is a common pattern in install-then-update attacks 4. **Update-to-publish timing anomalies** — Does the update arrive immediately after a security review period ends, or at a time associated with low operator attention (holidays, weekends, off-hours)? Timing patterns can indicate deliberate exploitation of review gaps 5. **Rollback feasibility** — Can the installed skill be cleanly rolled back to a previously verified version if the update is suspicious? Skills that make rollback difficult or impossible increase the cost of recovery from an install-then-update attack 6. **Chain-of-custody verification** (v1.1) — Is each update cryptographically signed and does it reference the prior version's content hash? A signed, hash-chained update sequence creates a verifiable chain of custody for the skill's evolution. Breaks in the chain — unsigned versions, missing hash references, or hash mismatches — indicate versions where custody cannot be verified. An install-then-update attack that also breaks the hash chain is detectable even without behavioral comparison ## How to Use **Input**: Provide one of: - A skill identifier to assess its update policy and behavioral delta history - Two specific versions of a skill to compare for undeclared behavioral changes - An agent's installed skill list to assess the combined update-window risk **Output**: A trap detection report containing: - Update policy transparency score - Behavioral delta assessment (declared vs. observed changes) - Permission scope expansion history - Update timing anomaly flags - Rollback feasibility rating - Risk verdict: SAFE / MONITOR / ELEVATED / TRAP-PATTERN-DETECTED ## Example **Input**: Assess install-then-update risk for `data-sync-helper` v1.0 → v1.2 ``` 🪤 INSTALL-THEN-UPDATE TRAP ASSESSMENT Skill: data-sync-helper Versions assessed: v1.0 (installed), v1.1, v1.2 (current) Audit timestamp: 2025-08-20T10:00:00Z Update policy transparency: v1.0 declared: "Updates require operator confirmation" ✅ v1.1 changed: Update policy silently removed from docs ⚠️ v1.2 current: No update policy declaration found ✗ Behavioral delta assessment: v1.0 → v1.1 changelog: "performance improvements" Observed behavioral change: Added outbound connection to new endpoint → Undisclosed behavioral change detected ⚠️ v1.1 → v1.2 changelog: "dependency updates" Observed behavioral change: No significant change detected → Changelog accurate ✅ Permission scope expansion: v1.0 requested: file-read (scoped to /data/) v1.1 requested: file-read (scope changed to /data/ + /config/) ⚠️ v1.2 requested: file-read (/data/ + /config/) + network-outbound (new) ⚠️ → Two permission expansions across update boundary Update timing: v1.0 published: 2025-06-01 (initial release) v1.1 published: 2025-07-14 (Sunday, 02:00 UTC — off-hours) ⚠️ v1.2 published: 2025-08-01 (Friday before a public holiday) ⚠️ → Both updates published during low-attention windows Rollback feasibility: v1.0 still available in registry: ✅ Rollback procedure documented: ✗ Not found State changes from v1.1+ reversible: Unknown Risk verdict: TRAP-PATTERN-DETECTED data-sync-helper shows four of five trap indicators: update policy silently removed, undisclosed behavioral change at v1.1, permission expansion across two update boundaries, and updates timed to low-attention windows. The combination suggests deliberate exploitation of the post-install update window rather than routine maintenance. Recommended actions: 1. Disable automatic updates for data-sync-helper immediately 2. Review all outbound connections from v1.1+ for data exfiltration 3. Audit config/ directory access introduced in v1.1 4. Treat v1.1+ as unverified pending manual review 5. Require explicit operator confirmation for all future updates ``` ## Related Tools - **delta-disclosure-auditor** — Checks whether updates publish machine-readable change records; install-then-update attacks depend on inadequate delta disclosure to avoid detection - **skill-update-delta-monitor** — Monitors for suspicious update patterns; install-then-update-trap-detector focuses specifically on the install-then-update attack path rather than general update anomalies - **permission-creep-scanner** — Detects permission scope expansion in individual skills; this tool focuses on scope expansion that occurs across update boundaries - **transparency-log-auditor** — Checks whether signing events are independently logged; install-then-update attacks are more detectable when every update is recorded in an auditable log ## Limitations Install-then-update trap detection requires access to behavioral data from multiple versions of a skill, which depends on registry version history preservation. Registries that do not retain older versions make behavioral comparison impossible for the full update history. Behavioral delta assessment is necessarily heuristic: the same observable change (an outbound connection) may represent legitimate new functionality or undisclosed malicious behavior, and cannot be distinguished without full code audit. Timing anomalies are signals, not proof — off-hours updates are common for legitimate releases targeting international time zones. The tool helps identify skills that warrant closer investigation, but does not replace manual review of suspicious update content. v1.1 limitation: Chain-of-custody verification requires registries to support signed updates and content hashing, which most do not yet. Where registries do not preserve cryptographic metadata, chain verification produces no signal. An attacker who controls the registry itself can forge the hash chain. *v1.1 chain-of-custody verification based on feedback from tobb_sunil (update-chain signing as commitment) in the delta disclosure discussion thread.*
Related Skills
authenticate-wallet
Sign in to AgnicPay wallet via browser-based OAuth. Use when you or the user want to authenticate, sign in, log in, connect wallet, or set up the CLI. Covers phrases like "sign in", "log in", "authenticate", "connect my wallet", "set up agnic".
skill-safe-install-l0-strict
Strict secure-install workflow for ClawHub/OpenClaw skills. Use when asked to install a skill safely, inspect skill permissions, review third-party skill risk, or run a pre-install security audit. Enforce full review + sandbox + explicit consent gates, with no author-based trust bypass.
Investor Update Generator
Generate professional monthly/quarterly investor updates that keep stakeholders informed and build trust.
bs-detector
Detects key claims in long messages and summarizes the real point. Uses NLP to find what someone is actually saying vs. what they want you to believe.
remote-install
Remote software installation via automated installer detection and GUI automation. Use when: user needs to install software packages (.exe/.msi) on Windows machines, automate Office/Adobe/Chrome installations, or handle unattended software deployment. NOT for: Linux/macOS installations, package manager installs (apt/yum/brew), or containerized deployments.
solana-scam-detector
Detect scam tokens on Solana before you trade. Checks ticker patterns, token age, and known scam mints. Read-only — no wallet signing required.
linux-installer
Installs, launches, and uninstalls Linux desktop apps by resolving the safest supported source first, then running a local helper CLI. Use when the user asks to install software like GIMP, Notepad++, or other desktop apps on Linux and wants the install command plus the command to launch or remove it.
ayao-updater
Automatically update OpenClaw and all installed skills on a schedule. Use when: (1) setting up automatic updates for OpenClaw or skills, (2) running a manual update check, (3) configuring update schedule, skip lists, or pre-release filtering, (4) user says "auto update", "schedule updates", "keep openclaw updated", "update skills automatically". Handles locally-modified skill protection, conflict avoidance, pre-release filtering, and completion or failure notifications.
Authensor Gateway
Fail-safe policy gate for OpenClaw marketplace skills. Intercepts tool calls before execution and checks them against your Authensor policy. Low-risk actions run automatically. High-risk actions require your approval. Dangerous actions are blocked. Only action metadata is sent to the control plane — never your files, API keys, or conversation content.
gate-mcp-installer
One-click installer and configurator for Gate MCP (mcporter) in OpenClaw. Use when the user wants to (1) Install mcporter CLI tool, (2) Configure Gate MCP server connection, (3) Verify Gate MCP setup, or (4) Troubleshoot Gate MCP connectivity issues.
OpenClaw Install Guide (WSL2 Windows)
Complete step-by-step installation guide for OpenClaw on Windows 10/11 with WSL2, includes common pitfalls and solutions from real installation experience.
install-founderclaw
Install FounderClaw — a complete multi-agent system for OpenClaw. Adds 6 agents, 29 skills, creates communication channels, configures models. Interactive setup. Requires user input for model selection and channel setup. Triggered by: "install founderclaw", "setup founderclaw", "get founderclaw".