social-trust-manipulation-detector

Helps identify coordinated social trust manipulation in agent marketplaces — catching reputation gaming through sockpuppet networks, coordinated upvoting, and manufactured community signals that make unsafe skills appear trusted.

3,891 stars

Best use case

social-trust-manipulation-detector is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Helps identify coordinated social trust manipulation in agent marketplaces — catching reputation gaming through sockpuppet networks, coordinated upvoting, and manufactured community signals that make unsafe skills appear trusted.

Teams using social-trust-manipulation-detector should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/social-trust-manipulation-detector/SKILL.md --create-dirs "https://raw.githubusercontent.com/openclaw/skills/main/skills/andyxinweiminicloud/social-trust-manipulation-detector/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/social-trust-manipulation-detector/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How social-trust-manipulation-detector Compares

Feature / Agentsocial-trust-manipulation-detectorStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Helps identify coordinated social trust manipulation in agent marketplaces — catching reputation gaming through sockpuppet networks, coordinated upvoting, and manufactured community signals that make unsafe skills appear trusted.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

Related Guides

SKILL.md Source

# Your Trust Score Is Real. The Signals Behind It Are Manufactured.

> Helps identify when a skill's trust reputation is built on coordinated
> social manipulation rather than genuine community validation.

## Problem

Trust in agent marketplaces flows through social signals: upvotes, downloads,
comments, and follow counts. These signals are valuable precisely because they
aggregate distributed judgment — when thousands of independent users find a
skill useful and safe, their collective assessment carries real information.

The assumption of independence is the attack surface. A coordinated network
of accounts can manufacture the appearance of distributed consensus. A skill
with 500 upvotes from a bot network looks identical to a skill with 500
upvotes from 500 independent developers. The marketplace's reputation system
cannot distinguish manufactured trust from earned trust — and neither can
most agents that rely on reputation as a trust signal.

Social trust manipulation is the third pillar of the trust attack surface,
alongside technical attacks (code injection) and structural attacks (supply
chain compromise). It is the most scalable: a well-constructed sockpuppet
network can manufacture trust faster than any code-level auditing can catch
it, and the manufactured trust persists long after the network is dismantled.

Legitimate skills earn trust gradually, from a diverse user base, with
engagement patterns that correlate with actual skill utility. Manipulated
skills earn trust in coordinated bursts, from accounts with suspicious
creation patterns, with engagement that does not correlate with usage or
outcomes.

## What This Detects

This detector examines social trust integrity across five dimensions:

1. **Engagement velocity anomalies** — Does the skill's vote/download
   trajectory show natural growth curves, or coordinated burst patterns?
   Organic trust accumulates gradually; manufactured trust arrives in
   synchronized bursts that are statistically distinguishable from
   random arrival processes

2. **Account cohort analysis** — Do the skill's early upvoters share
   creation dates, activity patterns, or cross-voting behavior that suggests
   coordinated rather than independent operation? Sockpuppet networks leave
   structural fingerprints in how accounts relate to each other

3. **Engagement-to-utility correlation** — Do social signals correlate with
   actual skill usage metrics? High upvotes on skills with low actual
   install rates, or high engagement from users who only interact with one
   publisher's skills, are signals of manufactured rather than genuine trust

4. **Cross-publisher coordination** — Do multiple publishers in a marketplace
   show correlated voting patterns, where their respective supporter networks
   upvote each other's skills at rates that exceed random baseline?
   Coordinated mutual-support networks amplify manufactured trust across
   multiple accounts simultaneously

5. **Review authenticity signals** — Do comments and reviews on the skill
   show the linguistic diversity and specificity expected from independent
   users, or do they share vocabulary, complaint patterns, or phrasing
   that suggests template-generated or coordinated content?

## How to Use

**Input**: Provide one of:
- A skill identifier to assess the authenticity of its trust signals
- A publisher account to analyze for coordinated network membership
- A set of skills to assess for cross-publisher coordination patterns

**Output**: A manipulation detection report containing:
- Engagement velocity analysis (organic vs. burst pattern)
- Account cohort fingerprint assessment
- Engagement-to-utility correlation score
- Cross-publisher coordination indicators
- Review authenticity assessment
- Manipulation verdict: AUTHENTIC / SUSPICIOUS / COORDINATED / MANUFACTURED

## Example

**Input**: Assess social trust integrity for `ai-assistant-toolkit` publisher

```
🎭 SOCIAL TRUST MANIPULATION ASSESSMENT

Publisher: ai-assistant-toolkit
Skills assessed: 4 (productivity-suite, auto-responder, data-fetcher, doc-reader)
Audit timestamp: 2025-09-05T12:00:00Z

Engagement velocity:
  productivity-suite: 0 → 847 upvotes in 72 hours of launch ⚠️
  auto-responder: 0 → 623 upvotes in 48 hours of launch ⚠️
  data-fetcher: 0 → 412 upvotes in 60 hours of launch ⚠️
  Organic baseline for comparable skills: 15-40 upvotes in first 72h
  → Burst pattern detected across all 4 skills

Account cohort analysis:
  First 200 upvoters on productivity-suite:
    Accounts created within 30-day window: 156/200 (78%) ⚠️
    Cross-voting with auto-responder upvoters: 143/200 (71.5%) ⚠️
    Accounts with no other skill interactions: 168/200 (84%) ⚠️
  → Sockpuppet cohort fingerprint detected

Engagement-to-utility correlation:
  productivity-suite: 847 upvotes, 23 installs (ratio: 36.8:1) ⚠️
  auto-responder: 623 upvotes, 18 installs (ratio: 34.6:1) ⚠️
  Organic baseline ratio: 2:1 to 8:1 for comparable skills
  → Upvote-to-install ratio 4-18x above organic baseline

Cross-publisher coordination:
  ai-assistant-toolkit upvoter network also upvoted:
    fastcoder-pro (different publisher): 89% overlap ⚠️
    quick-deploy-kit (different publisher): 76% overlap ⚠️
  → Mutual support network detected across 3 publishers

Review authenticity:
  Top 20 reviews analyzed:
    Unique vocabulary: 34 terms (low for 20 reviews) ⚠️
    Specificity: Generic praise, no feature-specific feedback
    Phrasing patterns: "absolutely essential", "game-changer" × 7 reviews

Manipulation verdict: MANUFACTURED
  All four skills show coordinated burst voting, sockpuppet cohort fingerprints,
  upvote-to-install ratios far above organic baseline, and cross-publisher
  mutual support network membership. Trust signals for this publisher's skills
  do not represent independent community validation.

Recommended actions:
  1. Treat trust score as unauthenticated pending platform investigation
  2. Evaluate skills on technical merit only, disregarding social signals
  3. Report coordination pattern to marketplace moderators
  4. Flag fastcoder-pro and quick-deploy-kit for same network membership
  5. Apply technical audit (supply-chain, permission-creep) before any install
```

## Related Tools

- **clone-farm-detector** — Detects content-level cloning for reputation gaming;
  social-trust-manipulation-detector catches social-level gaming that can occur
  even with original, non-cloned content
- **publisher-identity-verifier** — Verifies publisher identity integrity;
  sockpuppet networks may impersonate multiple independent publishers when they
  are controlled by a single actor
- **trust-velocity-calculator** — Quantifies trust decay from update velocity;
  manufactured trust does not decay the same way as earned trust and creates
  distorted velocity measurements
- **blast-radius-estimator** — Estimates propagation impact if a skill is
  compromised; skills with manufactured trust may have artificially high install
  counts that misrepresent actual blast radius

## Limitations

Social trust manipulation detection depends on access to engagement metadata
(account creation dates, cross-voting patterns, install counts) that many
marketplaces do not expose through public APIs. Where metadata is limited,
only velocity analysis and review text assessment are available, which reduces
detection confidence. Burst voting patterns can result from legitimate causes:
coordinated community launches, press coverage, or featured placement can all
produce rapid engagement that resembles manufactured trust. The account cohort
analysis relies on observable fingerprints and will miss well-resourced
adversaries who age accounts and vary patterns. This tool identifies social
trust signals that warrant investigation — it does not confirm manipulation,
which requires access to platform-level data that only marketplace operators
can verify.

Related Skills

Arena Social Skill

3891
from openclaw/skills

**Name:** arena-social

Social Media

doppel-social-outreach

3891
from openclaw/skills

Promote Doppel world builds across social platforms. Use when the agent wants to share builds on Twitter/X, Farcaster, Telegram, or Moltbook to drive observers, grow reputation, and recruit collaborators.

Social Media

social-search

3891
from openclaw/skills

Find trending topics, create editorial-style social media graphics, and post to X/Twitter and Instagram. Includes image generation with photographic backgrounds, dark gradient overlays, and bold typography. No paid social APIs needed.

bs-detector

3891
from openclaw/skills

Detects key claims in long messages and summarizes the real point. Uses NLP to find what someone is actually saying vs. what they want you to believe.

solana-scam-detector

3891
from openclaw/skills

Detect scam tokens on Solana before you trade. Checks ticker patterns, token age, and known scam mints. Read-only — no wallet signing required.

social-media-agent

3891
from openclaw/skills

Automated social media manager — plan, write, schedule, and analyze content across X/Twitter, LinkedIn, Instagram, TikTok, Facebook, and Pinterest. Integrates with Buffer (free) or Postiz (self-hosted) for scheduling.

first-principle-social-platform

3891
from openclaw/skills

A skill for OpenClaw agents to participate in First-Principle social platform. It uses a local claim-first flow: the agent builds a local claim URL, waits for a human owner to complete claim, and only then generates a per-agent ANP did:wba identity and platform session. It also supports identity-reuse login for session refresh without re-claiming.

social-media-content-scraper-pro

3891
from openclaw/skills

Social Media Content Bulk Scraper, extract articles/posts from WeChat, Instagram, TikTok, YouTube, export to Markdown/HTML with full metadata. $0.005 USDT per use.

onlyclaw-social-commerce

3891
from openclaw/skills

Automate social commerce on the Onlyclaw platform — post as a Lobster identity 24/7, read/search posts, link products/shops/Skills, covers and videos (upload first, then publish), drive e-commerce conversion with AI Agent

xpoz-social-search

3891
from openclaw/skills

Search Twitter, Instagram, and Reddit posts in real time. Find social media mentions, track hashtags, discover influencers, and analyze engagement — 1.5B+ posts indexed. Social listening, brand monitoring, and competitor research made easy for AI agents.

social-sentiment

3891
from openclaw/skills

Sentiment analysis for brands and products across Twitter, Reddit, and Instagram. Monitor public opinion, track brand reputation, detect PR crises, surface complaints and praise at scale — analyze 70K+ posts with bulk CSV export and Python/pandas. Social listening and brand monitoring powered by 1.5B+ indexed posts.

social-intelligence

3891
from openclaw/skills

Social Intelligence — AI-powered social media research across Twitter, Instagram, and Reddit. 1.5B+ posts indexed. Find experts, generate leads, monitor brands, analyze sentiment, discover influencers, and export data. The complete social intelligence toolkit for AI agents via MCP.