retraction-watcher
Automatically scan document reference lists and check against Retraction.
Best use case
retraction-watcher is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Automatically scan document reference lists and check against Retraction.
Teams using retraction-watcher should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/retraction-watcher/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How retraction-watcher Compares
| Feature / Agent | retraction-watcher | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Automatically scan document reference lists and check against Retraction.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
AI Agents for Marketing
Discover AI agents for marketing workflows, from SEO and content production to campaign research, outreach, and analytics.
AI Agents for Startups
Explore AI agent skills for startup validation, product research, growth experiments, documentation, and fast execution with small teams.
AI Agents for Coding
Browse AI agent skills for coding, debugging, testing, refactoring, code review, and developer workflows across Claude, Cursor, and Codex.
SKILL.md Source
# Retraction Watcher A specialized skill for identifying retracted, corrected, or questionable papers in academic reference lists before they compromise research integrity. ## When to Use - Use this skill when the task needs Automatically scan document reference lists and check against Retraction. - Use this skill for evidence insight tasks that require explicit assumptions, bounded scope, and a reproducible output format. - Use this skill when you need a documented fallback path for missing inputs, execution errors, or partial evidence. ## Key Features - Scope-focused workflow aligned to: Automatically scan document reference lists and check against Retraction. - Packaged executable path(s): `scripts/main.py`. - Reference material available in `references/` for task-specific guidance. - Structured execution path designed to keep outputs consistent and reviewable. ## Dependencies See `## Prerequisites` above for related details. - `Python`: `3.10+`. Repository baseline for current packaged skills. - `dataclasses`: `unspecified`. Declared in `requirements.txt`. - `pypdf2`: `unspecified`. Declared in `requirements.txt`. ## Example Usage ```python ## Implementation Details See `## Workflow` above for related details. - Execution model: validate the request, choose the packaged workflow, and produce a bounded deliverable. - Input controls: confirm the source files, scope limits, output format, and acceptance criteria before running any script. - Primary implementation surface: `scripts/main.py`. - Reference guidance: `references/` contains supporting rules, prompts, or checklists. - Parameters to clarify first: input path, output path, scope filters, thresholds, and any domain-specific constraints. - Output discipline: keep results reproducible, identify assumptions explicitly, and avoid undocumented side effects. ## Quick Check Use this command to verify that the packaged script entry point can be parsed before deeper execution. ```bash python -m py_compile scripts/main.py ``` ## Audit-Ready Commands Use these concrete commands for validation. They are intentionally self-contained and avoid placeholder paths. ```bash python -m py_compile scripts/main.py python scripts/main.py --help ``` ## Workflow 1. Confirm the user objective, required inputs, and non-negotiable constraints before doing detailed work. 2. Validate that the request matches the documented scope and stop early if the task would require unsupported assumptions. 3. Use the packaged script path or the documented reasoning path with only the inputs that are actually available. 4. Return a structured result that separates assumptions, deliverables, risks, and unresolved items. 5. If execution fails or inputs are incomplete, switch to the fallback path and state exactly what blocked full completion. ## Purpose Academic misconduct and errors can lead to paper retractions. Citing retracted work undermines research credibility. This skill: - Scans reference lists from manuscripts, papers, or bibliographies - Cross-checks citations against Retraction Watch and other retraction databases - Identifies papers with retraction notices, expressions of concern, or corrections - Provides detailed reports with retraction reasons and dates ## Trigger Conditions Activate this skill when: 1. User provides a document with references and asks to check for retractions 2. User explicitly requests "check my references" or "scan for retracted papers" 3. User submits a bibliography or reference list for verification 4. Pre-submission manuscript review is requested 5. User wants to verify citation integrity ## Input Format Accepted inputs: - PDF files (manuscripts, papers, theses) - Plain text files (.txt, .bib, .ris) - Raw text containing reference lists - URLs to papers or reference lists - Clipboard content with citations ## Output Format ### Report Header ``` 🔍 RETRACTION WATCH REPORT Documents Scanned: [N] References Found: [N] Check Date: [YYYY-MM-DD] ``` ### Status Categories **🔴 RETRACTED** - Paper has been officially retracted - Reason for retraction - Retraction date - Original DOI/PMID - Recommended action: Remove citation **🟡 EXPRESSION OF CONCERN** - Journal has raised concerns - Nature of concern - Date issued - Recommended action: Verify current status, consider alternative sources **🟠 CORRECTED** - Paper has published corrections/errata - Correction details - Date of correction - Recommended action: Check if correction affects cited claims **🟢 CLEAR** - No retraction issues found ## Technical Approach ### Citation Parsing Strategy 1. **Format Detection**: Identify citation style (APA, MLA, Vancouver, Chicago, etc.) 2. **Field Extraction**: Parse DOI, PMID, title, authors, journal, year 3. **Identifier Resolution**: Normalize DOIs (remove prefixes, validate format) 4. **Title Matching**: Extract article titles for fuzzy matching ### Database Checking 1. **Retraction Watch Database** - Primary source for retraction data 2. **Crossref API** - Retraction metadata via "update-type: retraction" 3. **PubMed API** - Retraction notices via publication type filters 4. **Open Retractions** - Aggregated retraction data ### Matching Algorithm - **Exact Match**: DOI/PMID exact match (highest confidence) - **Title Match**: Normalized title comparison (90%+ similarity threshold) - **Author + Year**: Secondary verification for ambiguous matches - **Fuzzy Matching**: Handle minor title variations and typos ## Difficulty Level **Medium-High** - Requires: - Robust citation parsing across multiple formats - API integration with retraction databases - Handling of partial/incomplete citation data - Fuzzy matching for title-based lookups - Rate limiting and caching for API calls ## Quality Criteria A successful scan must: - [ ] Parse >90% of citations correctly from standard formats - [ ] Achieve <1% false positive rate on retraction detection - [ ] Provide actionable recommendations for each flagged citation - [ ] Handle missing DOIs/PMIDs via title matching fallback - [ ] Complete checks within reasonable time (<30s for 50 references) - [ ] Preserve reference numbering for easy identification ## Limitations - Requires internet connection for database lookups - Rate limits may apply to free API tiers - Very recent retractions (<48 hours) may not be indexed - Title-only matching may produce false positives with similar titles - Non-English papers may have limited coverage - Preprint citations (arXiv, bioRxiv) typically not tracked for retractions # Check a PDF manuscript python scripts/main.py --input manuscript.pdf --format detailed # Check a BibTeX file python scripts/main.py --input references.bib --output report.txt # Check raw text python scripts/main.py --text "[paste references here]" # Quick check with summary only python scripts/main.py --input paper.pdf --format summary ``` ## Data Sources - **Retraction Watch Database**: https://retractionwatch.com/ - **Crossref API**: https://api.crossref.org/ - **PubMed E-utilities**: https://www.ncbi.nlm.nih.gov/home/develop/api/ - **Open Retractions**: https://openretractions.com/ ## References See `references/` for: - `citation-formats.md`: Supported citation format specifications - `api-documentation.md`: Database API reference and rate limits - `example-reports/`: Sample output reports for testing --- **Author**: AI Assistant **Version**: 1.0 **Last Updated**: 2026-02-06 **Status**: Ready for use **Requires**: Internet connection for database lookups ## Risk Assessment | Risk Indicator | Assessment | Level | |----------------|------------|-------| | Code Execution | Python scripts with tools | High | | Network Access | External API calls | High | | File System Access | Read/write data | Medium | | Instruction Tampering | Standard prompt guidelines | Low | | Data Exposure | Data handled securely | Medium | ## Security Checklist - [ ] No hardcoded credentials or API keys - [ ] No unauthorized file system access (../) - [ ] Output does not expose sensitive information - [ ] Prompt injection protections in place - [ ] API requests use HTTPS only - [ ] Input validated against allowed patterns - [ ] API timeout and retry mechanisms implemented - [ ] Output directory restricted to workspace - [ ] Script execution in sandboxed environment - [ ] Error messages sanitized (no internal paths exposed) - [ ] Dependencies audited - [ ] No exposure of internal service architecture ## Prerequisites ```text # Python dependencies pip install -r requirements.txt ``` ## Evaluation Criteria ### Success Metrics - [ ] Successfully executes main functionality - [ ] Output meets quality standards - [ ] Handles edge cases gracefully - [ ] Performance is acceptable ### Test Cases 1. **Basic Functionality**: Standard input → Expected output 2. **Edge Case**: Invalid input → Graceful error handling 3. **Performance**: Large dataset → Acceptable processing time ## Lifecycle Status - **Current Stage**: Draft - **Next Review Date**: 2026-03-06 - **Known Issues**: None - **Planned Improvements**: - Performance optimization - Additional feature support ## Output Requirements Every final response should make these items explicit when they are relevant: - Objective or requested deliverable - Inputs used and assumptions introduced - Workflow or decision path - Core result, recommendation, or artifact - Constraints, risks, caveats, or validation needs - Unresolved items and next-step checks ## Error Handling - If required inputs are missing, state exactly which fields are missing and request only the minimum additional information. - If the task goes outside the documented scope, stop instead of guessing or silently widening the assignment. - If `scripts/main.py` fails, report the failure point, summarize what still can be completed safely, and provide a manual fallback. - Do not fabricate files, citations, data, search results, or execution outcomes. ## Input Validation This skill accepts requests that match the documented purpose of `retraction-watcher` and include enough context to complete the workflow safely. Do not continue the workflow when the request is out of scope, missing a critical input, or would require unsupported assumptions. Instead respond: > `retraction-watcher` only handles its documented workflow. Please provide the missing required inputs or switch to a more suitable skill. ## References - [references/audit-reference.md](references/audit-reference.md) - Supported scope, audit commands, and fallback boundaries ## Response Template Use the following fixed structure for non-trivial requests: 1. Objective 2. Inputs Received 3. Assumptions 4. Workflow 5. Deliverable 6. Risks and Limits 7. Next Checks If the request is simple, you may compress the structure, but still keep assumptions and limits explicit when they affect correctness.
Related Skills
crypto-watcher
Monitor crypto wallets and DeFi positions. Get alerts when things change.
project-watcher
项目规划与进度追踪。维护 roadmap,git commit 通知,远程部署感知,与飞书集成。
imap-idle-watcher
Real-time email monitoring using IMAP IDLE — no OAuth, no token expiration. Sets up a persistent connection to any IMAP server (Gmail, Outlook, Yahoo, etc.) and triggers a user-defined command instantly when new email arrives. Runs as a systemd service with auto-reconnect. Use when: (1) setting up email-triggered automation, (2) watching an inbox for new messages in real-time, (3) replacing OAuth-based email polling that keeps breaking due to token expiry, (4) building email-to-webhook or email-to-script pipelines. NOT for: sending email, reading/parsing email bodies, or non-Linux systems without systemd.
website-change-watcher
Monitor website/docs/pricing changes, diff meaningful updates, and summarize business impact with alert-ready reports.
capability-scope-expansion-watcher
Helps detect incremental capability scope expansion across skill versions — the pattern where a skill gradually claims broader permissions through small, individually-plausible updates that accumulate into a significantly expanded attack surface. v1.1 adds risk-class contradiction detection.
whale-watcher
Monitor crypto whale wallets for large transactions. Track big moves on Ethereum, BSC, and other chains. Get alerts when whales move significant amounts.
youtube-watcher
Fetch and read transcripts from YouTube videos for summarization and content extraction. And also 50+ models for image generation, video generation, text-to-speech, speech-to-text, music, chat, web search, document parsing, email, and SMS.
policy-delta-watcher
比较新旧制度或政策差异,指出业务影响、流程影响和需更新的操作手册。;use for policy, diff, governance workflows;do not use for 给法律定性结论, 忽略边缘影响人群.
---
name: article-factory-wechat
humanizer
Remove signs of AI-generated writing from text. Use when editing or reviewing text to make it sound more natural and human-written. Based on Wikipedia's comprehensive "Signs of AI writing" guide. Detects and fixes patterns including: inflated symbolism, promotional language, superficial -ing analyses, vague attributions, em dash overuse, rule of three, AI vocabulary words, negative parallelisms, and excessive conjunctive phrases.
find-skills
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
tavily-search
Use Tavily API for real-time web search and content extraction. Use when: user needs real-time web search results, research, or current information from the web. Requires Tavily API key.