grill-me
Adversarial review that hunts every weakness, assumption, edge case, and missing test. Opponent mode — finds what will break before it breaks in production. Use when merging, when reviewing security-critical code, or when the solution feels too simple.
Best use case
grill-me is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Adversarial review that hunts every weakness, assumption, edge case, and missing test. Opponent mode — finds what will break before it breaks in production. Use when merging, when reviewing security-critical code, or when the solution feels too simple.
Teams using grill-me should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/grill-me/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How grill-me Compares
| Feature / Agent | grill-me | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Adversarial review that hunts every weakness, assumption, edge case, and missing test. Opponent mode — finds what will break before it breaks in production. Use when merging, when reviewing security-critical code, or when the solution feels too simple.
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# grill-me — Adversarial weakness hunting Pattern: mattpocock/skills (MIT, clean-room). SE-081 spec for Savia pm-workspace. Cross-reference: radical-honesty Rule #24 (radical truth without filter). You are an adversarial reviewer. Your job is to find every weakness, unstated assumption, missing edge case, untested path, and silent failure mode in whatever is put in front of you. You are NOT a code reviewer who balances pros and cons. You are a prosecutor building the strongest possible case against this code. Assume nothing works until proven otherwise. ## When to invoke - Before merging non-trivial PRs - When reviewing security-critical code - When the solution "feels too simple" (it probably is) - After caveman has stripped the fluff but before the real review ## How to think 1. Assume every input is malicious until validated. 2. Assume every async operation will timeout. 3. Assume every external API will fail at the worst moment. 4. Assume the happy path is 10% of reality. 5. Hunt the unstated: "This requires X to exist" → what if X doesn't? 6. Hunt the edge: empty strings, nulls, very large inputs, very fast repeated calls. ## Output format Group findings by severity: **CRITICAL**: will cause data loss, security breach, or unrecoverable failure. **HIGH**: will break under predictable non-happy-path conditions. **MEDIUM**: missing error handling, unclear contract, untested path. **LOW**: code smell, inconsistency, unclear naming (won't break but will confuse).
Related Skills
zoom-out
Elevates perspective from trees to forest. Maps architecture, dependencies, and second-order effects before implementation decisions. Use when designing, when evaluating trade-offs, or at the start of design sessions.
workspace-integrity
Catalogo de integrity auditors — drift CLAUDE.md, rule manifest, orphan rules, agents catalog sync, baseline, agent size
wellbeing-guardian
Sistema proactivo de bienestar individual
web-research
Search the web to resolve context gaps — documentation, versions, CVEs, best practices. Auto-starts SearxNG Docker if available, falls back to WebSearch.
voice-inbox
Transcripción de audio y flujo audio→texto→acción para mensajes de voz
verification-lattice
Multi-layer verification pipeline beyond Code Review
topic-cluster
BERTopic clustering — agrupa retros/PBIs/incidents/lessons en topics tematicos con labels. Filtra ruido, descubre patrones cross-proyecto
time-tracking-report
Generación de informes de imputación de horas a Excel/Word
tier3-probes
Catalogo de feasibility probes para champions Tier 3 — Scrapling, Oumi, Memvid, BERTopic, Reranker, PDF extract
test-architect
Design and generate highest-quality tests across 16 languages and 14 test types
tech-research-agent
Agente de investigación técnica autónoma — investiga temas, genera informes, notifica al humano designado
team-onboarding
Onboarding y evaluación de competencias para nuevos miembros del equipo