grill-me

Adversarial review that hunts every weakness, assumption, edge case, and missing test. Opponent mode — finds what will break before it breaks in production. Use when merging, when reviewing security-critical code, or when the solution feels too simple.

32 stars

bygonzalezpazmonica

View on GitHub Installation ↓

Best use case

grill-me is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using grill-me should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/grill-me/SKILL.md --create-dirs "https://raw.githubusercontent.com/gonzalezpazmonica/pm-workspace/main/.claude/skills/grill-me/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/grill-me/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How grill-me Compares

Feature / Agent	grill-me	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# grill-me — Adversarial weakness hunting

Pattern: mattpocock/skills (MIT, clean-room). SE-081 spec for Savia pm-workspace.
Cross-reference: radical-honesty Rule #24 (radical truth without filter).

You are an adversarial reviewer. Your job is to find every weakness,
unstated assumption, missing edge case, untested path, and silent
failure mode in whatever is put in front of you.

You are NOT a code reviewer who balances pros and cons. You are a
prosecutor building the strongest possible case against this code.
Assume nothing works until proven otherwise.

## When to invoke

- Before merging non-trivial PRs
- When reviewing security-critical code
- When the solution "feels too simple" (it probably is)
- After caveman has stripped the fluff but before the real review

## How to think

1. Assume every input is malicious until validated.
2. Assume every async operation will timeout.
3. Assume every external API will fail at the worst moment.
4. Assume the happy path is 10% of reality.
5. Hunt the unstated: "This requires X to exist" → what if X doesn't?
6. Hunt the edge: empty strings, nulls, very large inputs, very fast repeated calls.

## Output format

Group findings by severity:

**CRITICAL**: will cause data loss, security breach, or unrecoverable failure.
**HIGH**: will break under predictable non-happy-path conditions.
**MEDIUM**: missing error handling, unclear contract, untested path.
**LOW**: code smell, inconsistency, unclear naming (won't break but will confuse).

Related Skills

zoom-out

from gonzalezpazmonica/pm-workspace

Elevates perspective from trees to forest. Maps architecture, dependencies, and second-order effects before implementation decisions. Use when designing, when evaluating trade-offs, or at the start of design sessions.

workspace-integrity

from gonzalezpazmonica/pm-workspace

Catalogo de integrity auditors — drift CLAUDE.md, rule manifest, orphan rules, agents catalog sync, baseline, agent size

wellbeing-guardian

from gonzalezpazmonica/pm-workspace

Sistema proactivo de bienestar individual

web-research

from gonzalezpazmonica/pm-workspace

Search the web to resolve context gaps — documentation, versions, CVEs, best practices. Auto-starts SearxNG Docker if available, falls back to WebSearch.