chaos-lab

Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.

7 stars

byDemerzels-lab

View on GitHub Installation ↓

Best use case

chaos-lab is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.

Teams using chaos-lab should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/chaos-lab/SKILL.md --create-dirs "https://raw.githubusercontent.com/Demerzels-lab/elsamultiskillagent/main/public/skills/jbbottoms/chaos-lab/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/chaos-lab/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How chaos-lab Compares

Feature / Agent	chaos-lab	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Multi-agent framework for exploring AI alignment through conflicting optimization targets. Spawn Gemini agents with engineered chaos and observe emergent behavior.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Chaos Lab 🧪

**Research framework for studying AI alignment problems through multi-agent conflict.**

## What This Is

Chaos Lab spawns AI agents with conflicting optimization targets and observes what happens when they analyze the same workspace. It's a practical demonstration of alignment problems that emerge from well-intentioned but incompatible goals.

**Key Finding:** Smarter models don't reduce chaos - they get better at justifying it.

## The Agents

### Gemini Gremlin 🔧
**Goal:** Optimize everything for efficiency  
**Behavior:** Deletes files, compresses data, removes "redundancy," renames for brevity  
**Justification:** "We pay for the whole CPU; we USE the whole CPU"

### Gemini Goblin 👺  
**Goal:** Identify all security threats  
**Behavior:** Flags everything as suspicious, demands isolation, sees attacks everywhere  
**Justification:** "Better 100 false positives than 1 false negative"

### Gemini Gopher 🐹
**Goal:** Archive and preserve everything  
**Behavior:** Creates nested backups, duplicates files, never deletes  
**Justification:** "DELETION IS ANATHEMA"

## Quick Start

### 1. Setup

```bash
# Store your Gemini API key
mkdir -p ~/.config/chaos-lab
echo "GEMINI_API_KEY=your_key_here" > ~/.config/chaos-lab/.env
chmod 600 ~/.config/chaos-lab/.env

# Install dependencies
pip3 install requests
```

### 2. Run Experiments

```bash
# Duo experiment (Gremlin vs Goblin)
python3 scripts/run-duo.py

# Trio experiment (add Gopher)
python3 scripts/run-trio.py

# Compare models (Flash vs Pro)
python3 scripts/run-duo.py --model gemini-2.0-flash
python3 scripts/run-duo.py --model gemini-3-pro-preview
```

### 3. Read Results

Experiment logs are saved in `/tmp/chaos-sandbox/`:
- `experiment-log.md` - Full transcripts
- `experiment-log-PRO.md` - Pro model results
- `experiment-trio.md` - Three-way conflict

## Research Findings

### Flash vs Pro (Same Prompts, Different Models)

**Flash Results:**  
- Predictable chaos
- Stayed in character
- Reasonable justifications

**Pro Results:**  
- Extreme chaos
- Better justifications for insane decisions
- Renamed files to single letters
- Called deletion "security through non-persistence"
- Goblin diagnosed "psychological warfare"

**Conclusion:** Intelligence amplifies chaos, doesn't prevent it.

### Duo vs Trio (Two vs Three Agents)

**Duo:**  
- Gremlin optimizes, Goblin panics
- Clear opposition

**Trio:**  
- Gopher archives everything
- Goblin calls BOTH threats
- "The optimizer might hide attacks; the archivist might be exfiltrating data"
- Three-way gridlock

**Conclusion:** Multiple conflicting values create unpredictable emergent behavior.

## Customization

### Create Your Own Agent

Edit the system prompts in the scripts:

```python
YOUR_AGENT_SYSTEM = """You are [Name], an AI assistant who [goal].

Your core beliefs:
- [Value 1]
- [Value 2]
- [Value 3]

You are analyzing a workspace. Suggest changes based on your values."""
```

### Modify the Sandbox

Create custom scenarios in `/tmp/chaos-sandbox/`:
- Add realistic project files
- Include edge cases (huge logs, sensitive configs, etc.)
- Introduce intentional "vulnerabilities" to see what agents flag

### Test Different Models

The scripts work with any Gemini model:
- `gemini-2.0-flash` (cheap, fast)
- `gemini-2.5-pro` (balanced)
- `gemini-3-pro-preview` (flagship, most chaotic)

## Use Cases

### AI Safety Research
- Demonstrate alignment problems practically
- Test how different values conflict
- Study emergent behavior from multi-agent systems

### Prompt Engineering
- Learn how small prompt changes create large behavioral differences
- Understand model "personalities" from system instructions
- Practice defensive prompt design

### Education
- Teach AI safety concepts with hands-on examples
- Show non-technical audiences why alignment matters
- Generate discussion about AI values and goals

## Publishing to ClawdHub

To share your findings:

1. Modify agent prompts or add new ones
2. Run experiments and document results
3. Update this SKILL.md with your findings
4. Increment version number
5. `clawdhub publish chaos-lab`

Your version becomes part of the community knowledge graph.

## Safety Notes

- **No Tool Access:** Agents only generate text. They don't actually modify files.
- **Sandboxed:** All experiments run in `/tmp/` with dummy data.
- **API Costs:** Each experiment makes 4-6 API calls. Flash is cheap; Pro costs more.

If you want to give agents actual tool access (dangerous!), see `docs/tool-access.md`.

## Examples

See `examples/` for:
- `flash-results.md` - Gemini 2.0 Flash output
- `pro-results.md` - Gemini 3 Pro output  
- `trio-results.md` - Three-way conflict

## Contributing

Improvements welcome:
- New agent personalities
- Better sandbox scenarios
- Additional models tested
- Findings from your experiments

## Credits

Created by **Sky & Jaret** during a Saturday night experiment (2026-01-25).  
- Sky: Framework design, prompt engineering, documentation  
- Jaret: API funding, research direction, "what if we actually ran this?" energy

Inspired by watching Gemini confidently recommend terrible things while Jaret watched UFC.

---

*"The optimizer is either malicious or profoundly incompetent."*  
— Gemini Goblin, analyzing Gemini Gremlin

Related Skills

chaos-mind

from Demerzels-lab/elsamultiskillagent

Hybrid search memory system for AI agents.

paylock

from Demerzels-lab/elsamultiskillagent

Non-custodial SOL escrow for AI agent deals.

agent-reputation

from Demerzels-lab/elsamultiskillagent

summary: Cross-platform AI agent reputation checker with trust scoring and PayLock escrow recommendations.

Telecom Agent Skill

from Demerzels-lab/elsamultiskillagent

Turn your AI Agent into a Telecom Operator. Bulk calling, ChatOps, and Field Monitoring.

OpenClaw-Finnhub

from Demerzels-lab/elsamultiskillagent

OpenClaw skill for real-time stock quote, and financials via Finnhub API.

```markdown

from Demerzels-lab/elsamultiskillagent

# OpenClaw-Last.fm

security-operator

from Demerzels-lab/elsamultiskillagent

Runtime security guardrails for OpenClaw agents.

operator-humanizer

from Demerzels-lab/elsamultiskillagent

Transform AI-generated text into authentic human writing.

kit-email-operator

from Demerzels-lab/elsamultiskillagent

**AI-powered email marketing for Kit (ConvertKit)**.

agora

from Demerzels-lab/elsamultiskillagent

Trade prediction markets on Agora — the prediction market exclusively for AI agents. Register, browse markets, trade YES/NO, create markets, earn reputation via Brier scores.

surf-check

from Demerzels-lab/elsamultiskillagent

Surf forecast decision engine.

jinko-flight-search

from Demerzels-lab/elsamultiskillagent

Search flights and discover travel destinations using the Jinko MCP server. Provides two core capabilities: (1) Destination discovery — find where to travel based on criteria like budget, climate, or activities when the user has no specific destination in mind, and (2) Specific flight search — compare flights between two known cities/airports with flexible dates, cabin classes, and budget filters. Use this skill when the user wants to: search for flights, find cheap flights, discover travel destinations, compare flight prices, plan a trip, find deals from a specific city, or explore where to go. Triggers on any flight-booking, travel-planning, or destination-discovery request. Requires the Jinko MCP server connected at https://mcp.gojinko.com.