Best use case
vmas-simulator-guide is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Vectorized multi-agent reinforcement learning simulator
Teams using vmas-simulator-guide should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/vmas-simulator-guide/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How vmas-simulator-guide Compares
| Feature / Agent | vmas-simulator-guide | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Vectorized multi-agent reinforcement learning simulator
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# VMAS: Vectorized Multi-Agent Simulator Guide
## Overview
VMAS is a vectorized simulator for multi-agent reinforcement learning (MARL) that runs thousands of parallel environments on GPU via PyTorch. It provides a diverse set of 2D cooperative, competitive, and mixed scenarios for benchmarking multi-agent algorithms. Orders of magnitude faster than CPU-based simulators, enabling rapid research iteration on multi-agent coordination problems.
## Installation
```bash
pip install vmas
```
## Quick Start
```python
import vmas
# Create vectorized environment
env = vmas.make_env(
scenario="simple_spread",
num_envs=1024, # Parallel environments
num_agents=3,
device="cuda", # GPU acceleration
continuous_actions=True,
)
# Environment loop
obs = env.reset()
for step in range(100):
# Random actions for demonstration
actions = [env.action_space[i].sample()
for i in range(env.n_agents)]
obs, rewards, dones, infos = env.step(actions)
# obs: list of [num_envs, obs_dim] tensors
# rewards: list of [num_envs] tensors
```
## Scenarios
| Scenario | Type | Agents | Description |
|----------|------|--------|-------------|
| **simple_spread** | Cooperative | 3 | Cover N landmarks |
| **simple_tag** | Competitive | 4 | Predator-prey |
| **transport** | Cooperative | 4 | Move package to goal |
| **wheel** | Cooperative | 4 | Coordination on wheel |
| **flocking** | Cooperative | 5+ | Reynolds flocking |
| **discovery** | Cooperative | 3 | Explore and discover |
| **navigation** | Mixed | N | Multi-agent navigation |
## Integration with MARL Libraries
```python
# With TorchRL
from torchrl.envs import VmasEnv
env = VmasEnv(
scenario="simple_spread",
num_envs=512,
device="cuda",
)
# With RLlib
from ray.rllib.env import MultiAgentEnv
# VMAS provides RLlib-compatible wrapper
# With CleanRL / custom training
import torch
env = vmas.make_env("transport", num_envs=2048, device="cuda")
obs = env.reset()
# All tensors on GPU — train directly without CPU transfer
policy_output = policy_network(obs[0]) # Agent 0 observations
```
## Custom Scenarios
```python
from vmas import Scenario, Agent, World, Landmark
class MyScenario(Scenario):
def make_world(self, batch_dim, device):
world = World(batch_dim=batch_dim, device=device)
world.add_agent(Agent(name="agent_0"))
world.add_agent(Agent(name="agent_1"))
world.add_landmark(Landmark(name="goal"))
return world
def reset_world(self, env, world):
# Randomize positions
for agent in world.agents:
agent.set_pos(torch.rand(env.batch_dim, 2) * 2 - 1)
def reward(self, agent, world):
# Distance to goal
goal = world.landmarks[0]
return -torch.linalg.norm(agent.state.pos - goal.state.pos,
dim=-1)
# Register and use
env = vmas.make_env(MyScenario(), num_envs=512)
```
## Use Cases
1. **MARL research**: Benchmark multi-agent algorithms
2. **Cooperative learning**: Study emergent coordination
3. **Scalability testing**: GPU-accelerated parallel training
4. **Custom scenarios**: Design domain-specific multi-agent tasks
5. **Education**: Teach multi-agent RL concepts
## References
- [VMAS GitHub](https://github.com/proroklab/VectorizedMultiAgentSimulator)
- [VMAS Paper](https://arxiv.org/abs/2207.03530)
- [BenchMARL](https://github.com/facebookresearch/BenchMARL)Related Skills
thuthesis-guide
Write Tsinghua University theses using the ThuThesis LaTeX template
thesis-writing-guide
Templates, formatting rules, and strategies for thesis and dissertation writing
thesis-template-guide
Set up LaTeX templates for PhD and Master's thesis documents
sjtuthesis-guide
Write SJTU theses using the SJTUThesis LaTeX template with full compliance
novathesis-guide
LaTeX thesis template supporting multiple universities and formats
graphical-abstract-guide
Create SVG graphical abstracts for journal paper submissions
beamer-presentation-guide
Guide to creating academic presentations with LaTeX Beamer
plagiarism-detection-guide
Use plagiarism detection tools and ensure manuscript originality
paper-polish-guide
Review and polish LaTeX research papers for clarity and style
grammar-checker-guide
Use grammar and style checking tools to polish academic manuscripts
conciseness-editing-guide
Eliminate wordiness and redundancy in academic prose for clarity
academic-translation-guide
Academic translation, post-editing, and Chinglish correction guide