cassandra

Apache Cassandra distributed database for high availability. Use for distributed systems.

7 stars

Best use case

cassandra is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Apache Cassandra distributed database for high availability. Use for distributed systems.

Teams using cassandra should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/cassandra/SKILL.md --create-dirs "https://raw.githubusercontent.com/G1Joshi/Agent-Skills/main/skills/databases/cassandra/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/cassandra/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How cassandra Compares

Feature / AgentcassandraStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Apache Cassandra distributed database for high availability. Use for distributed systems.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Apache Cassandra

Cassandra is a wide-column store database designed for scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data.

## When to Use

- **High Write Throughput**: Ingests millions of writes per second.
- **Always On**: Zero single points of failure. Updates can happen even if nodes are down (Eventual Consistency).
- **Multi-Region**: Active-Active multi-region replication is built-in.

## Quick Start (CQL)

```sql
CREATE TABLE users (
  user_id UUID PRIMARY KEY,
  name text,
  email text
);

INSERT INTO users (user_id, name) VALUES (uuid(), 'Alice');
```

## Core Concepts

### Partition Key & Clustering Key

- **Partition Key**: Determines which node holds the data.
- **Clustering Key**: Sorts data _within_ the partition on disk.

### Tunable Consistency

You choose consistency level per query.

- `ANY`: Fastest, least specific.
- `QUORUM`: Majority must acknowledge. Balanced.
- `ALL`: Slowest, safest.

### Vector Search (5.0+)

Native support for Vector Search (ANN) allows using Cassandra as a Vector DB for AI apps.

## Best Practices (2025)

**Do**:

- **Query by Partition Key**: Always. Scans are prohibited in production.
- **Use SAI (Storage Attached Indexes)**: New in 5.0. Better than old secondary indexes.
- **Denormalize**: Optimize schema for Reads. It is okay to duplicate data into 3 tables to satisfy 3 different query patterns.

**Don't**:

- **Don't use distributed joins**: Cassandra doesn't do joins. Join in the app.
- **Don't use large partitions**: Keep partitions under 100MB to avoid compaction issues.

## References

- [Apache Cassandra Documentation](https://cassandra.apache.org/doc/latest/)