api-performance-api-performance
Query optimization, caching, indexing
Best use case
api-performance-api-performance is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Query optimization, caching, indexing
Teams using api-performance-api-performance should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/api-performance-api-performance/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How api-performance-api-performance Compares
| Feature / Agent | api-performance-api-performance | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Query optimization, caching, indexing
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
SKILL.md Source
# Backend Performance Optimization
> **Quick Guide:** Optimize backend performance through database query optimization (indexes, prepared statements, avoiding N+1), caching strategies (Redis cache-aside, write-through), connection pooling, and non-blocking async patterns. Always measure before optimizing.
---
<critical_requirements>
## CRITICAL: Before Using This Skill
> **All code must follow project conventions in CLAUDE.md** (kebab-case, named exports, import ordering, `import type`, named constants)
**(You MUST always release database connections back to the pool using `finally` blocks)**
**(You MUST use `.with()` or eager loading to prevent N+1 queries - never lazy load in loops)**
**(You MUST set TTL on all cached data to prevent stale data and memory exhaustion)**
**(You MUST offload CPU-intensive work to Worker Threads - blocking the event loop degrades all requests)**
</critical_requirements>
---
**Detailed Resources:**
- For code examples, see [examples/](examples/) folder:
- [caching.md](examples/caching.md) - Redis caching patterns and strategies
- [database.md](examples/database.md) - Query optimization, indexing, connection pooling
- [async.md](examples/async.md) - Event loop, worker threads, CPU-bound tasks
- For decision frameworks and anti-patterns, see [reference.md](reference.md)
---
**Auto-detection:** Redis, connection pool, query optimization, database index, N+1, caching, cache invalidation, prepared statement, worker threads, event loop, CPU-bound, latency, throughput, performance tuning
**When to use:**
- Database queries taking > 100ms
- High-traffic endpoints with repeated data fetches
- API responses with multiple related entities (N+1 risk)
- CPU-intensive operations blocking request handling
- Need to reduce database load or API costs
**When NOT to use:**
- Premature optimization without measuring first
- Simple CRUD with low traffic (adds complexity without benefit)
- Data that changes frequently and must be fresh (caching adds staleness)
- Development/debugging (caching obscures issues)
**Key patterns covered:**
- Database indexing strategies (composite, partial, covering)
- Connection pooling (node-postgres, external poolers)
- N+1 query prevention (eager loading, DataLoader)
- Redis caching (cache-aside, write-through, invalidation)
- Event loop optimization (async patterns, setImmediate)
- Worker threads for CPU-bound operations
- Prepared statements for repeated queries
---
<philosophy>
## Philosophy
Backend performance optimization follows one core principle: **measure first, optimize second**. Premature optimization wastes development time and adds complexity without evidence of benefit.
**The Three Pillars of Backend Performance:**
1. **Database Optimization** - Indexes, query planning, N+1 prevention
2. **Caching** - Reduce repeated expensive operations
3. **Async Efficiency** - Never block the event loop
**When to optimize:**
- Response times exceed SLA thresholds
- Database CPU/memory approaching limits
- Metrics show specific bottlenecks
- Load testing reveals scaling issues
**When NOT to optimize:**
- "It might be slow someday" (premature)
- Following blog post advice without context
- Optimizing cold paths (rarely executed code)
- Before profiling identifies the actual bottleneck
</philosophy>
---
<patterns>
## Core Patterns
### Pattern 1: Database Connection Pooling
Connection pooling reuses database connections instead of creating new ones per request. A PostgreSQL handshake takes 20-30ms - pooling eliminates this overhead.
#### Configuration
```typescript
// Good Example - Proper connection pooling with node-postgres
import { Pool } from "pg";
const POOL_MAX_CONNECTIONS = 20;
const POOL_IDLE_TIMEOUT_MS = 30000;
const POOL_CONNECTION_TIMEOUT_MS = 5000;
const pool = new Pool({
connectionString: process.env.DATABASE_URL,
max: POOL_MAX_CONNECTIONS,
idleTimeoutMillis: POOL_IDLE_TIMEOUT_MS,
connectionTimeoutMillis: POOL_CONNECTION_TIMEOUT_MS,
});
// Listen for pool errors (idle clients can still emit errors)
pool.on("error", (err) => {
console.error("Unexpected pool error:", err);
});
// For simple queries - auto-manages connection lifecycle
async function getUsers() {
const result = await pool.query("SELECT * FROM users WHERE active = $1", [
true,
]);
return result.rows;
}
// For transactions - manual checkout with guaranteed release
async function createUserWithProfile(
userData: UserData,
profileData: ProfileData,
) {
const client = await pool.connect();
try {
await client.query("BEGIN");
const userResult = await client.query(
"INSERT INTO users (name, email) VALUES ($1, $2) RETURNING id",
[userData.name, userData.email],
);
await client.query("INSERT INTO profiles (user_id, bio) VALUES ($1, $2)", [
userResult.rows[0].id,
profileData.bio,
]);
await client.query("COMMIT");
return userResult.rows[0];
} catch (error) {
await client.query("ROLLBACK");
throw error;
} finally {
client.release(); // CRITICAL: Always release back to pool
}
}
// Graceful shutdown
async function shutdown() {
await pool.end();
}
export { pool, getUsers, createUserWithProfile, shutdown };
```
**Why good:** Named constants document configuration, `pool.query()` auto-manages simple queries, `finally` block guarantees connection release preventing pool exhaustion, error listener catches backend failures, graceful shutdown prevents connection leaks
```typescript
// Bad Example - Connection leak and missing error handling
import { Pool } from "pg";
const pool = new Pool({ connectionString: process.env.DATABASE_URL });
async function createUser(data) {
const client = await pool.connect();
await client.query("INSERT INTO users (name) VALUES ($1)", [data.name]);
// Missing client.release() - connection leaked!
// Missing error handling - transaction left open on failure
}
```
**Why bad:** Missing `client.release()` causes connection pool exhaustion, no try/catch means failed queries leave connections checked out forever, no transaction handling means partial writes possible
---
### Pattern 2: N+1 Query Prevention
The N+1 problem occurs when fetching N records triggers N additional queries for related data. Use eager loading or batching instead.
#### Eager Loading with ORM
```typescript
// Good Example - Single query with eager loading (Drizzle)
import { db } from "./database";
import { and, eq, isNull } from "drizzle-orm";
async function getJobsWithCompanies() {
// Single SQL query fetches jobs + companies + locations
const jobs = await db.query.jobs.findMany({
where: and(eq(jobs.isActive, true), isNull(jobs.deletedAt)),
with: {
company: {
with: {
locations: true,
},
},
jobSkills: {
with: {
skill: true,
},
},
},
});
return jobs;
}
```
**Why good:** `.with()` generates a single SQL query with JOINs, eliminates N+1 problem entirely, fully typed result prevents runtime errors
```typescript
// Bad Example - N+1 query anti-pattern
async function getJobsWithCompanies() {
const jobs = await db.query.jobs.findMany({
where: eq(jobs.isActive, true),
});
// N+1: One query per job to get company!
for (const job of jobs) {
job.company = await db.query.companies.findFirst({
where: eq(companies.id, job.companyId),
});
}
return jobs;
}
```
**Why bad:** 1 query for jobs + N queries for companies = N+1 total queries, latency grows linearly with data size, database gets hammered with many small queries
#### DataLoader Pattern (GraphQL/REST)
```typescript
// Good Example - DataLoader for batching
import DataLoader from "dataloader";
// Create loader once per request (caches within request lifecycle)
function createCompanyLoader() {
return new DataLoader<string, Company>(async (companyIds) => {
// Single query for all requested companies
const companies = await db.query.companies.findMany({
where: inArray(companies.id, [...companyIds]),
});
// Return in same order as requested IDs
const companyMap = new Map(companies.map((c) => [c.id, c]));
return companyIds.map((id) => companyMap.get(id) ?? null);
});
}
// Usage in resolver or handler
async function resolveJob(job: Job, context: Context) {
// DataLoader batches all .load() calls in same tick
const company = await context.loaders.company.load(job.companyId);
return { ...job, company };
}
```
**Why good:** DataLoader batches multiple `.load()` calls into single query, caches results within request preventing duplicate fetches, works with any data source (DB, API, etc.)
---
### Pattern 3: Database Indexing
Indexes speed up queries by avoiding full table scans. Index columns used in WHERE, JOIN, and ORDER BY clauses.
#### Index Strategies
```typescript
// Good Example - Strategic indexes (Drizzle schema)
import {
pgTable,
uuid,
varchar,
timestamp,
boolean,
index,
} from "drizzle-orm/pg-core";
import { sql } from "drizzle-orm";
export const jobs = pgTable(
"jobs",
{
id: uuid("id").primaryKey().defaultRandom(),
companyId: uuid("company_id").notNull(),
title: varchar("title", { length: 255 }).notNull(),
country: varchar("country", { length: 100 }),
employmentType: varchar("employment_type", { length: 50 }),
isActive: boolean("is_active").default(true),
createdAt: timestamp("created_at").defaultNow(),
deletedAt: timestamp("deleted_at"),
},
(table) => ({
// Composite index for common filter combination
countryEmploymentIdx: index("jobs_country_employment_idx").on(
table.country,
table.employmentType,
),
// Partial index - only indexes active non-deleted jobs
activeJobsIdx: index("jobs_active_idx")
.on(table.isActive, table.createdAt)
.where(sql`${table.deletedAt} IS NULL`),
// Foreign key index for JOIN performance
companyIdIdx: index("jobs_company_id_idx").on(table.companyId),
// Text search index
titleIdx: index("jobs_title_idx").on(table.title),
}),
);
```
**Why good:** Composite index matches common query patterns (country + employmentType), partial index reduces index size by excluding deleted records, foreign key index speeds up JOINs, column order in composite index matches query filter order
**Index Decision Framework:**
| Column Usage | Index Type | When to Use |
| --------------------------- | ---------------- | ---------------------------------------------- |
| WHERE equality | B-tree (default) | High-selectivity columns |
| WHERE range (>, <, BETWEEN) | B-tree | Date ranges, numeric ranges |
| WHERE multiple columns | Composite | Queries always filter by same columns together |
| WHERE on subset | Partial | Most queries filter on active/non-deleted |
| Full-text search | GIN/GiST | Text search with LIKE, tsvector |
| JSON field access | GIN | JSONB column queries |
</patterns>
---
## Additional Patterns
The following patterns are documented with full examples in [examples/](examples/):
- **Redis Caching** - Cache-aside, write-through, invalidation, TTL - see [caching.md](examples/caching.md)
- **Prepared Statements** - Reuse query plans for repeated queries - see [database.md](examples/database.md)
- **Event Loop Optimization** - Async patterns, setImmediate chunking - see [async.md](examples/async.md)
- **Worker Threads** - CPU-bound operations without blocking - see [async.md](examples/async.md)
---
<critical_reminders>
## CRITICAL REMINDERS
> **All code must follow project conventions in CLAUDE.md**
**(You MUST always release database connections back to the pool using `finally` blocks)**
**(You MUST use `.with()` or eager loading to prevent N+1 queries - never lazy load in loops)**
**(You MUST set TTL on all cached data to prevent stale data and memory exhaustion)**
**(You MUST offload CPU-intensive work to Worker Threads - blocking the event loop degrades all requests)**
**Failure to follow these rules will cause connection pool exhaustion, N+1 performance degradation, memory leaks from unbounded caches, and blocked event loops affecting all concurrent requests.**
</critical_reminders>Related Skills
eds-performance-debugging
Guide for debugging and performance optimization of EDS blocks including error handling, FOUC prevention, Core Web Vitals optimization, and debugging workflows for Adobe Edge Delivery Services.
analyze-pr-performance
Analyze code review pipeline performance for a specific PR. Use when investigating slow PRs, identifying bottlenecks, or debugging performance issues in code reviews.
1k-performance
Performance optimization guidelines for OneKey React/React Native applications. Use when optimizing app performance, fixing UI freezes/lag, reducing re-renders, handling concurrent operations, or analyzing performance bottlenecks. Triggers on performance, optimization, optimize, slow, lag, freeze, hang, jank, stutter, memory, leak, concurrent, batching, batch, memoization, memo, bridge, windowSize, contentVisibility, FlashList, re-render, fps, tti, bundle.
Content Performance Explainer
Diagnose and explain why e-commerce content is or isn't performing against KPIs, using causal analysis frameworks, funnel decomposition, and competitive benchmarking to generate actionable improvement recommendations.
performance-analytics
Analyze marketing performance with key metrics, trend analysis, and optimization recommendations. Use when building performance reports, reviewing campaign results, analyzing channel metrics (email, social, paid, SEO), or identifying what's working and what needs improvement.
visualiser-performance
React Flow performance rules and review checklist for the @eventcatalog/visualiser package. Automatically applies when making changes to any file under packages/visualiser/. Use this skill to audit, review, or implement visualiser code with performance in mind.
spring-boot-performance
Guide for optimizing Spring Boot application performance including caching, pagination, async processing, and JPA optimization. Use this when addressing performance issues or implementing high-traffic features.
PostgreSQL Performance Optimization
Production-grade PostgreSQL query optimization, indexing strategies, performance tuning, and modern features including pgvector for AI/ML workloads. Master EXPLAIN plans, query analysis, and database design for high-performance applications
performance
Optimize web performance for faster loading and better user experience. Use when asked to "speed up my site", "optimize performance", "reduce load time", "fix slow loading", "improve page speed", or "performance audit".
performance-profiling
Performance profiling principles. Measurement, analysis, and optimization techniques.
performance-optimizer
Performance analysis, profiling techniques, bottleneck identification, and optimization strategies for code and systems. Use when the user needs to improve performance, reduce resource usage, or identify and fix performance bottlenecks.
godot-profile-performance
Detects performance bottlenecks in Godot projects including expensive _process functions, get_node() calls in loops, instantiations in _process, and provides optimization suggestions with Godot profiler integration