cohere-sdk-patterns
Apply production-ready Cohere SDK patterns for TypeScript and Python. Use when implementing Cohere integrations, refactoring SDK usage, or establishing team coding standards for Cohere API v2. Trigger with phrases like "cohere SDK patterns", "cohere best practices", "cohere code patterns", "idiomatic cohere", "cohere wrapper".
Best use case
cohere-sdk-patterns is best used when you need a repeatable AI agent workflow instead of a one-off prompt.
Apply production-ready Cohere SDK patterns for TypeScript and Python. Use when implementing Cohere integrations, refactoring SDK usage, or establishing team coding standards for Cohere API v2. Trigger with phrases like "cohere SDK patterns", "cohere best practices", "cohere code patterns", "idiomatic cohere", "cohere wrapper".
Teams using cohere-sdk-patterns should expect a more consistent output, faster repeated execution, less prompt rewriting.
When to use this skill
- You want a reusable workflow that can be run more than once with consistent structure.
When not to use this skill
- You only need a quick one-off answer and do not need a reusable workflow.
- You cannot install or maintain the underlying files, dependencies, or repository context.
Installation
Claude Code / Cursor / Codex
Manual Installation
- Download SKILL.md from GitHub
- Place it in
.claude/skills/cohere-sdk-patterns/SKILL.mdinside your project - Restart your AI agent — it will auto-discover the skill
How cohere-sdk-patterns Compares
| Feature / Agent | cohere-sdk-patterns | Standard Approach |
|---|---|---|
| Platform Support | Not specified | Limited / Varies |
| Context Awareness | High | Baseline |
| Installation Complexity | Unknown | N/A |
Frequently Asked Questions
What does this skill do?
Apply production-ready Cohere SDK patterns for TypeScript and Python. Use when implementing Cohere integrations, refactoring SDK usage, or establishing team coding standards for Cohere API v2. Trigger with phrases like "cohere SDK patterns", "cohere best practices", "cohere code patterns", "idiomatic cohere", "cohere wrapper".
Where can I find the source code?
You can find the source code on GitHub using the link provided at the top of the page.
Related Guides
SKILL.md Source
# Cohere SDK Patterns
## Overview
Production-ready patterns for the `cohere-ai` TypeScript SDK (CohereClientV2) and Python `cohere` package. Real model names, real API shapes, real error types.
## Prerequisites
- `cohere-ai` v7+ installed (TypeScript) or `cohere` v5+ (Python)
- Familiarity with async/await patterns
- Understanding of Cohere API v2 endpoints
## Instructions
### Pattern 1: Singleton Client with Retry
```typescript
// src/cohere/client.ts
import { CohereClientV2, CohereError, CohereTimeoutError } from 'cohere-ai';
let instance: CohereClientV2 | null = null;
export function getCohere(): CohereClientV2 {
if (!instance) {
if (!process.env.CO_API_KEY) {
throw new Error('CO_API_KEY environment variable is required');
}
instance = new CohereClientV2({
token: process.env.CO_API_KEY,
});
}
return instance;
}
export async function withRetry<T>(
operation: () => Promise<T>,
maxRetries = 3,
baseDelayMs = 1000
): Promise<T> {
for (let attempt = 1; attempt <= maxRetries; attempt++) {
try {
return await operation();
} catch (err) {
if (attempt === maxRetries) throw err;
// Only retry on rate limits (429) and server errors (5xx)
if (err instanceof CohereError) {
const status = err.statusCode;
if (status && status !== 429 && status < 500) throw err;
} else if (!(err instanceof CohereTimeoutError)) {
throw err;
}
const delay = baseDelayMs * Math.pow(2, attempt - 1);
const jitter = Math.random() * 500;
await new Promise(r => setTimeout(r, delay + jitter));
}
}
throw new Error('Unreachable');
}
```
### Pattern 2: Type-Safe Chat Wrapper
```typescript
// src/cohere/chat.ts
import { getCohere, withRetry } from './client';
interface ChatOptions {
message: string;
systemPrompt?: string;
model?: string;
maxTokens?: number;
temperature?: number;
documents?: Array<{ id?: string; data: Record<string, string> }>;
}
export async function chat(options: ChatOptions): Promise<string> {
const cohere = getCohere();
const messages: Array<{ role: string; content: string }> = [];
if (options.systemPrompt) {
messages.push({ role: 'system', content: options.systemPrompt });
}
messages.push({ role: 'user', content: options.message });
const response = await withRetry(() =>
cohere.chat({
model: options.model ?? 'command-a-03-2025',
messages,
maxTokens: options.maxTokens,
temperature: options.temperature,
documents: options.documents,
})
);
return response.message?.content?.[0]?.text ?? '';
}
```
### Pattern 3: Streaming Chat
```typescript
// src/cohere/stream.ts
export async function* streamChat(
message: string,
model = 'command-a-03-2025'
): AsyncGenerator<string> {
const cohere = getCohere();
const stream = await cohere.chatStream({
model,
messages: [{ role: 'user', content: message }],
});
for await (const event of stream) {
if (event.type === 'content-delta') {
const text = event.delta?.message?.content?.text;
if (text) yield text;
}
}
}
// Usage
for await (const chunk of streamChat('Explain RAG in 3 sentences')) {
process.stdout.write(chunk);
}
```
### Pattern 4: Batch Embedding
```typescript
// src/cohere/embed.ts
type InputType = 'search_document' | 'search_query' | 'classification' | 'clustering';
export async function embedTexts(
texts: string[],
inputType: InputType = 'search_document',
model = 'embed-v4.0'
): Promise<number[][]> {
const cohere = getCohere();
// Cohere embed accepts up to 96 texts per call
const BATCH_SIZE = 96;
const allEmbeddings: number[][] = [];
for (let i = 0; i < texts.length; i += BATCH_SIZE) {
const batch = texts.slice(i, i + BATCH_SIZE);
const response = await withRetry(() =>
cohere.embed({
model,
texts: batch,
inputType,
embeddingTypes: ['float'],
})
);
allEmbeddings.push(...response.embeddings.float);
}
return allEmbeddings;
}
```
### Pattern 5: Rerank with Type Safety
```typescript
// src/cohere/rerank.ts
interface RerankResult {
text: string;
score: number;
originalIndex: number;
}
export async function rerankDocuments(
query: string,
documents: string[],
topN = 5,
model = 'rerank-v3.5'
): Promise<RerankResult[]> {
const cohere = getCohere();
const response = await withRetry(() =>
cohere.rerank({ model, query, documents, topN })
);
return response.results.map(r => ({
text: documents[r.index],
score: r.relevanceScore,
originalIndex: r.index,
}));
}
```
### Pattern 6: Structured JSON Output
```typescript
export async function chatJSON<T>(
message: string,
schema?: Record<string, unknown>
): Promise<T> {
const cohere = getCohere();
const response = await cohere.chat({
model: 'command-a-03-2025',
messages: [{ role: 'user', content: `${message}\n\nRespond in valid JSON.` }],
responseFormat: schema
? { type: 'json_object', jsonSchema: schema }
: { type: 'json_object' },
});
const text = response.message?.content?.[0]?.text ?? '{}';
return JSON.parse(text) as T;
}
```
## Python Equivalents
```python
import cohere
from cohere import ClientV2
# Singleton
_client: ClientV2 | None = None
def get_cohere() -> ClientV2:
global _client
if _client is None:
_client = ClientV2() # reads CO_API_KEY
return _client
# Chat
def chat(message: str, model: str = "command-a-03-2025") -> str:
co = get_cohere()
response = co.chat(
model=model,
messages=[{"role": "user", "content": message}],
)
return response.message.content[0].text
# Embed
def embed(texts: list[str], input_type: str = "search_document") -> list[list[float]]:
co = get_cohere()
response = co.embed(
model="embed-v4.0",
texts=texts,
input_type=input_type,
embedding_types=["float"],
)
return response.embeddings.float
```
## Error Handling
| Error Type | When | Recovery |
|------------|------|----------|
| `CohereError` (status 400) | Bad request params | Fix request, do not retry |
| `CohereError` (status 401) | Invalid API key | Check CO_API_KEY |
| `CohereError` (status 429) | Rate limited | Retry with backoff |
| `CohereError` (status 5xx) | Server error | Retry with backoff |
| `CohereTimeoutError` | Network timeout | Retry with backoff |
## Resources
- [Cohere TypeScript SDK](https://github.com/cohere-ai/cohere-typescript)
- [Cohere Python SDK](https://github.com/cohere-ai/cohere-python)
- [API v2 Reference](https://docs.cohere.com/reference/about)
## Next Steps
Apply patterns in `cohere-core-workflow-a` for RAG workflows.Related Skills
exa-sdk-patterns
Apply production-ready exa-js SDK patterns with type safety, singletons, and wrappers. Use when implementing Exa integrations, refactoring SDK usage, or establishing team coding standards for Exa. Trigger with phrases like "exa SDK patterns", "exa best practices", "exa code patterns", "idiomatic exa", "exa wrapper".
exa-reliability-patterns
Implement Exa reliability patterns: query fallback chains, circuit breakers, and graceful degradation. Use when building fault-tolerant Exa integrations, implementing fallback strategies, or adding resilience to production search services. Trigger with phrases like "exa reliability", "exa circuit breaker", "exa fallback", "exa resilience", "exa graceful degradation".
evernote-sdk-patterns
Advanced Evernote SDK patterns and best practices. Use when implementing complex note operations, batch processing, search queries, or optimizing SDK usage. Trigger with phrases like "evernote sdk patterns", "evernote best practices", "evernote advanced", "evernote batch operations".
elevenlabs-sdk-patterns
Apply production-ready ElevenLabs SDK patterns for TypeScript and Python. Use when implementing ElevenLabs integrations, refactoring SDK usage, or establishing team coding standards for audio AI applications. Trigger: "elevenlabs SDK patterns", "elevenlabs best practices", "elevenlabs code patterns", "idiomatic elevenlabs", "elevenlabs typescript".
documenso-sdk-patterns
Apply production-ready Documenso SDK patterns for TypeScript and Python. Use when implementing Documenso integrations, refactoring SDK usage, or establishing team coding standards for Documenso. Trigger with phrases like "documenso SDK patterns", "documenso best practices", "documenso code patterns", "idiomatic documenso".
deepgram-sdk-patterns
Apply production-ready Deepgram SDK patterns for TypeScript and Python. Use when implementing Deepgram integrations, refactoring SDK usage, or establishing team coding standards for Deepgram. Trigger: "deepgram SDK patterns", "deepgram best practices", "deepgram code patterns", "idiomatic deepgram", "deepgram typescript".
databricks-sdk-patterns
Apply production-ready Databricks SDK patterns for Python and REST API. Use when implementing Databricks integrations, refactoring SDK usage, or establishing team coding standards for Databricks. Trigger with phrases like "databricks SDK patterns", "databricks best practices", "databricks code patterns", "idiomatic databricks".
customerio-sdk-patterns
Apply production-ready Customer.io SDK patterns. Use when implementing typed clients, retry logic, event batching, or singleton management for customerio-node. Trigger: "customer.io best practices", "customer.io patterns", "production customer.io", "customer.io architecture", "customer.io singleton".
customerio-reliability-patterns
Implement Customer.io reliability and fault-tolerance patterns. Use when building circuit breakers, fallback queues, idempotency, or graceful degradation for Customer.io integrations. Trigger: "customer.io reliability", "customer.io resilience", "customer.io circuit breaker", "customer.io fault tolerance".
coreweave-sdk-patterns
Production-ready patterns for CoreWeave GPU workload management with kubectl and Python. Use when building inference clients, managing GPU deployments programmatically, or creating reusable CoreWeave deployment templates. Trigger with phrases like "coreweave patterns", "coreweave client", "coreweave Python", "coreweave deployment template".
cohere-webhooks-events
Implement Cohere streaming event handling, SSE patterns, and connector webhooks. Use when building streaming UIs, handling chat/tool events, or registering Cohere connectors for RAG. Trigger with phrases like "cohere streaming", "cohere events", "cohere SSE", "cohere connectors", "cohere webhook".
cohere-upgrade-migration
Migrate from Cohere API v1 to v2 and upgrade SDK versions. Use when upgrading cohere-ai SDK, migrating from CohereClient to CohereClientV2, or handling breaking changes between API versions. Trigger with phrases like "upgrade cohere", "cohere migration", "cohere v1 to v2", "update cohere SDK", "cohere breaking changes".