vector-index-tuning

Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.


by diegosouzapw


Best use case

vector-index-tuning is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Teams using vector-index-tuning should expect more consistent output, faster repeated execution, and less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$ curl -o ~/.claude/skills/vector-index-tuning/SKILL.md --create-dirs "https://raw.githubusercontent.com/diegosouzapw/awesome-omni-skill/main/skills/ai-agents/vector-index-tuning/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/vector-index-tuning/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How vector-index-tuning Compares

Feature / Agent	vector-index-tuning	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Optimize vector index performance for latency, recall, and memory. Use when tuning HNSW parameters, selecting quantization strategies, or scaling vector search infrastructure.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

# Vector Index Tuning

Guide to optimizing vector indexes for production performance.

## Use this skill when

- Tuning HNSW parameters
- Implementing quantization
- Optimizing memory usage
- Reducing search latency
- Balancing recall vs speed
- Scaling to billions of vectors
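
As a rough illustration of how memory and quantization interact, the sketch below estimates index size from the vector count, the dimension, and the HNSW `M` parameter. It assumes the common rule of thumb of about `2 * M` four-byte neighbor links per vector on the base layer and ignores upper layers, metadata, and allocator overhead. The function name and the example workload (1M vectors, 768 dimensions, `M = 16`) are hypothetical.

```python
def index_memory_bytes(num_vectors: int, dim: int, m: int,
                       bytes_per_component: float = 4.0) -> int:
    """Rough HNSW memory estimate: stored vectors plus base-layer links.

    Assumption: about 2 * m neighbor links per vector at 4 bytes each.
    Upper layers, metadata, and allocator overhead are ignored.
    """
    vector_bytes = dim * bytes_per_component
    link_bytes = m * 2 * 4
    return int(num_vectors * (vector_bytes + link_bytes))


# Hypothetical workload, for illustration only.
n, d, m = 1_000_000, 768, 16
float32_total = index_memory_bytes(n, d, m)  # full-precision vectors
int8_total = index_memory_bytes(n, d, m, bytes_per_component=1.0)  # 8-bit scalar quantization

print(f"float32: {float32_total / 1e9:.2f} GB")
print(f"int8:    {int8_total / 1e9:.2f} GB")
```

Under these assumptions, 8-bit scalar quantization cuts the estimate from about 3.2 GB to about 0.9 GB. Treat the result as a starting point for a memory budget and measure the real index before committing to it.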

## Do not use this skill when

- You only need exact search on small datasets (use a flat index)
- You lack workload metrics or ground truth to validate recall
- You need end-to-end retrieval system design beyond index tuning
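
For reference, a flat index is an exhaustive scan over every stored vector. The minimal sketch below uses only the standard library and L2 distance; the function names and sample vectors are made up for illustration. The same exact results can later serve as ground truth when measuring the recall of an approximate index.

```python
import heapq
import math


def l2_distance(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def flat_search(vectors, query, k):
    """Return the ids of the k nearest vectors using an exhaustive scan."""
    scored = ((l2_distance(vec, query), idx) for idx, vec in enumerate(vectors))
    return [idx for _, idx in heapq.nsmallest(k, scored)]


# Made-up sample data.
vectors = [[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [3.0, 3.0]]
top2 = flat_search(vectors, query=[0.9, 0.1], k=2)
print(top2)
```

Each query costs time linear in the number of vectors, which is why this approach suits small datasets only.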

## Instructions

1. Gather workload targets (latency, recall, QPS), data size, and memory budget.
2. Choose an index type and establish a baseline with default parameters.
3. Benchmark parameter sweeps using real queries and track recall, latency, and memory.
4. Validate changes on a staging dataset before rolling out to production.

Refer to `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.
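
One possible way to turn steps 1 to 3 into code is sketched below: compute recall@k against exact ground truth, record each swept configuration, then keep the lowest-latency configuration that meets every target. The `SweepResult` fields, the `ef_search` parameter name, and all numbers are hypothetical placeholders, not measurements or part of the playbook.

```python
from dataclasses import dataclass


def recall_at_k(retrieved, ground_truth, k):
    """Fraction of the true top-k ids that appear in the retrieved top-k."""
    truth = set(ground_truth[:k])
    if not truth:
        return 0.0
    return len(truth & set(retrieved[:k])) / len(truth)


@dataclass
class SweepResult:
    params: dict
    recall: float
    p95_latency_ms: float
    memory_gb: float


def pick_config(results, min_recall, max_latency_ms, max_memory_gb):
    """Return the lowest-latency result that meets every target, or None."""
    feasible = [
        r for r in results
        if r.recall >= min_recall
        and r.p95_latency_ms <= max_latency_ms
        and r.memory_gb <= max_memory_gb
    ]
    return min(feasible, key=lambda r: r.p95_latency_ms, default=None)


# Placeholder numbers for illustration only; replace with real benchmark output.
results = [
    SweepResult({"ef_search": 32}, recall=0.91, p95_latency_ms=2.0, memory_gb=3.2),
    SweepResult({"ef_search": 64}, recall=0.96, p95_latency_ms=3.5, memory_gb=3.2),
    SweepResult({"ef_search": 128}, recall=0.985, p95_latency_ms=6.8, memory_gb=3.2),
]
best = pick_config(results, min_recall=0.95, max_latency_ms=10.0, max_memory_gb=4.0)
print(best.params if best else "no configuration meets the targets")
```

Returning `None` when nothing is feasible keeps the decision explicit: either relax a target or try a different index type instead of shipping a configuration that misses the workload goals.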

## Safety

- Avoid reindexing in production without a rollback plan.
- Validate changes under realistic load before applying globally.
- Track recall regressions and revert if quality drops.
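
A recall regression check can be as simple as comparing mean per-query recall before and after a change. The sketch below assumes both lists were measured on the same query set; the function name and the `max_drop` threshold of 0.01 are hypothetical and should be set from your own quality targets.

```python
from statistics import fmean


def should_revert(baseline_recalls, candidate_recalls, max_drop=0.01):
    """Return True when mean recall fell by more than max_drop.

    Assumption: both lists hold per-query recall for the same queries.
    """
    drop = fmean(baseline_recalls) - fmean(candidate_recalls)
    return drop > max_drop


# Placeholder values for illustration only.
print(should_revert([0.96, 0.97, 0.95], [0.93, 0.94, 0.92]))
print(should_revert([0.96, 0.97, 0.95], [0.96, 0.965, 0.95]))
```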

## Resources

- `resources/implementation-playbook.md` for detailed patterns, checklists, and templates.

Related Skills

aav-vector-design-agent

from diegosouzapw/awesome-omni-skill

AI-powered adeno-associated virus (AAV) vector design for gene therapy including capsid engineering, promoter selection, and tropism optimization.

assembly-index

from diegosouzapw/awesome-omni-skill

Lee Cronin's Assembly Theory for molecular complexity measurement and

langchain4j-vector-stores-configuration

from diegosouzapw/awesome-omni-skill

Configure LangChain4J vector stores for RAG applications. Use when building semantic search, integrating vector databases (PostgreSQL/pgvector, Pinecone, MongoDB, Milvus, Neo4j), implementing embedding storage/retrieval, setting up hybrid search, or optimizing vector database performance for production AI applications.

vector-database-engineer

from diegosouzapw/awesome-omni-skill

Expert in vector databases, embedding strategies, and semantic search implementation. Masters Pinecone, Weaviate, Qdrant, Milvus, and pgvector for RAG applications, recommendation systems, and similar

vector-art

from diegosouzapw/awesome-omni-skill

Vector art assets (characters, objects, scenes) sources for SVG/Canvas and how to animate them

llamaindex

from diegosouzapw/awesome-omni-skill

Data framework for building LLM applications with RAG. Specializes in document ingestion (300+ connectors), indexing, and querying. Features vector indices, query engines, agents, and multi-modal support. Use for document Q&A, chatbots, knowledge retrieval, or building RAG pipelines. Best for data-centric LLM applications.

ai-tuning

from diegosouzapw/awesome-omni-skill

Optimize AI assistant configurations for maximum effectiveness. USE THIS SKILL when user says "improve CLAUDE.md", "better copilot instructions", "tune AI", "optimize prompts", "MCP configuration", or wants to enhance AI assistant behavior.

agentuity-cli-cloud-vector-upsert

from diegosouzapw/awesome-omni-skill

Add or update vectors in the vector storage. Requires authentication. Use for Agentuity cloud platform operations

agentuity-cli-cloud-vector-stats

from diegosouzapw/awesome-omni-skill

Get statistics for vector storage. Requires authentication. Use for Agentuity cloud platform operations

agentuity-cli-cloud-vector-search

from diegosouzapw/awesome-omni-skill

Search for vectors using semantic similarity. Requires authentication. Use for Agentuity cloud platform operations

agentuity-cli-cloud-vector-list-namespaces

from diegosouzapw/awesome-omni-skill

List all vector namespaces. Requires authentication. Use for Agentuity cloud platform operations

agentuity-cli-cloud-vector-get

from diegosouzapw/awesome-omni-skill

Get a specific vector entry by key. Requires authentication. Use for Agentuity cloud platform operations