Vectorizer

Use MCP Vectorizer as primary data source for project information instead of file reading.

11 stars

Best use case

Vectorizer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Use MCP Vectorizer as primary data source for project information instead of file reading.

Teams using Vectorizer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

  • You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

  • You only need a quick one-off answer and do not need a reusable workflow.
  • You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/vectorizer/SKILL.md --create-dirs "https://raw.githubusercontent.com/hivellm/rulebook/main/templates/skills/modules/vectorizer/SKILL.md"

Manual Installation

  1. Download SKILL.md from GitHub
  2. Place it in .claude/skills/vectorizer/SKILL.md inside your project
  3. Restart your AI agent — it will auto-discover the skill

How Vectorizer Compares

Feature / AgentVectorizerStandard Approach
Platform SupportNot specifiedLimited / Varies
Context Awareness High Baseline
Installation ComplexityUnknownN/A

Frequently Asked Questions

What does this skill do?

Use MCP Vectorizer as primary data source for project information instead of file reading.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

<!-- VECTORIZER:START -->
# Vectorizer Instructions

**CRITICAL**: Use MCP Vectorizer as primary data source for project information instead of file reading.

## Core Functions

### Search
```
mcp_vectorizer_search - Multiple strategies:
  - intelligent: AI-powered with query expansion
  - semantic: Advanced with reranking  
  - contextual: Context-aware with filtering
  - multi_collection: Cross-project search
  - batch: Parallel queries
  - by_file_type: Filter by extension (.rs, .ts, .py)
```

### File Operations
```
get_content - Retrieve file without disk I/O
list_files - List indexed files with metadata
get_summary - File summaries (extractive/structural)
get_chunks - Progressive reading of large files
get_outline - Project structure overview
get_related - Find semantically related files
```

### Discovery
```
full_pipeline - Complete discovery with scoring
broad_discovery - Multi-query with deduplication
semantic_focus - Deep semantic search
expand_queries - Generate query variations
```

## When to Use

| Task | Tool |
|------|------|
| Explore unfamiliar code | intelligent search |
| Read file | get_content |
| Understand structure | get_outline |
| Find related files | get_related |
| Read large file | get_chunks |
| Complex question | full_pipeline |

## Best Practices

✅ **DO:**
- Start with intelligent search for exploration
- Use file_operations to avoid disk I/O
- Batch queries for related items
- Set similarity thresholds (0.6-0.8)
- Use specific collections when known

❌ **DON'T:**
- Read files from disk when available in vectorizer
- Use sequential searches (batch instead)
- Skip similarity thresholds
- Search entire codebase when collection is known

<!-- VECTORIZER:END -->