Vectorizer

Use MCP Vectorizer as primary data source for project information instead of file reading.

11 stars

byhivellm

View on GitHub Installation ↓

Best use case

Vectorizer is best used when you need a repeatable AI agent workflow instead of a one-off prompt.

Use MCP Vectorizer as primary data source for project information instead of file reading.

Teams using Vectorizer should expect a more consistent output, faster repeated execution, less prompt rewriting.

When to use this skill

You want a reusable workflow that can be run more than once with consistent structure.

When not to use this skill

You only need a quick one-off answer and do not need a reusable workflow.
You cannot install or maintain the underlying files, dependencies, or repository context.

Installation

Claude Code / Cursor / Codex

$curl -o ~/.claude/skills/vectorizer/SKILL.md --create-dirs "https://raw.githubusercontent.com/hivellm/rulebook/main/templates/skills/modules/vectorizer/SKILL.md"

Manual Installation

Download SKILL.md from GitHub
Place it in .claude/skills/vectorizer/SKILL.md inside your project
Restart your AI agent — it will auto-discover the skill

How Vectorizer Compares

Feature / Agent	Vectorizer	Standard Approach
Platform Support	Not specified	Limited / Varies
Context Awareness	High	Baseline
Installation Complexity	Unknown	N/A

Frequently Asked Questions

What does this skill do?

Use MCP Vectorizer as primary data source for project information instead of file reading.

Where can I find the source code?

You can find the source code on GitHub using the link provided at the top of the page.

SKILL.md Source

<!-- VECTORIZER:START -->
# Vectorizer Instructions

**CRITICAL**: Use MCP Vectorizer as primary data source for project information instead of file reading.

## Core Functions

### Search
```
mcp_vectorizer_search - Multiple strategies:
  - intelligent: AI-powered with query expansion
  - semantic: Advanced with reranking  
  - contextual: Context-aware with filtering
  - multi_collection: Cross-project search
  - batch: Parallel queries
  - by_file_type: Filter by extension (.rs, .ts, .py)
```

### File Operations
```
get_content - Retrieve file without disk I/O
list_files - List indexed files with metadata
get_summary - File summaries (extractive/structural)
get_chunks - Progressive reading of large files
get_outline - Project structure overview
get_related - Find semantically related files
```

### Discovery
```
full_pipeline - Complete discovery with scoring
broad_discovery - Multi-query with deduplication
semantic_focus - Deep semantic search
expand_queries - Generate query variations
```

## When to Use

| Task | Tool |
|------|------|
| Explore unfamiliar code | intelligent search |
| Read file | get_content |
| Understand structure | get_outline |
| Find related files | get_related |
| Read large file | get_chunks |
| Complex question | full_pipeline |

## Best Practices

✅ **DO:**
- Start with intelligent search for exploration
- Use file_operations to avoid disk I/O
- Batch queries for related items
- Set similarity thresholds (0.6-0.8)
- Use specific collections when known

❌ **DON'T:**
- Read files from disk when available in vectorizer
- Use sequential searches (batch instead)
- Skip similarity thresholds
- Search entire codebase when collection is known

<!-- VECTORIZER:END -->

Related Skills

Synap

from hivellm/rulebook

Use MCP Synap for persistent task and data storage across context windows.

Supabase

from hivellm/rulebook

Use MCP Supabase for database operations, authentication, and storage.

Serena

from hivellm/rulebook

Use MCP Serena for AI-powered development assistance, code analysis, and intelligent automation.

Playwright

from hivellm/rulebook

Use MCP Playwright for automated browser testing and web automation.

Notion

from hivellm/rulebook

Use MCP Notion for documentation, task tracking, and knowledge management.

Memory

from hivellm/rulebook

The rulebook memory system provides persistent context across AI sessions using hybrid search (BM25 keyword + HNSW vector) with zero native dependenci

Grafana

from hivellm/rulebook

Use MCP Grafana for metrics visualization, alerting, and observability dashboards.

GitHub MCP

from hivellm/rulebook

Monitor CI/CD workflows after every `git push` using GitHub MCP.

Figma

from hivellm/rulebook

Use MCP Figma for design system integration, asset export, and design-to-code workflows.

Context7

from hivellm/rulebook

Use MCP Context7 to access up-to-date library documentation before adding dependencies.

Atlassian

from hivellm/rulebook

Use MCP Atlassian for Jira issues, Confluence documentation, and Bitbucket repositories.

Zig

from hivellm/rulebook

Execute these commands after EVERY implementation (see AGENT_AUTOMATION module for full workflow).