About Claude Context

Claude Context (zilliztech/claude-context) is a Model Context Protocol server that solves the most common Claude Code cost problem: loading entire file trees into context on every request. Instead, Claude Context indexes your codebase into Zilliz Cloud's vector database and uses hybrid search (BM25 + dense vector embeddings) to retrieve only the code relevant to each specific request. The result: ~40% token reduction with equivalent or better retrieval quality. Three packages: claude-context-core (indexing engine), a VSCode extension (semantic code search), and claude-context-mcp (MCP server for Claude Code integration). Works with any codebase size — from small projects to multi-million-line enterprise repos.

Key Features

MCP server for Claude Code — native protocol integration

~40% token reduction — same retrieval quality, lower cost

Hybrid search: BM25 + dense vector embeddings

Indexes entire codebase into Zilliz Cloud vector database

Works at any codebase scale — from small projects to enterprise repos

VSCode extension for semantic code search

Monorepo: core, MCP server, and VSCode extension as separate packages

Natural language queries against your indexed codebase

Overview

Claude Context is Zilliz’s solution to the “full codebase in context” problem that drives up Claude Code costs at scale. The pattern is familiar: every Claude Code request loads a broad set of files as context, even when the actual task touches a small subset of code. At $15/M output tokens for Opus, those wasted tokens add up.

Claude Context turns your codebase into a vector database and serves as a Model Context Protocol server. Each request queries the index with hybrid search and loads only the code actually relevant to the task. Zilliz, the company behind Milvus (the most widely-deployed open-source vector database), brings production-grade vector infrastructure to the problem.

Technical Architecture

Indexing engine (claude-context-core): Parses your codebase, generates embeddings for code chunks, and stores them in Zilliz Cloud with BM25 keyword indices alongside dense vector indices. Indexing happens once; updates are incremental.

MCP server (claude-context-mcp): Exposes a search_code tool that Claude Code calls automatically when it needs codebase context. Claude asks “find the authentication middleware” and the MCP server returns the three most relevant files instead of the entire auth directory.

VSCode extension: Provides semantic code search in the editor — useful for navigation and review independently of Claude Code integration.

Token Reduction Mechanics

The ~40% reduction comes from replacing broad directory loads with targeted retrieval. A request that previously loaded 50 files to find the relevant 5 now loads 5 files directly. The retrieval quality is maintained or improved because hybrid search (keyword + semantic) is more precise than file-tree heuristics.

Setup

# Install the MCP server
npm install -g @zilliz/claude-context-mcp

# Index your codebase (requires Zilliz Cloud account)
npx claude-context-core index --project-path ./

# Add to Claude Code's MCP config
# claude_mcp_config.json:
# { "mcpServers": { "claude-context": { "command": "claude-context-mcp" } } }

Who Should Use It

Claude Context is most valuable for teams with large codebases (100K+ lines) where Claude Code is already in the daily workflow and token costs are measurable. Smaller projects may not see significant savings relative to the setup overhead. Requires a Zilliz Cloud account for the vector database backend.

Claude Context

About Claude Context

Key Features

Overview

Technical Architecture

Token Reduction Mechanics

Setup

Who Should Use It

Similar Agents

agent-native

Agent-Reach

agentmemory