AIndex
Download free
ProductProductLocal index, MCP tools, memoryProofMeasured local indexing and latencyIntegrationsCodex, Claude Code, Cursor, Windsurf, VS Code, ZedSecurityLocal-first boundary and supply chain
PricingPricingFree, Advanced, Pro, Team, Lifetime, EnterpriseEnterpriseOffline license, procurement and rolloutFAQRefunds, billing, install and support answers
UseDocsInstall, index and connect agentsDownloadsArchives, VSIX, Homebrew and verificationAccountPaid license status, seats and billing portalSupportStructured help for install, billing and agents
ReleaseInstall script/install.shRelease manifest/releases/latest/release.jsonSBOM/releases/latest/sbom.cdx.jsonProvenance/releases/latest/provenance.json

Proof

Up to 80% fewer tokens — measured, not promised.

80.6% median token reduction across 7 real repos (5,287 files), reproducible from a single command. Focused context in ~15ms. Re-run the numbers yourself.

Measured proofindex → search → context
−80.6%fewer tokens per task (median)
15,312 → 2,502median tokens per measured task
37 msmedian context query
12central-symbol tasks measured on AiIndexer

Beyond context

AIndex does more than fetch context.

Same graph-context core, plus a full audit & analytics layer vexp doesn't ship. Based on vexp's advertised features (vexp.dev).

CapabilityAIndexvexp
Graph-aware context capsule
Impact / blast-radius analysis
Persistent cross-session memory
Execution-path tracing
Dead-code detection
Duplicate-code detection
Code smells & audits
Finds your most critical code
Token-savings analytics
Offline / air-gapped license

vexp numbers: vexp.dev and the vexp-swe-bench repo (they also report 73% SWE-bench pass@1, and support more languages/agents). Methodologies differ — a same-harness head-to-head is on our roadmap. · vexp.dev · vexp-swe-bench

ProjectFilesTokens savedQuery
AiIndexer33182.5%37 ms
matchr25387%41 ms
crates7083%14 ms
poisson1,38081%73 ms
monsite19278%2 ms
MyRoadTrip1,11065%81 ms
salecast1,95154%10 ms
Total / median5,28780.6%

Token reduction = AIndex's focused context capsule vs reading the files that contain the search hits in full, measured over each repo's most central symbols at a 32k budget. The proof also checks compression quality: top hit inclusion, top-candidate symbol coverage and top-candidate file coverage.

Compare

Pasting code into chat isn't a context strategy.

AIndex isn't another chat box. It's the local context layer that feeds your existing agent exactly the connected code each task needs.

How you give context todayUpsideCatchWith AIndex
Copy-paste by handFast for a single lineMisses everything the code connects toReturns the connected code automatically.
Keyword / text searchFinds stringsIgnores how the code is wiredRanks by the dependency graph, not just text.
Cloud / hosted code indexCentralizedUploads your source to a serverStays 100% local by default.
Dump whole filesEasyBloats the context window (20–50k tokens)Sends a tiered capsule — typically 4–8k tokens.
Embedding RAG (vector search)Semantic-ish recallFuzzy matches; no real call/usage graphRanks on the real dependency graph and shows impact — not just similarity.
IDE go-to-definitionPrecise jumpsOne symbol at a time; no task-level contextAssembles the whole task capsule in a single MCP call.
Other MCP code indexersRanked context, localOften cap daily calls and paywall graph tools (callers, impact); their “semantic” is shallow keyword/TF-IDFGraph navigation — references, callers, impact — included on Free. Unlimited calls. Exact, graph-resolved results.
Manual reference huntingWorks in your IDEYour AI agent can’t do it — it falls back to grep (misses indirect calls, hits comments/strings)Gives the agent Find-All-References, callers/callees and go-to-definition with exact file:line.

Honest by default

Every number ties back to a command.

Each benchmark is generated by the binary on a local run — repo size, symbols, edges and re-index speed, with nothing massaged by hand.

ai-indexer-mcp proof --repo ~/Documents/matchr --json
Open raw JSON

Latest run

The exact JSON, straight from the tool.

Loading benchmark data…

Honest limits

What these numbers do and don't say.

These are local indexing and re-index timings on a single machine, not a hosted benchmark. Your figures depend on repo size, language mix and disk speed — run the command on your own code to see real numbers.