MCP Marketplace
BrowseHow It WorksFor CreatorsDocs
Sign inSign up
MCP Marketplace

The curated, security-first marketplace for AI tools.

Product

Browse ToolsSubmit a ToolDocumentationHow It WorksBlogFAQ

Legal

Terms of ServicePrivacy PolicyCommunity Guidelines

Connect

support@mcp-marketplace.ioTwitter / XDiscord

MCP Marketplace © 2026. All rights reserved.

Back to Browse

Deeprepo MCP Server

by Abhishek2432001
Developer ToolsLow Risk10.0MCP RegistryLocal
Free

Server data from the Official MCP Registry

Productivity-boosting RAG engine for codebases with multi-provider AI support and semantic search.

About

Productivity-boosting RAG engine for codebases with multi-provider AI support and semantic search.

Security Report

10.0
Low Risk10.0Low Risk

Valid MCP server (1 strong, 4 medium validity signals). No known CVEs in dependencies. Imported from the Official MCP Registry.

6 files analyzed · No issues found

Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.

Permissions Required

This plugin requests these system permissions. Most are normal for its category.

file_system

Check that this permission is expected for this type of plugin.

HTTP Network Access

Connects to external APIs or services over the internet.

env_vars

Check that this permission is expected for this type of plugin.

Documentation

View on GitHub

From the project's GitHub README.

DeepRepo — Local RAG Engine for Codebases

A production-grade Python library for performing RAG (Retrieval Augmented Generation) on local codebases. No heavy frameworks, no external vector DBs, no cloud required.

What It Does

DeepRepo ingests a codebase and builds three things simultaneously:

LayerWhat it storesUsed for
Code Knowledge GraphClasses, functions, imports, call edges (SQLite)Symbol lookup, blast-radius analysis
Embeddings + FTS indexSemantic vectors + full-text searchRelevant code retrieval
Hierarchical WikiPlain-English .md files per moduleAI explanations, chat context

A smart query router classifies every question and picks the cheapest context strategy, reducing LLM token usage by 5–50x compared to naive RAG.


Features

  • Zero dependencies on heavy frameworks — pure Python, SQLite-backed
  • Multiple AI providers — Ollama (free/local), OpenAI, Anthropic, Gemini, HuggingFace
  • CLI-first — deeprepo ingest . / deeprepo serve / deeprepo query "…"
  • Wiki viewer — browsable, searchable HTML wiki with in-page chat (deeprepo serve)
  • 7 focused MCP tools — drop DeepRepo into Cursor / Claude Desktop as an MCP server
  • Branch isolation — per-branch SQLite databases with copy-on-write from base branches
  • 3-tier retrieval — Embeddings → FTS → Graph fallback for resilient search
  • Incremental ingestion — unchanged files are skipped; only deltas re-processed

Quick Start

1. Install

cd deeprepo_core
pip install -e .

For MCP server support:

pip install -e ".[mcp]"

2. Install Ollama (free, local — recommended)

# macOS
brew install ollama
ollama serve                          # keep this running

ollama pull nomic-embed-text          # embedding model
ollama pull llama3.1:8b               # LLM

3. Ingest your codebase

cd /path/to/your/project
deeprepo ingest .

4. Browse the wiki

deeprepo serve                        # opens http://localhost:8080

5. Ask questions

deeprepo query "how does authentication work?"
deeprepo query "what breaks if I change auth.py?"

CLI Reference

deeprepo <command> [options]
CommandWhat it does
deeprepo initDetect provider setup, print the ingest command
deeprepo ingest [PATH]Scan repo → build graph + wiki + embeddings
deeprepo wiki [PATH]Regenerate wiki pages only (skip re-indexing)
deeprepo serveLaunch wiki viewer + in-page chat at port 8080
deeprepo query "QUESTION"Ask a question, get an AI answer
deeprepo statusShow branch isolation & cache freshness

Common flags (all commands)

--llm ollama|openai|anthropic|gemini|huggingface   # LLM provider
--embed ollama|openai|huggingface                  # embedding provider (default: same as --llm)
--branch-isolation                                 # enable per-branch databases
--base-branch main                                 # seed feature-branch cache from main
--wiki-dir .deeprepo/wiki                          # override wiki output directory

ingest flags

--chunk-size N      # chars per text chunk (default: 1000)
--overlap N         # overlap between chunks (default: 100)
--workers N         # wiki parallel workers (default: 3)
--no-wiki           # skip wiki generation

serve flags

--port N            # HTTP port (default: 8080)

Examples

# Ollama (free, fully local)
deeprepo ingest .

# OpenAI embeddings + Anthropic LLM
deeprepo ingest . --embed openai --llm anthropic

# Branch isolation for a feature branch
deeprepo ingest . --branch-isolation --base-branch main

# Serve wiki with chat on a custom port
deeprepo serve --llm openai --port 9000

# Query with specific top-k results
deeprepo query "where is AuthService defined?" --top-k 3

Python API

from deeprepo import DeepRepoClient

# Single provider (backward-compatible shorthand)
client = DeepRepoClient(provider_name="ollama")

# Split providers — Anthropic LLM + OpenAI embeddings
client = DeepRepoClient(
    embedding_provider_name="openai",
    llm_provider_name="anthropic",
)

# Branch isolation (team workflow)
client = DeepRepoClient(
    provider_name="ollama",
    branch_isolation=True,
    base_branches=["main"],
)

# Ingest (incremental — unchanged files are skipped)
result = client.ingest("/path/to/your/code")
print(f"Files: {result['files_scanned']}, Wiki pages: {result['wiki_generated']}")

# Query — smart routing selects the cheapest context strategy
response = client.query("How does authentication work?")
print(response['answer'])
print(f"Intent: {response['intent']}, Strategy: {response['strategy']}")
print(f"Sources: {response['sources']}")        # list of file paths

# Browse the generated wiki
print(f"Wiki at: {client.get_wiki_dir()}")

query() return shape

{
    "answer":         str,           # LLM-generated answer
    "sources":        list[str],     # file paths used as context
    "intent":         str,           # navigate | impact | explain | debug | review | general
    "strategy":       str,           # e.g. symbol_lookup, blast_radius, wiki_plus_skeleton, …
    "retrieval":      str,           # embeddings | fts | graph
    "token_estimate": int,           # estimated tokens consumed
    "history":        list[dict],    # conversation history (last N exchanges)
}

Supported AI Providers

ProviderCostSetupBest For
OllamaFREE, unlimitedInstall app + ollama pullLocal dev, privacy, offline
OpenAIPaidOPENAI_API_KEYProduction, best quality
AnthropicPaidANTHROPIC_API_KEYProduction, excellent reasoning
GeminiFree tierGEMINI_API_KEYExperimentation
HuggingFaceFree tierHUGGINGFACE_API_KEYCloud embeddings, no GPU needed

Note: Anthropic has no embeddings API. Pair it with another provider:

client = DeepRepoClient(embedding_provider_name="openai", llm_provider_name="anthropic")

Architecture

deeprepo_core/src/deeprepo/
├── client.py         # Main facade — branch isolation, freshness, provider wiring
├── graph.py          # SQLite store: graph nodes/edges, embeddings, wiki index, state
├── graph_builder.py  # Tree-sitter AST parser → code knowledge graph
├── wiki.py           # Hierarchical wiki engine — bottom-up LLM synthesis
├── router.py         # Intent classifier + 6 context strategy selectors
├── ingestion.py      # File scanner, chunker, language detection
├── interfaces.py     # Abstract base classes (EmbeddingProvider, LLMProvider)
├── registry.py       # @register_embedding / @register_llm decorator system
├── ui.py             # Wiki viewer (HTTP server + mermaid renderer + chat)
├── mcp/
│   └── server.py     # 7 MCP tools for AI assistants (Cursor, Claude Desktop)
└── providers/
    ├── ollama_v.py
    ├── openai_v.py
    ├── anthropic_v.py
    ├── gemini_v.py
    └── huggingface_v.py

.deeprepo/            # Generated (gitignore this)
├── default.db        # SQLite: graph + embeddings + wiki index + state
├── <branch>.db       # Per-branch database when branch_isolation=True
└── wiki/             # Browsable .md wiki files
    ├── overview.md   # Whole-repo narrative overview
    └── *.md          # One page per module

Storage

Everything lives in a single SQLite file per branch — no Redis, no Postgres, no Chroma.

TableContents
nodesFiles, classes, functions with metadata
edgesImport / call relationships between nodes
embeddingsFloat vectors for semantic search
wiki_pagesGenerated wiki markdown (key → content)
wiki_ftsFull-text search index over wiki
statePer-file SHA-256 hashes for incremental updates

Design Patterns

  • Facade — DeepRepoClient is the single entry point; internals are hidden
  • Strategy — LLMProvider / EmbeddingProvider abstract interfaces; providers are swappable
  • Registry — @register_llm("ollama") decorator auto-registers providers at import time
  • Bottom-up synthesis — wiki pages generated leaves-first; parent pages consume child summaries
  • 3-tier fallback — Embeddings → FTS → Graph; queries work even when embeddings are cold
  • Copy-on-write branching — feature branches start from base-branch cache, then delta-update

MCP Server (AI Assistant Integration)

Connect DeepRepo as an MCP server so Cursor, Claude Desktop, or any MCP-compatible AI assistant can call it directly — without ever reading raw files.

Setup

pip install deeprepo[mcp]

Cursor — create ~/.cursor/mcp.json:

{
  "mcpServers": {
    "deeprepo": {
      "command": "python",
      "args": ["-m", "deeprepo.mcp.server"],
      "env": {
        "LLM_PROVIDER": "ollama"
      }
    }
  }
}

Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "deeprepo": {
      "command": "deeprepo-mcp",
      "env": {
        "EMBEDDING_PROVIDER": "openai",
        "LLM_PROVIDER": "anthropic",
        "OPENAI_API_KEY": "sk-...",
        "ANTHROPIC_API_KEY": "sk-ant-..."
      }
    }
  }
}

Available MCP Tools (7 tools)

ToolWhen to useToken cost
ingest_codebaseOne-time setup — index a repo directory—
find_symbol"Where is X defined / what line is X on"~50 tokens
get_file_structure"Show me the API / functions in X"~150 tokens
explain_file"How does X work / explain X / what does X do"~300 tokens
find_change_impact"What breaks if I change X"~300 tokens
ask_codebaseAny open-ended question about the code~600–2000 tokens
get_project_overview"Give me an overview / what does this project do"~600 tokens

Token Reduction vs Naive RAG

Query typeNaive RAGDeepRepoReduction
"where is X defined"~4 000 tokens~80 tokens50x
"what breaks if I change X"~4 000 tokens~300 tokens13x
"how does X work"~4 000 tokens~600 tokens7x
"fix the bug in X"~4 000 tokens~900 tokens4x

CLAUDE.md tip

Add this to your project's CLAUDE.md so Claude automatically uses DeepRepo:

## Code navigation
Before reading any source file directly, use these MCP tools:
- `find_symbol(name)` to locate a class or function
- `get_file_structure(filepath)` to see a file's API without reading it
- `explain_file(filepath)` to understand what a file does
- `find_change_impact(filepath)` before editing any file
- `ask_codebase(question)` for open-ended questions
- `get_project_overview()` at the start of a new session

Only call Read/Grep on a file after the above tools have been tried.

Configuration

Environment Variables

VariableDefaultDescription
LLM_PROVIDERopenaiLLM provider name
EMBEDDING_PROVIDERsame as LLM_PROVIDEREmbedding provider name
OPENAI_API_KEY—Required for OpenAI
ANTHROPIC_API_KEY—Required for Anthropic
GEMINI_API_KEY—Required for Gemini
HUGGINGFACE_API_KEY / HF_TOKEN—Required for HuggingFace
OLLAMA_MODELllama3.1:8bOllama LLM model name
OLLAMA_EMBED_MODELnomic-embed-textOllama embedding model
OLLAMA_BASE_URLhttp://localhost:11434Ollama server URL
OLLAMA_TIMEOUT300LLM response timeout (seconds)

Testing

# Full end-to-end test suite (runs ingest + all checks)
python3 test_deeprepo_flow.py

# Skip ingest, use cached index (faster iteration)
python3 test_deeprepo_flow.py --skip-ingest

# pytest unit tests
pytest tests/unit/ -v

# pytest with coverage
pytest tests/unit/ --cov=deeprepo --cov-report=html

The test_deeprepo_flow.py script tests all 7 sections end-to-end:

  1. Client initialisation & branch flags
  2. Ingest (graph + embeddings + wiki)
  3. WikiEngine — page generation, caching, repo overview
  4. Graph API — skeleton, blast-radius, symbol lookup
  5. RAG / Router — intent classification, query execution
  6. CLI commands — all subcommands + help
  7. Branch isolation flag combinations

Adding a New Provider

  1. Create src/deeprepo/providers/myprovider.py
  2. Implement EmbeddingProvider and/or LLMProvider interfaces
  3. Decorate with @register_embedding("myprovider") / @register_llm("myprovider")
  4. Auto-discovered at import time — no other changes needed
from deeprepo.interfaces import EmbeddingProvider, LLMProvider
from deeprepo.registry import register_embedding, register_llm

@register_embedding("myprovider")
class MyEmbeddingProvider(EmbeddingProvider):
    def embed(self, text: str) -> list[float]:
        ...  # return a list of floats

@register_llm("myprovider")
class MyLLMProvider(LLMProvider):
    def generate(self, prompt: str, context: str | None = None) -> str:
        ...  # return generated text

Documentation

  • DEVELOPER_WORKFLOW_GUIDE.md — daily dev workflows and automation recipes
  • deeprepo_core/README.md — package README (PyPI)
  • docs/high-level-design.excalidraw — process flow diagram
  • docs/class-interaction-design.excalidraw — class diagram

License

MIT License — see LICENSE file for details.


Built for developers who want full control over their RAG pipelines.

Reviews

No reviews yet

Be the first to review this server!

0

installs

New

no ratings yet

Is this your server?

Claim ownership to manage your listing, respond to reviews, and track installs from your dashboard.

Claim with GitHub

Sign up with the GitHub account that owns this repo

Links

Source Code

Details

Published February 24, 2026
Version 1.0.0
0 installs
Local Plugin

More Developer Tools MCP Servers

Git

Free

by Modelcontextprotocol · Developer Tools

Read, search, and manipulate Git repositories programmatically

80.0K
Stars
4
Installs
6.5
Security
No ratings yet
Local

Toleno

Free

by Toleno · Developer Tools

Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.

137
Stars
483
Installs
8.0
Security
4.8
Local

mcp-creator-python

Free

by mcp-marketplace · Developer Tools

Create, build, and publish Python MCP servers to PyPI — conversationally.

-
Stars
65
Installs
10.0
Security
4.6
Local

MarkItDown

Free

by Microsoft · Content & Media

Convert files (PDF, Word, Excel, images, audio) to Markdown for LLM consumption

120.0K
Stars
22
Installs
6.0
Security
5.0
Local

mcp-creator-typescript

Free

by mcp-marketplace · Developer Tools

Scaffold, build, and publish TypeScript MCP servers to npm — conversationally

-
Stars
16
Installs
10.0
Security
5.0
Local

FinAgent

Free

by mcp-marketplace · Finance

Free stock data and market news for any MCP-compatible AI assistant.

-
Stars
16
Installs
10.0
Security
No ratings yet
Local