home / mcp / qdrant rag mcp server
Provides semantic search across codebases and intelligent GitHub issue/project workflows via an MCP server.
Configuration
View docs{
"mcpServers": {
"ancoleman-qdrant-rag-mcp": {
"command": "python",
"args": [
"/path/to/qdrant-rag/src/qdrant_mcp_context_aware.py"
],
"env": {
"MCP_CLIENT_CWD": "${workspaceFolder}"
}
}
}
}You run a context-aware MCP server that enables Claude Code to perform fast, semantic searches across your codebase and manage GitHub workflows, projects, and issue resolution with intelligent context handling. This server uses a Qdrant vector store to deliver precise results while tracking context, reducing token usage, and supporting project-specific scopes for safer, scalable AI-assisted development.
You interact with the MCP server through an MCP client integrated in Claude Code. Start in a project, let the system detect the active project, and then perform semantic search, code understanding, or GitHub-driven workflows. You can switch projects, index code on demand, and run specialized operations such as issue analysis, fix generation, and PR creation while benefiting from progressive context and sub-issue management.
Practical usage patterns you can perform include: deep semantic search across code and configuration files, dependency-aware code exploration, and multi-project knowledge separation. You can enable context tracking to see how Claudeβs knowledge grows during a session, monitor token usage, and receive warnings as your context window fills. You can also leverage GitHub integration to analyze issues, generate fixes, create PRs in a safe dry-run mode, and manage projects with V2 tooling.
Follow these concrete steps to install and run the MCP server for quick, reliable access from Claude Code.
# 1. Clone the MCP server repository
git clone <repository-url> ~/mcp-servers/qdrant-rag
cd ~/mcp-servers/qdrant-rag
# 2. Run the setup script to start Qdrant and install dependencies
./scripts/setup.sh
# 3. Install the MCP server globally for context awareness (one-time)
./install_global.sh
# 4. Test in any project by launching Claude Code and checking the current project context
cd ~/any-project
claude
# In Claude, ask: What's my current project context?To configure the MCP server for local, stdio-based operation, point Claude Code to the Python context- aware entry point with an appropriate working directory. The following configuration shows how to run the MCP server as a local process using Python and the provided script path.
{
"mcpServers": {
"qdrant_rag": {
"type": "stdio",
"command": "python",
"args": ["/path/to/qdrant-rag/src/qdrant_mcp_context_aware.py"],
"env": {
"MCP_CLIENT_CWD": "${workspaceFolder}"
}
}
}
}Auto-indexing keeps the knowledge index up to date as you modify files. Enable it for the current session or permanently in your shell profile.
# For the current session only
export QDRANT_RAG_AUTO_INDEX=true
claude
# For permanent auto-indexing
echo 'export QDRANT_RAG_AUTO_INDEX=true' >> ~/.bashrc
# or for zsh users:
echo 'export QDRANT_RAG_AUTO_INDEX=true' >> ~/.zshrcYou can run a health check to verify that Qdrant, embeddings, and project context are operating correctly. The health check will report the status of the Qdrant connection, embedding model, disk space, memory usage, and current project context.
If you encounter issues, use the built-in health status and the project-aware logs to isolate problems. Common issues include misdetected projects, MPS-related performance concerns on Apple Silicon, or indexing delays. The system includes retry logic, health monitoring, and clear error messages to guide you.
When using the GitHub integration, you can perform intelligent issue analysis, generate fixes, and create PRs with dry-run safety. You can also manage GitHub Projects V2 with smart field assignment and create roadmaps or templates. Sub-issues and hierarchical relationships help you break down complex work into parent/child structures.
The server uses progressive context management to pull optimal context levels (file β class β method) for token efficiency. It applies specialized, language-aware embeddings for code, config, and documentation, reducing cross-type noise and improving search relevance.
The MCP server includes safety features such as dry-run mode for GitHub actions, rate limiting, audit logging, and file protection to prevent unsafe operations. It also provides health checks and input validation to prevent path traversal and other common issues.
You can perform a range of actions via Claude Code, including switching repositories, analyzing issues, generating fixes, and updating GitHub project items. The system supports templates for roadmaps, bug tracking, and feature planning, with project-specific logging and dashboards.
RAG-powered issue analysis, automatic fix generation, and dry-run PR creation with smart safety checks.
Create and manage GitHub Projects V2, add items, update fields, and track progress with templates.
Monitor context usage, token consumption, and session history for transparency.
Code, Config, and Documentation embeddings to improve search relevance.
Multi-level context retrieval with semantic caching and intent detection.
Structure-aware parsing for code and configuration files to reduce tokens.
Include dependent files based on import/export relationships.
Combine semantic and keyword search for better precision.
Isolated, rich logs per project for debugging and analysis.
Built-in health checks across all services.