An MCP server that routes LLM requests across multiple providers and orchestrates other MCP servers, with a focus on local privacy for embeddings and memory.
Claim it to get a verified publisher badge, a free copy of our full audit findings, and direct contact for any high-priority issues we find.
Install from
M8ven verifies MCPs across every public registry — install directly from whichever one you prefer.
process.env. You'll be asked to provide them before it can run.OPENAI_BASE_URLOPENROUTER_BASE_URLDEEPINFRA_BASE_URLANTHROPIC_BASE_URLDEEPSEEK_BASE_URLMCP_JUDGE_DATABASE_URL— export ="sqlite:///./.mcp-llm-router/judge_history.db"LLM_API_KEY— Note: the demo skips request_plan_approval because it requires user elicitation. Ensure DEEPSEEK_API_KEY (or ) is set and Ollama is running for embeddings.LLM_MODEL_NAMELLM_BASE_URLDEEPSEEK_API_KEY— "": "your-deepseek-key",MODELMCP_JUDGE_ENABLE_TASKSEMBEDDINGS_BASE_URL— "": "http://localhost:11434",EMBEDDINGS_MODEL— export ="nomic-embed-text"EMBEDDINGS_PROVIDER— "": "ollama",EMBEDDINGS_PATH— export ="/api/embed"RERANK_BASE_URL— export ="https://api.openai.com/v1"RERANK_MODEL— export ="tomaarsen/Qwen3-Reranker-0.6B-seq-cls" # Default modelRERANK_PROVIDER— export ="local"RERANK_PATH— export ="/chat/completions"RERANK_MODE— export ="local"EMBEDDINGS_API_KEY_ENV— No needed for local Ollama!RERANK_API_KEY_ENV— export ="OPENAI_API_KEY"MCP_ROUTER_MEMORY_DB— export ="./.mcp-llm-router/memory.db"ROUTER_BRAIN_MODEL— "": "deepseek-reasoner",ROUTER_BRAIN_BASE_URL— "": "https://openrouter.ai/api/v1",ROUTER_BRAIN_API_KEY_ENV— "": "DEEPSEEK_API_KEY",ROUTER_BRAIN_PROVIDER— "": "deepseek",ROUTER_BRAIN_MAX_TOKENS— export ="4000"ROUTER_BRAIN_TEMPERATURE— export ="0.2"ROUTER_BRAIN_SYSTEM_PROMPTROUTER_BRAIN_REASONING_EFFORT[](https://m8ven.ai/mcp/groxaxo-mcp-llm-router-32qptg)