Enables LLMs to query documents using semantic search, supporting PDFs, Word, Excel, and more. Organizes documents by topics from folder structure and provides advanced search features like phrase matching and date filtering.
Claim it to get a verified publisher badge, a free copy of our full audit findings, and direct contact for any high-priority issues we find.
Install from
M8ven verifies MCPs across every public registry — install directly from whichever one you prefer.
process.env. You'll be asked to provide them before it can run.DOCS_DIR— Documents directory (default: my-docs, configurable via DOCS_DIR env var)CHUNKING_STRATEGY— Chunking strategy: fixed_size, by_paragraph, semantic_heading, or by_token (default: by_paragraph)CHUNK_BY_TOKEN— CHUNK_SIZE - Characters per chunk (or tokens if =true) (default: 1000)TOKENIZER_MODEL— TikToken tokenizer for token chunking (default: cl100k_base)PRESERVE_HEADINGS— Preserve heading structure in chunks (env var, default: True)MAX_HEADING_CHUNK_SIZE— Max chars per heading-based chunk (default: 2000)DEFAULT_SEARCH_RESULTS— Default number of results (default: 10)MAX_SEARCH_RESULTS— Maximum allowed results (default: 20)USE_RERANKER— Enable re-ranker model for improved results (env var, default: True)RERANKER_MODEL— Re-ranker model name (default: cross-encoder/ms-marco-MiniLM-L-6-v2)RERANKER_TOP_N— Number of results to re-rank (default: 50)USE_BM25— Enable hybrid search with BM25 (env var, default: False)BM25_WEIGHTDEFAULT_TOPIC— Default topic for uncategorized docs (default: uncategorized)CHROMA_COLLECTION_NAME— Collection name (default: my-documents)FULL_SCAN_ON_BOOT— Set to True to scan documents on startup (default: False)FOLDER_WATCHER_ACTIVE_ON_BOOT— Set to True to start folder watcher on startup (default: True)DEBOUNCE_SECONDSFULL_SCAN_INTERVAL_DAYSMCP_TRANSPORTMCP_HOST— export =0.0.0.0MCP_PORT— export =38777[](https://m8ven.ai/mcp/dorianmeric-docs-to-ai-0e9bs7)