69
/ 100
4 days ago
glama

Docs-to-AI

Enables LLMs to query documents using semantic search, supporting PDFs, Word, Excel, and more. Organizes documents by topics from folder structure and provides advanced search features like phrase matching and date filtering.

Is this your MCP?

Claim it to get a verified publisher badge, a free copy of our full audit findings, and direct contact for any high-priority issues we find.

Install from

M8ven verifies MCPs across every public registry — install directly from whichever one you prefer.

// key findings
No credential exfiltration, no sensitive file access, no obfuscation
Static analysis found nothing flowing your secrets to unexpected places.
Open source with a license and README
Anyone can audit the code, the license is declared, and the publisher documents what it does.
🔐
You'll be asked for 1 credential: CHUNK_BY_TOKEN
These are read from process.env at runtime. Make sure you trust where they’ll be sent.
// required environment variables
This server reads these from process.env. You'll be asked to provide them before it can run.
configDOCS_DIRDocuments directory (default: my-docs, configurable via DOCS_DIR env var)
configCHUNKING_STRATEGYChunking strategy: fixed_size, by_paragraph, semantic_heading, or by_token (default: by_paragraph)
🔐 secretCHUNK_BY_TOKENCHUNK_SIZE - Characters per chunk (or tokens if =true) (default: 1000)
configTOKENIZER_MODELTikToken tokenizer for token chunking (default: cl100k_base)
configPRESERVE_HEADINGSPreserve heading structure in chunks (env var, default: True)
configMAX_HEADING_CHUNK_SIZEMax chars per heading-based chunk (default: 2000)
configDEFAULT_SEARCH_RESULTSDefault number of results (default: 10)
configMAX_SEARCH_RESULTSMaximum allowed results (default: 20)
configUSE_RERANKEREnable re-ranker model for improved results (env var, default: True)
configRERANKER_MODELRe-ranker model name (default: cross-encoder/ms-marco-MiniLM-L-6-v2)
configRERANKER_TOP_NNumber of results to re-rank (default: 50)
configUSE_BM25Enable hybrid search with BM25 (env var, default: False)
configBM25_WEIGHT
configDEFAULT_TOPICDefault topic for uncategorized docs (default: uncategorized)
configCHROMA_COLLECTION_NAMECollection name (default: my-documents)
configFULL_SCAN_ON_BOOTSet to True to scan documents on startup (default: False)
configFOLDER_WATCHER_ACTIVE_ON_BOOTSet to True to start folder watcher on startup (default: True)
configDEBOUNCE_SECONDS
configFULL_SCAN_INTERVAL_DAYS
configMCP_TRANSPORT
configMCP_HOSTexport =0.0.0.0
configMCP_PORTexport =38777
// full audit trail
The full breakdown of what we checked, the deductions that landed, the network hosts, the dependency advisories, and concrete fix guidance is available to verified publishers.
// improvement guidance — verified publishers only
We have 5 concrete improvements we can share with the publisher of this MCP. Each comes with specific guidance to raise the trust score.
// embed badge in your README
[![M8ven Score](https://m8ven.ai/badge/mcp/dorianmeric-docs-to-ai-0e9bs7)](https://m8ven.ai/mcp/dorianmeric-docs-to-ai-0e9bs7)
commit: 153873183b2d9bda16998a1f54bdb2adc06da4c0
code hash: 469d4872dc8c99d8f09255a764496047b1478798cc547c7505f2e2c39959501f
verified: 6/22/2026, 12:26:40 PM
view raw JSON →