Gives any MCP client (OpenCode, Claude Code, Claude Desktop, Cursor, etc.) the ability to process images by automatically converting them to text descriptions using a vision model, so that text-only LLMs can handle image-based queries.
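This MCP reads the following environment variables from process.env. You'll be asked to provide them before it can run.

- VISION_BASE_URL: base URL of the vision model API, up to and including /v1. Do not append /chat/completions or /responses; the code adds the right path automatically based on the API style.
- VISION_API_KEY: API key for the vision model API (placeholder: "sk-xxxxx").
- VISION_MODEL: model name (qwen3.7-plus, gpt-4o, llava:13b, gpt-5.4, etc.).
- VISION_API_STYLE: API style, either chat (the default, which uses /chat/completions) or responses (for newer models such as GPT-5, which uses /responses).
- REQUEST_TIMEOUT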
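The way the endpoint is assembled from VISION_BASE_URL and VISION_API_STYLE can be illustrated with a minimal sketch. This is not the server's actual code: the function name `resolveVisionConfig`, the `VisionConfig` shape, the trailing-slash handling, and the choice to treat the base URL, key, and model as required are all assumptions made for illustration. Only the variable names, the `chat` default, and the two path suffixes come from the listing above.

```typescript
// Hypothetical helper: shows how the documented variables could map to a request URL.
type ApiStyle = "chat" | "responses";

interface VisionConfig {
  endpoint: string; // full URL the request would be sent to
  apiKey: string;
  model: string;
  style: ApiStyle;
}

function resolveVisionConfig(
  env: Record<string, string | undefined>
): VisionConfig {
  const baseUrl = env.VISION_BASE_URL;
  const apiKey = env.VISION_API_KEY;
  const model = env.VISION_MODEL;

  // Assumption: these three are treated as required in this sketch.
  if (!baseUrl || !apiKey || !model) {
    throw new Error(
      "VISION_BASE_URL, VISION_API_KEY and VISION_MODEL must be set"
    );
  }

  // "chat" is the documented default when VISION_API_STYLE is unset or empty.
  const rawStyle = env.VISION_API_STYLE || "chat";
  if (rawStyle !== "chat" && rawStyle !== "responses") {
    throw new Error(`Unsupported VISION_API_STYLE: ${rawStyle}`);
  }
  const style: ApiStyle = rawStyle;

  // The base URL stops at /v1; the path suffix depends on the API style.
  const path = style === "responses" ? "/responses" : "/chat/completions";
  // Assumption: tolerate a trailing slash on the base URL.
  const endpoint = baseUrl.replace(/\/+$/, "") + path;

  return { endpoint, apiKey, model, style };
}
```

In a Node.js process this sketch would be called as `resolveVisionConfig(process.env)`. Taking the environment as a parameter keeps the function easy to test without touching real environment variables.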