ImageSorcery MCP

The ImageSorcery MCP Server allows your local AI assistant like Claude Code and Cursor to edit and analyze images directly on your local machine.

This means you can tell your AI assistant, in plain language, to perform tasks like cropping photos, removing backgrounds, or detecting objects, and it all happens privately on your computer. You don’t have to upload your images to any third-party service.

Official Website

Key Features

🎯 Object detection with segmentation masks and polygon outputs
📝 OCR text extraction from images using EasyOCR
✂️ Basic image operations: crop, resize, rotate
🎨 Advanced editing: background removal, color changes, blur effects
📐 Drawing tools: arrows, circles, rectangles, text overlays
🔍 Text-based object finding using CLIP models
📊 Image metadata extraction
🖼️ Image overlaying and watermarking
⚙️ Configurable confidence thresholds and model settings

How To Use It

1. Install the package using pipx.

pipx install imagesorcery-mcp

2. Run the post-installation script.

imagesorcery-mcp --post-install

3. Add the ImageSorcery server to your MCP client’s configuration file. For an installation done with pipx, the configuration looks like this:

"mcpServers": {
    "imagesorcery-mcp": {
      "command": "imagesorcery-mcp",
      "transportType": "stdio",
      "autoApprove": ["blur", "change_color", "config", "crop", "detect", "draw_arrows", "draw_circles", "draw_lines", "draw_rectangles", "draw_texts", "fill", "find", "get_metainfo", "ocr", "overlay", "resize", "rotate"],
      "timeout": 100
    }
}

If you installed it manually into a virtual environment, you’ll need to provide the full path to the imagesorcery-mcp executable in the "command" field.

4. Once configured, you can start making requests to your AI assistant. For best results, be specific. For example:

"use imagesorcery to find a cat in photo.jpg and crop the image to center the cat"
"use imagesorcery to extract all text from document.png using OCR"

5. All available tools

Tool	Description
`blur`	Blurs specified rectangular or polygonal areas of an image.
`change_color`	Changes the color palette of an image (e.g., to sepia).
`config`	View and update ImageSorcery MCP configuration settings.
`crop`	Crops an image using NumPy slicing.
`detect`	Detects objects using models from Ultralytics.
`draw_arrows`	Draws arrows on an image.
`draw_circles`	Draws circles on an image.
`draw_lines`	Draws lines on an image.
`draw_rectangles`	Draws rectangles on an image.
`draw_texts`	Draws text on an image.
`fill`	Fills specified areas with a color and opacity.
`find`	Finds objects in an image based on a text description.
`get_metainfo`	Gets metadata information about an image file.
`ocr`	Performs Optical Character Recognition (OCR) on an image.
`overlay`	Overlays one image on top of another.
`resize`	Resizes an image.
`rotate`	Rotates an image.

FAQs

Q: The find tool isn’t working with text prompts. What’s wrong?
A: This is almost always because the clip Python package failed to install. The find tool relies on it to understand text prompts. Run imagesorcery-mcp --post-install again. If it still fails, especially if you use uv, you may need to install it manually into your active virtual environment with pip install git+https://github.com/ultralytics/CLIP.git.

Q: Why do I need to install system libraries like ffmpeg?
A: ImageSorcery uses OpenCV (cv2) under the hood for most of its image processing tasks. OpenCV, in turn, depends on these system-level libraries for handling various image and video codecs and for rendering windows. They are often missing in minimal environments like Docker containers.

Q: How do I add and use a new object detection model?
A: The project includes a script for this. You can download models from sources like Ultralytics or Hugging Face directly from the command line. For example: download-yolo-models --huggingface ultralytics/yolov8:yolov8m.pt. Once downloaded, the model will be registered in models/model_descriptions.json, and you can specify it in your prompts to the detect tool.

Q: Is the telemetry feature safe? Does it collect my image data?
A: The telemetry is completely safe and privacy-focused. It is disabled by default and you must explicitly opt-in. It NEVER collects any personal or sensitive data, including image data, file paths, or IP addresses. It only collects anonymized usage data (e.g., which tools are used, application version) to help the developers fix bugs and prioritize features.

Latest MCP Servers

CVE

An MCP Server that connects Claude to 27 security tools for CVE triage, EPSS checks, KEV status, exploit lookup, and package scanning.

WebMCP

webmcp is an MCP server that connects MCP clients to web search, page fetching, and local LLM-based extraction. It’s ideal…

Google Meta Ads GA4

An MCP server that connects AI assistants to Google Ads, Meta Ads, and GA4 for reporting, edits, and cross-platform analysis.

View More MCP Servers >>

Featured MCP Servers

Notion

Notion's official MCP Server allows you to interact with Notion workspaces through the Notion API.

Claude Peers

An MCP server that enables Claude Code instances to discover each other and exchange messages instantly via a local broker daemon with SQLite persistence.

Excalidraw

Excalidraw's official MCP server that streams interactive hand-drawn diagrams to Claude, ChatGPT, and VS Code with smooth camera control and fullscreen editing.

More Featured MCP Servers >>

FAQs

Q: What exactly is the Model Context Protocol (MCP)?

A: MCP is an open standard, like a common language, that lets AI applications (clients) and external data sources or tools (servers) talk to each other. It helps AI models get the context (data, instructions, tools) they need from outside systems to give more accurate and relevant responses. Think of it as a universal adapter for AI connections.

Q: How is MCP different from OpenAI's function calling or plugins?

A: While OpenAI's tools allow models to use specific external functions, MCP is a broader, open standard. It covers not just tool use, but also providing structured data (Resources) and instruction templates (Prompts) as context. Being an open standard means it's not tied to one company's models or platform. OpenAI has even started adopting MCP in its Agents SDK.

Q: Can I use MCP with frameworks like LangChain?

A: Yes, MCP is designed to complement frameworks like LangChain or LlamaIndex. Instead of relying solely on custom connectors within these frameworks, you can use MCP as a standardized bridge to connect to various tools and data sources. There's potential for interoperability, like converting MCP tools into LangChain tools.

Q: Why was MCP created? What problem does it solve?

A: It was created because large language models often lack real-time information and connecting them to external data/tools required custom, complex integrations for each pair. MCP solves this by providing a standard way to connect, reducing development time, complexity, and cost, and enabling better interoperability between different AI models and tools.

Q: Is MCP secure? What are the main risks?

A: Security is a major consideration. While MCP includes principles like user consent and control, risks exist. These include potential server compromises leading to token theft, indirect prompt injection attacks, excessive permissions, context data leakage, session hijacking, and vulnerabilities in server implementations. Implementing robust security measures like OAuth 2.1, TLS, strict permissions, and monitoring is crucial.

Q: Who is behind MCP?

A: MCP was initially developed and open-sourced by Anthropic. However, it's an open standard with active contributions from the community, including companies like Microsoft and VMware Tanzu who maintain official SDKs.