ImageSorcery MCP

The ImageSorcery MCP Server allows your local AI assistant like Claude Code and Cursor to edit and analyze images directly on your local machine.

This means you can tell your AI assistant, in plain language, to perform tasks like cropping photos, removing backgrounds, or detecting objects, and it all happens privately on your computer. You don’t have to upload your images to any third-party service.

Key Features

  • 🎯 Object detection with segmentation masks and polygon outputs
  • 📝 OCR text extraction from images using EasyOCR
  • ✂️ Basic image operations: crop, resize, rotate
  • 🎨 Advanced editing: background removal, color changes, blur effects
  • 📐 Drawing tools: arrows, circles, rectangles, text overlays
  • 🔍 Text-based object finding using CLIP models
  • 📊 Image metadata extraction
  • 🖼️ Image overlaying and watermarking
  • ⚙️ Configurable confidence thresholds and model settings

How To Use It

1. Install the package using pipx.

pipx install imagesorcery-mcp

2. Run the post-installation script.

imagesorcery-mcp --post-install

3. Add the ImageSorcery server to your MCP client’s configuration file. For an installation done with pipx, the configuration looks like this:

"mcpServers": {
    "imagesorcery-mcp": {
      "command": "imagesorcery-mcp",
      "transportType": "stdio",
      "autoApprove": ["blur", "change_color", "config", "crop", "detect", "draw_arrows", "draw_circles", "draw_lines", "draw_rectangles", "draw_texts", "fill", "find", "get_metainfo", "ocr", "overlay", "resize", "rotate"],
      "timeout": 100
    }
}

If you installed it manually into a virtual environment, you’ll need to provide the full path to the imagesorcery-mcp executable in the "command" field.

4. Once configured, you can start making requests to your AI assistant. For best results, be specific. For example:

  • "use imagesorcery to find a cat in photo.jpg and crop the image to center the cat"
  • "use imagesorcery to extract all text from document.png using OCR"

5. All available tools

ToolDescription
blurBlurs specified rectangular or polygonal areas of an image.
change_colorChanges the color palette of an image (e.g., to sepia).
configView and update ImageSorcery MCP configuration settings.
cropCrops an image using NumPy slicing.
detectDetects objects using models from Ultralytics.
draw_arrowsDraws arrows on an image.
draw_circlesDraws circles on an image.
draw_linesDraws lines on an image.
draw_rectanglesDraws rectangles on an image.
draw_textsDraws text on an image.
fillFills specified areas with a color and opacity.
findFinds objects in an image based on a text description.
get_metainfoGets metadata information about an image file.
ocrPerforms Optical Character Recognition (OCR) on an image.
overlayOverlays one image on top of another.
resizeResizes an image.
rotateRotates an image.

FAQs

Q: The find tool isn’t working with text prompts. What’s wrong?
A: This is almost always because the clip Python package failed to install. The find tool relies on it to understand text prompts. Run imagesorcery-mcp --post-install again. If it still fails, especially if you use uv, you may need to install it manually into your active virtual environment with pip install git+https://github.com/ultralytics/CLIP.git.

Q: Why do I need to install system libraries like ffmpeg?
A: ImageSorcery uses OpenCV (cv2) under the hood for most of its image processing tasks. OpenCV, in turn, depends on these system-level libraries for handling various image and video codecs and for rendering windows. They are often missing in minimal environments like Docker containers.

Q: How do I add and use a new object detection model?
A: The project includes a script for this. You can download models from sources like Ultralytics or Hugging Face directly from the command line. For example: download-yolo-models --huggingface ultralytics/yolov8:yolov8m.pt. Once downloaded, the model will be registered in models/model_descriptions.json, and you can specify it in your prompts to the detect tool.

Q: Is the telemetry feature safe? Does it collect my image data?
A: The telemetry is completely safe and privacy-focused. It is disabled by default and you must explicitly opt-in. It NEVER collects any personal or sensitive data, including image data, file paths, or IP addresses. It only collects anonymized usage data (e.g., which tools are used, application version) to help the developers fix bugs and prioritize features.

Latest MCP Servers

CVE

An MCP Server that connects Claude to 27 security tools for CVE triage, EPSS checks, KEV status, exploit lookup, and package scanning.

WebMCP

webmcp is an MCP server that connects MCP clients to web search, page fetching, and local LLM-based extraction. It’s ideal…

Google Meta Ads GA4

An MCP server that connects AI assistants to Google Ads, Meta Ads, and GA4 for reporting, edits, and cross-platform analysis.

View More MCP Servers >>

Featured MCP Servers

Notion

Notion's official MCP Server allows you to interact with Notion workspaces through the Notion API.

Claude Peers

An MCP server that enables Claude Code instances to discover each other and exchange messages instantly via a local broker daemon with SQLite persistence.

Excalidraw

Excalidraw's official MCP server that streams interactive hand-drawn diagrams to Claude, ChatGPT, and VS Code with smooth camera control and fullscreen editing.

More Featured MCP Servers >>

FAQs

Q: What exactly is the Model Context Protocol (MCP)?

A: MCP is an open standard, like a common language, that lets AI applications (clients) and external data sources or tools (servers) talk to each other. It helps AI models get the context (data, instructions, tools) they need from outside systems to give more accurate and relevant responses. Think of it as a universal adapter for AI connections.

Q: How is MCP different from OpenAI's function calling or plugins?

A: While OpenAI's tools allow models to use specific external functions, MCP is a broader, open standard. It covers not just tool use, but also providing structured data (Resources) and instruction templates (Prompts) as context. Being an open standard means it's not tied to one company's models or platform. OpenAI has even started adopting MCP in its Agents SDK.

Q: Can I use MCP with frameworks like LangChain?

A: Yes, MCP is designed to complement frameworks like LangChain or LlamaIndex. Instead of relying solely on custom connectors within these frameworks, you can use MCP as a standardized bridge to connect to various tools and data sources. There's potential for interoperability, like converting MCP tools into LangChain tools.

Q: Why was MCP created? What problem does it solve?

A: It was created because large language models often lack real-time information and connecting them to external data/tools required custom, complex integrations for each pair. MCP solves this by providing a standard way to connect, reducing development time, complexity, and cost, and enabling better interoperability between different AI models and tools.

Q: Is MCP secure? What are the main risks?

A: Security is a major consideration. While MCP includes principles like user consent and control, risks exist. These include potential server compromises leading to token theft, indirect prompt injection attacks, excessive permissions, context data leakage, session hijacking, and vulnerabilities in server implementations. Implementing robust security measures like OAuth 2.1, TLS, strict permissions, and monitoring is crucial.

Q: Who is behind MCP?

A: MCP was initially developed and open-sourced by Anthropic. However, it's an open standard with active contributions from the community, including companies like Microsoft and VMware Tanzu who maintain official SDKs.

Get the latest & top AI tools sent directly to your email.

Subscribe now to explore the latest & top AI tools and resources, all in one convenient newsletter. No spam, we promise!