mcp-ocr


The MCP OCR Server is a production-grade OCR server built on the Model Context Protocol (MCP). Designed for simplicity and efficiency, it extracts text from local image files, URLs, and raw image bytes, supports multiple languages, and provides robust error handling.

MCP OCR Server

A production-grade OCR server built using MCP (Model Context Protocol) that provides OCR capabilities through a simple interface.

Features

  • Extract text from images using Tesseract OCR
  • Support for multiple input types including local files, URLs, and raw bytes
  • Automatic Tesseract installation on supported platforms
  • Multi-language support
  • Robust error handling
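As an illustration of the multiple input types listed above, here is a minimal sketch of how an input could be sorted into one of the three kinds before OCR runs. The helper name classify_source is hypothetical and is not taken from the mcp-ocr codebase; the server's actual dispatch logic may differ.

```python
from pathlib import Path
from urllib.parse import urlparse


def classify_source(source):
    """Return 'bytes', 'url', or 'file' for an OCR input.

    Hypothetical helper for illustration only; not part of mcp-ocr.
    """
    # Raw image data is passed through as bytes.
    if isinstance(source, (bytes, bytearray)):
        return "bytes"
    if isinstance(source, (str, Path)):
        parsed = urlparse(str(source))
        # Only http(s) strings with a host are treated as URLs.
        if parsed.scheme in ("http", "https") and parsed.netloc:
            return "url"
        # Everything else is assumed to be a local file path.
        return "file"
    raise TypeError(f"unsupported input type: {type(source).__name__}")
```

Rejecting unknown types with an explicit TypeError keeps the failure close to the caller, which is one simple way to approach the error handling the feature list mentions.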

Installation

Tesseract will be installed automatically on supported platforms: macOS, Linux, and Windows.
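If you want to confirm that a Tesseract binary is reachable after installation, a small check like the following may help. It is a sketch that assumes the executable is named tesseract and is on the PATH; the helper name find_tesseract is hypothetical, and the server may locate Tesseract differently.

```python
import shutil


def find_tesseract(binary="tesseract"):
    """Return the full path of the Tesseract executable, or None if not found.

    Assumes the binary is discoverable through PATH (an assumption, not
    something the project documents).
    """
    return shutil.which(binary)


if __name__ == "__main__":
    path = find_tesseract()
    print(path if path else "Tesseract was not found on PATH")
```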

Usage

Start the server and configure it for use with MCP clients such as Claude for Desktop. Two tools are available: perform_ocr, which extracts text from an image, and get_supported_languages, which lists the available OCR languages.
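To make the client configuration concrete, the sketch below builds an entry in the mcpServers format that Claude for Desktop reads from its claude_desktop_config.json file. The server name "ocr" and the launch command python -m mcp_ocr are assumptions made for illustration; check the project's own documentation for the actual command.

```python
import json


def build_claude_config(server_name="ocr", command="python", args=("-m", "mcp_ocr")):
    """Build an mcpServers entry for Claude for Desktop.

    The default server name and launch command are assumptions,
    not values confirmed by the mcp-ocr project.
    """
    return {
        "mcpServers": {
            server_name: {
                "command": command,
                "args": list(args),
            }
        }
    }


if __name__ == "__main__":
    # Print JSON that can be merged into claude_desktop_config.json.
    print(json.dumps(build_claude_config(), indent=2))
```

Once the client has been restarted with such an entry, the perform_ocr and get_supported_languages tools should appear in its tool list.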

Development

Instructions for setting up the development environment, running tests, and contributing to the project are provided.

Security

Follow security best practices: never commit sensitive credentials to the repository, and store them in environment variables instead.
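A minimal sketch of the environment-variable approach follows. The variable name OCR_API_KEY and the helper load_secret are hypothetical, chosen only to show the pattern; the project does not state which credentials, if any, it needs.

```python
import os


def load_secret(name="OCR_API_KEY", environ=None):
    """Read a credential from the environment instead of source code.

    OCR_API_KEY is a hypothetical variable name used for illustration.
    """
    env = os.environ if environ is None else environ
    value = env.get(name)
    if not value:
        # Fail early with a clear message rather than running without the secret.
        raise RuntimeError(f"environment variable {name} is not set")
    return value
```

Passing a plain dictionary as environ makes the helper easy to exercise in tests without touching the real environment.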