mcp-ocr
The MCP OCR Server is a robust, production-grade OCR solution built using the Model Context Protocol. Designed for simplicity and efficiency, it extracts text from various image sources, supports multiple languages, and provides robust error handling.
MCP OCR Server
A production-grade OCR server built using MCP (Model Context Protocol) that provides OCR capabilities through a simple interface.
Features
- Extract text from images using Tesseract OCR
- Support for multiple input types including local files, URLs, and raw bytes
- Automatic Tesseract installation on supported platforms
- Multi-language support
- Robust error handling
Installation
Tesseract will be installed automatically on supported platforms: macOS, Linux, and Windows.
Usage
Start the server and configure it for usage with tools like Claude for Desktop. Available tools include perform_ocr
for extracting text from images and get_supported_languages
for listing available OCR languages.
Development
Instructions for setting up the development environment, running tests, and contributing to the project are provided.
Security
Follow security best practices, avoid committing sensitive credentials, and use environment variables for secure storage.