Screenshot by kazuph - MCP Server

MCP Screenshot

An MCP server that captures screenshots and performs OCR text recognition.

Features

Screenshot capture (left half, right half, full screen)
OCR text recognition (supports Japanese and English)
Multiple output formats (JSON, Markdown, vertical, horizontal)

OCR Engines

This server uses two OCR engines:

yomitoku (high-accuracy Japanese text recognition, runs as an API server)
Tesseract.js (fallback engine, supports Japanese and English)

Usage Example

Instruct Claude for screenshots and text recognition, for example, "Please take a screenshot of the left half of the screen and recognize the text in it."

Tool Specification

capture

Options: region (left/right/full, default: left), format (json/markdown/vertical/horizontal, default: markdown)