ahk-mcp
ahk-mcp is a Model Context Protocol (MCP) server that extends AutoHotkey functions for automating tasks on Windows. It supports operations like window enumeration, keyboard and mouse control, and screen capture with OCR, offering a total of 33 tools.
ahk-mcp
MCP server exposing AutoHotkey functionality, enabling model interfaces to automation tasks on Windows.
This server only works on Windows and provides the following tools to your AI:
- Ability to enumerate windows/applications
- Ability to control keyboard/mouse (typing, clicking, etc)
- Ability to interrogate Windows APIs (via AutoHotkey) about windows (e.g., to get the text of a window, the position of its GUI controls, etc.)
- Screen capture & OCR functionality (useful when text is not exposed properly by Windows APIs/controls)
- Ability to get accurate window positioning and contextual information about computer monitors (e.g., know what windows are on your primary/secondary monitors)
- Ability to manipulate windows and other actions via AutoHotkey
In total, there are 33 tools currently exposed by the server, the above is just a simple overview.
While we work on documentation, exploring the source code in main.py
is encouraged!
Usage
This project makes use of the Python MCP SDK with FastMCP. Please see the Python MCP SDK repo and documentation for detailed information.
Assuming you've already setup mcp
CLI, you can install this MCP service in Claude Desktop with a simple mcp
command:
mcp install main.py
This project depends on ahk-binary
to provide the required AutoHotkey executables and the ahk
project to interface with AutoHotkey. It uses mss
, easyocr
, and numpy
for OCR functionality.
Contributing
The best way to contribute is right here on GitHub. Please feel free to open an issue to get started. Pull requests are also welcome, but it is strongly recommended to open an issue first, especially for significant changes.