ScreenPilot
ScreenPilot is an MCP server designed to facilitate complete device control via a screen automation toolkit. It enables automation and interactivity with graphical user interfaces, providing features like screen capture, mouse control, and keyboard input.
An MCP server that lets an LLM take full control of your device by providing a screen automation toolkit for controlling and interacting with graphical user interfaces. Good for automation, education, and having fun.
Main Features
- Screen capture and analysis
- Mouse control (clicking, positioning)
- Keyboard input (typing, key presses, hotkeys)
Available Tools
- Screen Capture: Take screenshots and get screen information
- Mouse Control: Move the mouse and perform clicks
- Keyboard Actions: Type text, press keys, and use hotkey combinations
- Scrolling: Scroll in different directions and to specific positions
- Element Detection: Check if elements exist on screen and wait for them to appear
- Action Sequences: Perform multiple actions in sequence
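To make the "Action Sequences" and "Element Detection" ideas above more concrete, here is a minimal sketch in plain Python. It is not ScreenPilot's actual API: the step format (a dict with an `action` key), the `run_sequence` and `wait_for` helpers, and the handler names are all hypothetical, and the handlers only record what they would do instead of touching the real mouse or keyboard.

```python
import time
from typing import Any, Callable, Dict, List

# Hypothetical step format: a dict with an "action" name plus its arguments.
Step = Dict[str, Any]


def wait_for(check: Callable[[], bool], timeout: float = 5.0, interval: float = 0.1) -> bool:
    """Poll `check` until it returns True or `timeout` seconds have elapsed."""
    deadline = time.monotonic() + timeout
    while True:
        if check():
            return True
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)


def run_sequence(steps: List[Step], handlers: Dict[str, Callable[..., Any]]) -> List[Any]:
    """Run steps in order; raise ValueError on the first unknown action name."""
    results = []
    for step in steps:
        args = dict(step)  # copy so the caller's dict is not mutated
        name = args.pop("action")
        if name not in handlers:
            raise ValueError(f"unknown action: {name!r}")
        results.append(handlers[name](**args))
    return results


# Stand-in handlers (assumptions for illustration): they log instead of acting.
log: List[str] = []
handlers = {
    "move": lambda x, y: log.append(f"move to ({x}, {y})"),
    "click": lambda button="left": log.append(f"{button} click"),
    "type_text": lambda text: log.append(f"type {text!r}"),
}

run_sequence(
    [
        {"action": "move", "x": 100, "y": 200},
        {"action": "click"},
        {"action": "type_text", "text": "hello"},
    ],
    handlers,
)
print(log)
```

Keeping each step as plain data makes a sequence easy for an LLM to emit as JSON and easy to validate before anything runs. In a real setup, `wait_for` would wrap whatever element check the server exposes, so a sequence can pause until the target appears on screen.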
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.