mcp-operator
2.7
The MCP Browser Operator is an MCP server specialized in automating web browsers. It enables management and interaction with web content through JSON-RPC, with features like web page auditing and persistent browser states for seamless session handling.
MCP Browser Operator
A Model Control Protocol (MCP) server designed for browser automation, enabling the control of a web browser and interaction with web pages via a standardized JSON-RPC interface.
Features
- Browser Management: Handle browser instances with create, navigate, operate, and close functions.
- Job Management: Track browser operations using job IDs.
- Web Interaction: Use OpenAI's Computer Use API for interactive tasks.
- Browser Tools: Access console logs, network activity, and perform page audits for accessibility and performance.
- Auditing: Conduct audits for SEO and other aspects.
Requirements
- Python 3.11+
- Playwright
- OpenAI API key
Installation
- Clone the repository and install dependencies and browsers.
- Set up OpenAI API key.
Usage
- Start the server which listens for JSON-RPC requests and manages jobs asynchronously.
Core Methods
- Browser Management: Create, navigate, operate, and close browser instances.
- Job Management: Get job status and list jobs.
- User Notes: Add notes related to browser operations.
Asynchronous Workflow Pattern
Relies on job-based asynchronous operations to avoid client timeouts.
Persistent Browser State
Maintains state like cookies and local storage between sessions, supporting multiple independent browser states.