mcp-browser-use

610

The browser-use MCP server is an AI-driven browser automation system that leverages natural language for browser control and web research. It supports the Model Context Protocol and offers robust tooling for deep research, compatible with various LLM providers.

browser-use MCP Server & CLI

  • AI-driven browser automation server that uses the Model Context Protocol (MCP) for natural language browser control and web research.
  • Provides a CLI for access to key functionalities.

Features

  • Full MCP Integration for AI agent communication.
  • Browser automation with natural language tasks such as navigation and form filling.
  • Optional visual understanding through screenshots.
  • State Persistence and multi-LLM support.
  • Specialized tool for deep web research and report generation.
  • Configurable via environment variables with Pydantic model.

Quick Start

  1. Install UV Python installer.
  2. Get Playwright browsers required for automation.

MCP Tools

Synchronous Tools

  • run_browser_agent: Executes browser tasks with language instructions.
  • run_deep_research: Conducts detailed web research and generates reports.

CLI Usage

Commands like run-browser-agent and run-deep-research are accessible with options to set environment configurations.

Configuration

Configure the server via environment variables for LLM settings, browser, and server specifics. Detailed variable descriptions included.

License

MIT License.