mcp-browser-use
The browser-use MCP server is an AI-driven browser automation server that accepts natural-language instructions for browser control and web research. It implements the Model Context Protocol (MCP), includes a dedicated tool for deep research, and works with multiple LLM providers.
browser-use MCP Server & CLI
- AI-driven browser automation server that uses the Model Context Protocol (MCP) for natural language browser control and web research.
- Provides a CLI for direct access to the key functions, including run-browser-agent and run-deep-research.
Features
- Full MCP Integration for AI agent communication.
- Browser automation with natural language tasks such as navigation and form filling.
- Optional visual understanding through screenshots.
- State persistence and support for multiple LLM providers.
- Specialized tool for deep web research and report generation.
- Configurable via environment variables, backed by a Pydantic model.
Quick Start
- Install uv, the Python package installer.
- Install the Playwright browsers required for automation.
MCP Tools
Synchronous Tools
- run_browser_agent: Executes browser tasks from natural-language instructions.
- run_deep_research: Conducts detailed web research and generates reports.
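As a rough illustration of how a client might invoke these tools, the sketch below builds the JSON-RPC 2.0 `tools/call` request that MCP defines for tool invocation. The tool names come from the list above. The argument names `task` and `research_task` are assumptions made for illustration and should be checked against the tool schemas the server actually advertises. The helper `build_tool_call` is hypothetical and not part of the project.

```python
import json
from itertools import count

# Monotonic request ids, so every message gets a unique JSON-RPC id.
_request_ids = count(1)


def build_tool_call(tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` request as a JSON-RPC 2.0 message.

    Hypothetical helper for illustration; not part of the project.
    """
    request = {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }
    return json.dumps(request)


# "task" is an ASSUMED argument name for run_browser_agent.
browser_request = build_tool_call(
    "run_browser_agent",
    {"task": "Open example.com and report the page title."},
)

# "research_task" is an ASSUMED argument name for run_deep_research.
research_request = build_tool_call(
    "run_deep_research",
    {"research_task": "Summarize recent approaches to browser automation."},
)

print(browser_request)
print(research_request)
```

In practice an MCP client library would send these messages over the server's transport. Building them by hand only shows the shape of a tool call.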
CLI Usage
Commands like run-browser-agent and run-deep-research are accessible with options to set environment configurations.
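One way to drive the CLI from a script is sketched below. It rests on several assumptions: the executable name `mcp-browser-cli` is a guess, the task is assumed to be a positional argument, and configuration is passed through the process environment instead of any specific CLI option. Only the two subcommand names come from the text above, so check the CLI's help output before relying on this.

```python
import os
import subprocess

# ASSUMPTION: the executable name is not given in this document.
CLI_EXECUTABLE = "mcp-browser-cli"

# Subcommand names taken from the CLI Usage section.
KNOWN_COMMANDS = {"run-browser-agent", "run-deep-research"}


def build_cli_command(command: str, task: str) -> list[str]:
    """Build the argument list for one CLI invocation.

    ASSUMPTION: the task is passed as a single positional argument.
    """
    if command not in KNOWN_COMMANDS:
        raise ValueError(f"unknown command: {command!r}")
    return [CLI_EXECUTABLE, command, task]


def run_cli(command: str, task: str, overrides: dict[str, str]) -> subprocess.CompletedProcess:
    """Run the CLI with extra environment variables layered over the current ones."""
    env = {**os.environ, **overrides}
    return subprocess.run(
        build_cli_command(command, task),
        env=env,
        capture_output=True,
        text=True,
        check=False,  # let the caller inspect returncode and stderr
    )


# Only builds the command; call run_cli(...) to actually execute it.
print(build_cli_command("run-browser-agent", "Open example.com and report the page title."))
```

Passing the arguments as a list avoids shell quoting problems when the task text contains spaces or punctuation.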
Configuration
Configure the server through environment variables covering LLM, browser, and server settings. Detailed descriptions of each variable are included.
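The pattern of loading typed settings from environment variables can be sketched as follows. The project is described as using a Pydantic model. This sketch swaps in a standard-library dataclass to stay dependency-free. Every variable name (`EXAMPLE_LLM_PROVIDER`, `EXAMPLE_LLM_MODEL`, `EXAMPLE_BROWSER_HEADLESS`) and every default value is hypothetical, chosen only to illustrate the three groups of settings. The real variable names are listed in the project's own documentation.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


def _as_bool(value: str) -> bool:
    """Interpret common truthy strings from environment variables."""
    return value.strip().lower() in {"1", "true", "yes", "on"}


@dataclass(frozen=True)
class ServerSettings:
    """Hypothetical settings object; field and variable names are illustrative."""

    llm_provider: str
    llm_model: str
    browser_headless: bool

    @classmethod
    def from_env(cls, env: Optional[Mapping[str, str]] = None) -> "ServerSettings":
        # Default to the real process environment; a plain dict can be
        # passed instead, which keeps the loader easy to test.
        env = os.environ if env is None else env
        return cls(
            llm_provider=env.get("EXAMPLE_LLM_PROVIDER", "example-provider"),
            llm_model=env.get("EXAMPLE_LLM_MODEL", "example-model"),
            browser_headless=_as_bool(env.get("EXAMPLE_BROWSER_HEADLESS", "false")),
        )


settings = ServerSettings.from_env({"EXAMPLE_BROWSER_HEADLESS": "true"})
print(settings)
```

A Pydantic-based version would add validation and type coercion automatically. The dataclass version shows only the mapping from variables to typed fields.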
License
MIT License.