An MCP server that lets AI agents control web browsers through the browser-use framework. Supports SSE or stdio transport, VNC streaming and async browser tasks.
https://github.com/co-browser/browser-use-mcp-serverStop wrestling with web scraping APIs that break every update or manually feeding screenshots to your AI. This MCP server puts a full browser directly in your AI agent's hands through the Model Context Protocol.
Your AI can analyze code, write documentation, and solve complex problems—but ask it to grab the latest pricing from a competitor's website or fill out a form, and you're back to writing brittle scraping scripts. Meanwhile, you're manually screenshotting pages and pasting them into chat windows like it's 2020.
Browser automation shouldn't require you to maintain separate Selenium scripts or pay for expensive proxy services that still get blocked.
This MCP server bridges the gap between AI reasoning and web interaction. Your AI agents get direct browser control through Playwright—the same automation framework you'd use for testing, but now accessible through natural language instructions.
Real browser automation, not API calls:
Seamless AI integration:
Competitive Research:
"Check pricing on three competitor SaaS products and compare their feature matrices"
Your AI navigates to each site, extracts current pricing, and builds a comparison table—without you writing a single scraping script.
QA and Testing:
"Test the checkout flow on staging, try different payment methods, and document any UX issues"
Perfect for validating user flows before releases, especially with the VNC streaming so you can watch the automation happen.
Data Collection:
"Gather contact information from the team pages of Y Combinator's latest batch"
Handles dynamic loading, infinite scroll, and all the complexity that breaks traditional scrapers.
Documentation and Screenshots:
"Create annotated screenshots of our new dashboard for the user guide"
Automate screenshot capture for documentation, with the AI understanding context and visual elements.
Add this to your MCP configuration and you're done:
{
"mcpServers": {
"browser-use-mcp-server": {
"url": "http://localhost:8000/sse"
}
}
}
The VNC streaming feature means you can actually watch your AI work—invaluable for debugging complex automation or just satisfying your curiosity about how it solves browser challenges.
Dual transport modes give you flexibility: SSE for simple setups, stdio for complex integrations. Docker support means consistent deployment across environments. The async architecture handles multiple browser tasks without blocking your main workflow.
With 587 stars and active development, this isn't an experimental side project—it's a production-ready tool that's already changing how developers think about AI-browser interaction.
Ready to stop manually screenshotting web pages for your AI? Your agents are about to get a lot more capable.