Browserbeam MCP Server

Browserbeam MCP Server

Enables real browser automation as tools in Cursor, Claude Desktop, Windsurf, and any MCP-compatible client, allowing AI agents to interact with web pages through natural language.

Category
访问服务器

README

Browserbeam MCP Server

MCP (Model Context Protocol) server for Browserbeam — use real browser automation as tools in Cursor, Claude Desktop, Windsurf, and any MCP-compatible client.

Setup

Cursor

Add to ~/.cursor/mcp.json:

{
  "mcpServers": {
    "browserbeam": {
      "command": "npx",
      "args": ["-y", "@browserbeam/mcp-server"],
      "env": {
        "BROWSERBEAM_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "browserbeam": {
      "command": "npx",
      "args": ["-y", "@browserbeam/mcp-server"],
      "env": {
        "BROWSERBEAM_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Windsurf

Add to ~/.codeium/windsurf/mcp_config.json:

{
  "mcpServers": {
    "browserbeam": {
      "command": "npx",
      "args": ["-y", "@browserbeam/mcp-server"],
      "env": {
        "BROWSERBEAM_API_KEY": "sk_live_your_key_here"
      }
    }
  }
}

Available Tools

Tool Description
browserbeam_create_session Create a browser session, optionally navigate to a URL
browserbeam_navigate Navigate to a new URL in an existing session
browserbeam_observe Get page content as markdown or HTML with interactive element refs. Supports mode: "full" for all sections and include_page_map for a structural map
browserbeam_click Click an element by ref, text, or label
browserbeam_fill Fill form fields or an entire form at once
browserbeam_type Type text character-by-character with real keyboard events
browserbeam_select Select an option from a dropdown
browserbeam_check Check or uncheck a checkbox or radio button
browserbeam_scroll Scroll the page or scroll an element into view
browserbeam_scroll_collect Scroll the entire page to load lazy content, then observe
browserbeam_wait Wait for a selector, text, JS expression, or fixed delay
browserbeam_extract Extract structured data using a declarative schema
browserbeam_execute_js Run custom JavaScript in the browser page context
browserbeam_screenshot Take a screenshot of the current page
browserbeam_pdf Generate a PDF of the current page
browserbeam_upload Upload files to a file input element
browserbeam_list_sessions List your sessions; optional status filter: active, closed, or failed
browserbeam_get_session Get the status, duration, and error fields (e.g. error_code / error_message when status is failed) for a session
browserbeam_close Close a session and release resources

Page Map & Full Mode

The first observe in every session auto-includes a page map — a lightweight outline of page sections (nav, header, main, aside, footer) with CSS selectors and content hints. This lets agents discover what's on the page beyond the main content area without spending tokens.

To get content from all page sections instead of just the main area, use mode: "full":

{
  "tool": "browserbeam_observe",
  "params": {
    "session_id": "ses_abc123",
    "mode": "full",
    "max_text_length": 20000
  }
}

The response organizes content by section:

## [nav]
Home | Products | About | Contact

## [main]
# Welcome to Our Site
...main content...

## [aside]
Related links, sidebar widgets...

## [footer]
© 2026 Company | Privacy | Terms

Use include_page_map: true to re-request the page map on subsequent observations.

Proxies

All sessions use a datacenter proxy by default (country auto-detected from the URL's TLD). To customize, pass proxy_kind and/or proxy_country to browserbeam_create_session:

{
  "tool": "browserbeam_create_session",
  "params": {
    "url": "https://example.com",
    "proxy_kind": "residential",
    "proxy_country": "us"
  }
}

AI-Powered Selectors

Use the ai >> prefix in extract schemas to describe fields in plain English. The engine resolves them to CSS selectors via AI and caches the result:

{
  "tool": "browserbeam_extract",
  "params": {
    "session_id": "ses_abc123",
    "schema": {
      "_parent": "article.product_pod",
      "name": "ai >> the product title",
      "price": "ai >> the price including currency symbol"
    }
  }
}

Agent guidelines (for AI clients)

  • Close sessions: Agents should call browserbeam_close when finished with a session so resources are released and credit consumption stops. Only keep a session open if the user explicitly needs continued work on the same browser.
  • Page discovery: The first observe auto-includes a map. Check it before using mode: "full" — if the info you need is in the main content, default mode is more token-efficient.
  • Full mode: Use mode: "full" when you need sidebar content, footer links, or navigation items that aren't in the main area. Default max_text_length for full mode is 20,000 characters.
  • Truncation: Page markdown is capped by default at 12,000 characters (browserbeam_observe and the page payload from browserbeam_create_session / browserbeam_navigate). If output is truncated, use browserbeam_observe with a higher max_text_length or browserbeam_scroll_collect (default 100,000 characters) for long or lazy-loaded pages.

How It Works

The MCP server translates tool calls into Browserbeam API requests. Your AI agent sees structured page data (markdown content, interactive element refs, change diffs) instead of raw HTML.

AI Agent → MCP Tool Call → Browserbeam API → Real Browser → Structured Response → AI Agent

Environment Variables

Variable Required Description
BROWSERBEAM_API_KEY Yes Your Browserbeam API key (sk_live_...)
BROWSERBEAM_BASE_URL No API base URL (default: https://api.browserbeam.com)

Get an API Key

Sign up at browserbeam.com — 5,000 free credits, no credit card required.

License

MIT

推荐服务器

Baidu Map

Baidu Map

百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。

官方
精选
JavaScript
Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。

官方
精选
本地
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。

官方
精选
本地
TypeScript
VeyraX

VeyraX

一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。

官方
精选
本地
graphlit-mcp-server

graphlit-mcp-server

模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。

官方
精选
TypeScript
Kagi MCP Server

Kagi MCP Server

一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。

官方
精选
Python
e2b-mcp-server

e2b-mcp-server

使用 MCP 通过 e2b 运行代码。

官方
精选
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
Exa MCP Server

Exa MCP Server

模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。

官方
精选