MCP Macaco Playwright
Enables comprehensive browser automation and web interaction through Playwright with 50+ specialized functions for navigation, form filling, data extraction, and Chrome DevTools Protocol support. Designed specifically for AI agents to perform complex web workflows including scraping, testing, and automated browsing tasks.
README
MCP Macaco Playwright
Enhanced Playwright Tools for Model Context Protocol (MCP) with Chrome DevTools Protocol (CDP) Support
Overview
MCP Macaco Playwright is a comprehensive browser automation server that provides AI agents with powerful web interaction capabilities through the Model Context Protocol. It combines Playwright's robust browser automation with CDP integration for advanced debugging and control scenarios.
Features
- Complete Browser Automation: Navigate, interact, and extract data from web pages
- Chrome DevTools Protocol (CDP) Support: Connect to existing browser instances
- AI-Optimized: Designed specifically for AI agents and automated workflows
- Comprehensive Tool Set: 50+ specialized functions for web automation
- Multi-Browser Support: Chrome, Firefox, Safari, and Edge
- Screenshot & Snapshot Capabilities: Visual and accessibility-based page capture
- Form Automation: Complete form filling and submission workflows
- Network Monitoring: Track requests, responses, and console messages
Installation
npm install mcp-macaco-playwright
Quick Start
import { createConnection } from 'mcp-macaco-playwright';
// Create MCP server connection
const server = await createConnection();
// Use with MCP client
await client.callTool({
name: 'browser_navigate',
arguments: { url: 'https://example.com' }
});
Function Reference
Navigation Functions
browser_navigate
Navigate to a specific URL.
Parameters:
url(string, required): The URL to navigate to
Example:
await client.callTool({
name: 'browser_navigate',
arguments: { url: 'https://github.com' }
});
browser_navigate_back
Go back to the previous page in browser history.
Parameters: None
Example:
await client.callTool({
name: 'browser_navigate_back',
arguments: {}
});
browser_navigate_forward
Go forward to the next page in browser history.
Parameters: None
Example:
await client.callTool({
name: 'browser_navigate_forward',
arguments: {}
});
Page Analysis Functions
browser_snapshot
Capture an accessibility snapshot of the current page for analysis and interaction.
Parameters: None
Example:
await client.callTool({
name: 'browser_snapshot',
arguments: {}
});
browser_take_screenshot
Take a visual screenshot of the page or specific element.
Parameters:
type(string, optional): Image format ('png' or 'jpeg', default: 'png')filename(string, optional): Custom filename for the screenshotelement(string, optional): Human-readable element descriptionref(string, optional): Element reference from snapshotfullPage(boolean, optional): Capture full scrollable page
Example:
await client.callTool({
name: 'browser_take_screenshot',
arguments: {
type: 'png',
fullPage: true,
filename: 'homepage.png'
}
});
Element Interaction Functions
browser_click
Click on a specific element on the page.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_click',
arguments: {
element: 'Sign in button',
ref: 'button-signin-123'
}
});
browser_double_click
Perform a double-click on an element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_double_click',
arguments: {
element: 'File icon',
ref: 'file-icon-456'
}
});
browser_right_click
Perform a right-click to open context menu.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_right_click',
arguments: {
element: 'Image thumbnail',
ref: 'img-thumb-789'
}
});
Text Input Functions
browser_type
Type text into an editable element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshottext(string, required): Text to typesubmit(boolean, optional): Press Enter after typingslowly(boolean, optional): Type character by character
Example:
await client.callTool({
name: 'browser_type',
arguments: {
element: 'Search input field',
ref: 'search-input-123',
text: 'playwright automation',
submit: true
}
});
browser_press_key
Press a specific key on the keyboard.
Parameters:
key(string, required): Key name (e.g., 'Enter', 'ArrowLeft', 'a')
Example:
await client.callTool({
name: 'browser_press_key',
arguments: { key: 'Escape' }
});
Form Functions
browser_select_option
Select options in a dropdown menu.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotvalues(array, required): Array of values to select
Example:
await client.callTool({
name: 'browser_select_option',
arguments: {
element: 'Country dropdown',
ref: 'country-select-456',
values: ['United States']
}
});
browser_check_checkbox
Check or uncheck a checkbox element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotchecked(boolean, required): Whether to check (true) or uncheck (false)
Example:
await client.callTool({
name: 'browser_check_checkbox',
arguments: {
element: 'Terms and conditions checkbox',
ref: 'terms-checkbox-789',
checked: true
}
});
browser_select_radio
Select a radio button.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_select_radio',
arguments: {
element: 'Payment method: Credit Card',
ref: 'payment-radio-cc'
}
});
browser_clear_input
Clear the content of an input field.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_clear_input',
arguments: {
element: 'Email input field',
ref: 'email-input-123'
}
});
Data Extraction Functions
browser_get_text
Extract text content or attribute values from elements.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotattribute(string, optional): Specific attribute to extract (e.g., 'href', 'src')
Example:
await client.callTool({
name: 'browser_get_text',
arguments: {
element: 'Product price',
ref: 'price-display-456'
}
});
browser_get_elements
Get multiple elements matching a selector.
Parameters:
selector(string, required): CSS selector to find elementsattribute(string, optional): Attribute to extract from each element
Example:
await client.callTool({
name: 'browser_get_elements',
arguments: {
selector: '.product-card h3',
attribute: 'textContent'
}
});
Scrolling and Focus Functions
browser_scroll_to
Scroll to a specific element or coordinate position.
Parameters:
element(string, optional): Human-readable element descriptionref(string, optional): Element reference from page snapshotx(number, optional): X coordinate to scroll toy(number, optional): Y coordinate to scroll tobehavior(string, optional): Scroll behavior ('auto' or 'smooth')
Example:
await client.callTool({
name: 'browser_scroll_to',
arguments: {
element: 'Footer section',
ref: 'footer-section-123',
behavior: 'smooth'
}
});
browser_get_scroll_position
Get the current scroll position of the page.
Parameters: None
Example:
await client.callTool({
name: 'browser_get_scroll_position',
arguments: {}
});
browser_focus_element
Set focus on a specific element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_focus_element',
arguments: {
element: 'Search input',
ref: 'search-input-456'
}
});
browser_blur_element
Remove focus from a specific element.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshot
Example:
await client.callTool({
name: 'browser_blur_element',
arguments: {
element: 'Email input',
ref: 'email-input-789'
}
});
Wait Functions
browser_wait_for
Wait for specific conditions to be met.
Parameters:
time(number, optional): Time to wait in secondstext(string, optional): Text to wait for to appeartextGone(string, optional): Text to wait for to disappear
Example:
await client.callTool({
name: 'browser_wait_for',
arguments: {
text: 'Loading complete',
time: 5
}
});
Tab Management Functions
browser_tab_list
List all open browser tabs.
Parameters: None
Example:
await client.callTool({
name: 'browser_tab_list',
arguments: {}
});
browser_tab_new
Open a new browser tab.
Parameters:
url(string, optional): URL to navigate to in the new tab
Example:
await client.callTool({
name: 'browser_tab_new',
arguments: { url: 'https://example.com' }
});
browser_tab_close
Close a browser tab.
Parameters:
index(number, optional): Index of tab to close (closes current if not specified)
Example:
await client.callTool({
name: 'browser_tab_close',
arguments: { index: 1 }
});
browser_tab_select
Switch to a specific tab by index.
Parameters:
index(number, required): Index of the tab to select
Example:
await client.callTool({
name: 'browser_tab_select',
arguments: { index: 0 }
});
Network and Console Functions
browser_network_requests
Get all network requests made since page load.
Parameters: None
Example:
await client.callTool({
name: 'browser_network_requests',
arguments: {}
});
browser_console_messages
Get all console messages from the page.
Parameters: None
Example:
await client.callTool({
name: 'browser_console_messages',
arguments: {}
});
Chrome DevTools Protocol (CDP) Functions
browser_connect_cdp
Connect to an existing browser instance via CDP.
Parameters:
endpoint(string, required): CDP endpoint URLtimeout(number, optional): Connection timeout in milliseconds (default: 30000)
Example:
await client.callTool({
name: 'browser_connect_cdp',
arguments: {
endpoint: 'ws://localhost:9222',
timeout: 30000
}
});
browser_get_cdp_endpoints
Discover available CDP endpoints from running browsers.
Parameters:
port(number, optional): CDP port to check (default: 9222)host(string, optional): Host to check (default: 'localhost')
Example:
await client.callTool({
name: 'browser_get_cdp_endpoints',
arguments: {
port: 9222,
host: 'localhost'
}
});
browser_disconnect_cdp
Disconnect from the current CDP connection.
Parameters: None
Example:
await client.callTool({
name: 'browser_disconnect_cdp',
arguments: {}
});
JavaScript Evaluation Functions
browser_evaluate
Execute JavaScript code in the browser context.
Parameters:
script(string, required): JavaScript code to execute
Example:
await client.callTool({
name: 'browser_evaluate',
arguments: {
script: 'document.title'
}
});
Dialog Handling Functions
browser_handle_dialog
Handle browser dialogs (alert, confirm, prompt).
Parameters:
action(string, required): Action to take ('accept' or 'dismiss')text(string, optional): Text to enter for prompt dialogs
Example:
await client.callTool({
name: 'browser_handle_dialog',
arguments: {
action: 'accept',
text: 'User input'
}
});
File Functions
browser_upload_file
Upload files to file input elements.
Parameters:
element(string, required): Human-readable element descriptionref(string, required): Element reference from page snapshotfiles(array, required): Array of file paths to upload
Example:
await client.callTool({
name: 'browser_upload_file',
arguments: {
element: 'File upload input',
ref: 'file-input-123',
files: ['/path/to/document.pdf']
}
});
PDF Functions
browser_save_pdf
Save the current page as a PDF.
Parameters:
filename(string, optional): Custom filename for the PDFformat(string, optional): Page format (e.g., 'A4', 'Letter')landscape(boolean, optional): Use landscape orientation
Example:
await client.callTool({
name: 'browser_save_pdf',
arguments: {
filename: 'report.pdf',
format: 'A4',
landscape: false
}
});
Configuration
The MCP server can be configured with various options:
const config = {
browser: {
headless: false,
viewport: { width: 1280, height: 720 },
cdpEndpoint: 'ws://localhost:9222' // Optional CDP connection
},
capabilities: ['core', 'vision'] // Enable specific tool capabilities
};
const server = await createConnection(config);
Common Usage Patterns
Web Scraping Workflow
// Navigate to page
await client.callTool({
name: 'browser_navigate',
arguments: { url: 'https://example.com' }
});
// Take snapshot to analyze page structure
await client.callTool({
name: 'browser_snapshot',
arguments: {}
});
// Extract data from elements
await client.callTool({
name: 'browser_get_text',
arguments: {
element: 'Product title',
ref: 'product-title-123'
}
});
Form Automation Workflow
// Navigate to form page
await client.callTool({
name: 'browser_navigate',
arguments: { url: 'https://example.com/contact' }
});
// Fill form fields
await client.callTool({
name: 'browser_type',
arguments: {
element: 'Name input',
ref: 'name-input-123',
text: 'John Doe'
}
});
// Submit form
await client.callTool({
name: 'browser_click',
arguments: {
element: 'Submit button',
ref: 'submit-btn-456'
}
});
CDP Integration Workflow
// Connect to existing browser
await client.callTool({
name: 'browser_connect_cdp',
arguments: { endpoint: 'ws://localhost:9222' }
});
// Work with existing tabs
await client.callTool({
name: 'browser_tab_list',
arguments: {}
});
// Disconnect when done
await client.callTool({
name: 'browser_disconnect_cdp',
arguments: {}
});
Troubleshooting
CDP Connection Issues
If you encounter CDP connection problems:
-
Start Chrome with debugging enabled:
google-chrome --remote-debugging-port=9222 --remote-debugging-address=0.0.0.0 -
Verify the endpoint is accessible:
curl http://localhost:9222/json/version -
Check for firewall or network restrictions
Browser Launch Issues
- Ensure Playwright browsers are installed:
npx playwright install - Check system dependencies:
npx playwright install-deps - Verify sufficient system resources for browser instances
License
Apache License 2.0 - see LICENSE file for details.
Contributing
Contributions are welcome! Please see the contributing guidelines for more information.
Support
For issues and questions:
- GitHub Issues: https://github.com/macacoai/mcp-playwright/issues
- Email: gaston@macaco.ai
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。