EdgeBox
A local sandbox that provides AI agents with code execution, filesystem access, and a full GUI desktop environment for 'computer use' capabilities. It exposes tools for shell commands, multi-language code interpretation, and remote desktop automation via the Model Context Protocol.
README
EdgeBox - The Local Desktop Sandbox for AI Agents
<div align="center"> <img src="assets/icon/icon.png" alt="EdgeBox Logo" width="128" height="128" /> <br/> <strong>A fully-featured, GUI-powered local LLM Agent sandbox with complete support for the MCP protocol.</strong> <br/> <p>Empower your Large Language Models (LLMs) with true "Computer Use" capabilities.</p> </div>
EdgeBox is a powerful desktop application that brings the cloud-based sandbox capabilities of E2B (e2b.dev) to your local machine. Based on the open-source E2B Code Interpreter project, EdgeBox transforms the sandbox into a locally-running environment, giving you full control over your AI agent's development and execution environment.
What makes EdgeBox unique: While most open-source sandbox projects only provide a terminal/CLI, EdgeBox offers both a command-line shell AND a full graphical (GUI) desktop environment via an integrated VNC viewer. This means your LLM Agent is no longer just a code executor—it's a digital worker that can operate browsers, use VS Code, and interact with desktop applications, just like a human.
<div align="center"> <img src="assets/screenshots/main-app.png" alt="EdgeBox Main Application" width="800" /> <p><em>The EdgeBox Main Application Dashboard</em></p> </div> <div align="center"> <img src="assets/screenshots/computer-use.gif" alt="Computer-use Demo" width="800" /> <p><em>Computer Use Demo: types "google.com", presses Enter, and captures screenshot - showing computer use capabilities.</em></p> </div>
🤔 Why Choose EdgeBox?
| Feature | EdgeBox | Other OSS Sandboxes (e.g., codebox) |
|---|---|---|
| Environment | 🖥️ Local | 🖥️ Local |
| Interface | GUI + CLI | CLI-Only |
| Capability | Computer Use & Code Interpreter | Code Interpreter |
| Data Privacy | ✅ 100% Private | ✅ 100% Private |
| Latency | ⚡️ Near-Zero | ⚡️ Near-Zero |
| Integration | ✅ MCP Compliant | Proprietary API |
📖 Table of Contents
- EdgeBox - The Local Desktop Sandbox for AI Agents
🚀 Core Features
EdgeBox exposes all its capabilities through the MCP protocol, organized into three core modules for your LLM Agent.
1. 💻 Full Desktop Environment (Computer Use)
- VNC Remote Desktop: Access a complete, interactive Ubuntu desktop environment.
- Pre-installed Applications: Comes with Google Chrome, VS Code, and other essential tools out of the box.
- GUI Automation: Your agent can programmatically control the mouse and keyboard to interact with any desktop application.
- Visual Perception: Built-in screenshot capabilities provide visual context to the agent, enabling it to "see" and react to the GUI.
<div align="center"> <img src="assets/screenshots/vnc.gif" alt="VNC Desktop Environment Demo" width="800" /> <p><em>An interactive VNC session with VS Code and a browser.</em></p> </div>
2. 🐚 Complete Code Interpreter & Shell
- Secure Code Execution: Safely run AI-generated code in an isolated Docker container.
- Full Shell Access: A fully-featured
bashterminal allows the execution of any Linux command. - Isolated Filesystem: Each session gets a separate filesystem with full support for creating, reading, writing, and deleting files.
- Multi-language Support: Native support for Python, JavaScript (Node.js), and other runtimes.
3. 🔗 Seamless LLM Agent Integration (via MCP)
- Standardized Protocol: All sandbox features are exposed via the MCP (Model Context Protocol) HTTP interface.
- Broad Client Compatibility: Easily connect to any LLM client that supports MCP, such as Claude Desktop, OpenWebUI, LobeChat, and more.
- Multi-Session Management: Create and manage multiple, isolated sandbox sessions concurrently using the
x-session-idheader.
🛠️ Available MCP Tools
EdgeBox exposes its capabilities through MCP tools, organized into two categories:
📟 Core Tools (CLI Mode - Always Available)
Code Execution Tools - Execute code in various languages:
execute_python- Execute Python code in isolated environmentexecute_typescript- Execute TypeScript/JavaScript codeexecute_r- Execute R code for statistical analysisexecute_java- Execute Java codeexecute_bash- Execute Bash scripts
Shell Commands - Interact with the Linux environment:
shell_run- Run shell commands (stateful, persistent environment)shell_run_background- Run commands in background with process management
Filesystem Operations - Manage files and directories:
fs_list- List files in directoriesfs_read- Read file contentsfs_write- Write content to filesfs_info- Get file metadata and informationfs_watch- Monitor directory changes in real-time
🖱️ Desktop Tools (GUI Mode - When GUI Tools Enabled)
Mouse Controls - Programmatic mouse interaction:
desktop_mouse_click- Perform mouse clicks (left/right/middle)desktop_mouse_double_click- Double-click actionsdesktop_mouse_move- Move cursor to coordinatesdesktop_mouse_scroll- Scroll up/down with configurable amountdesktop_mouse_drag- Drag from one position to another
Keyboard Controls - Text input and key combinations:
desktop_keyboard_type- Type text with clipboard support for non-ASCIIdesktop_keyboard_press- Press specific keys (Return, Escape, Tab, etc.)desktop_keyboard_combo- Execute key combinations (Ctrl+C, Alt+Tab, etc.)
Window Management - Control desktop applications:
desktop_get_windows- List all windows with titles and IDsdesktop_switch_window- Focus specific windowsdesktop_maximize_window- Maximize windowsdesktop_minimize_window- Minimize windowsdesktop_resize_window- Resize windows to specific dimensions
Visual & Application Control:
desktop_screenshot- Capture desktop screenshots (PNG format)desktop_launch_app- Launch applications by namedesktop_wait- Add delays between actions
Note: Desktop tools are only available when GUI Tools are enabled in EdgeBox settings. Core tools are always available regardless of GUI settings.
🏗️ Architecture
EdgeBox is designed to provide a seamless and powerful local execution environment for LLM agents.
[LLM Agent (Claude, GPT, etc.)] <- MCP (HTTP Stream) -> [EdgeBox App] <- Docker API -> [Isolated Sandbox Container (Desktop + Shell)]
- Frontend: Electron + React + TypeScript + Tailwind CSS
- Backend: Node.js + Dockerode (for Docker API)
- Containerization: Docker
- UI Components: Radix UI
📋 Prerequisites
- Docker Desktop: Must be installed and running.
🛠️ Installation
-
Download EdgeBox Download the latest release for your platform from the Releases page.
-
Install & Run Docker Desktop Ensure Docker Desktop is installed and running before starting EdgeBox.
-
Run EdgeBox
- Windows: Run
EdgeBox.exe - macOS: Open
EdgeBox.app - Linux: Run the AppImage or install the
.deb/.rpmpackage.
- Windows: Run
🎯 Usage
Quick Start
- Launch EdgeBox and ensure Docker is running.
- Check the dashboard to verify all components (Docker, MCP Server) are healthy.
- Add the EdgeBox MCP configuration to your LLM client.
MCP Client Configuration
Add EdgeBox to your LLM client with this configuration:
{
"mcpServers": {
"edgebox": {
"url": "http://localhost:8888/mcp"
}
}
}
Instructing Your LLM Agent
Once configured, you can give your LLM agent natural language instructions like:
- Code Execution: "Write a Python script to analyze this CSV file and show me the output."
- File Operations: "Create a new folder called 'project', and inside it, create a file named
main.py." - Computer Use: "Open the browser, navigate to 'github.com', search for 'EdgeBox', and then take a screenshot for me."
Multi-Session Concurrent Sandboxes
Easily manage multiple isolated environments by specifying an x-session-id in your MCP request headers.
Example configuration for different tasks:
{
"mcpServers": {
"edgebox-default": {
"url": "http://localhost:8888/mcp"
},
"edgebox-data-analysis": {
"url": "http://localhost:8888/mcp",
"headers": {
"x-session-id": "data-analysis"
}
},
"edgebox-web-scraping": {
"url": "http://localhost:8888/mcp",
"headers": {
"x-session-id": "web-scraping"
}
}
}
}
🔐 Security
- Container Isolation: Every sandbox session runs in a separate Docker container.
- Resource Limits: Configurable CPU and memory constraints prevent resource abuse.
- Network Isolation: Container networking is controlled to protect the host machine.
📄 License
See the LICENSE file for details.
🙏 Acknowledgments
- E2B Team: For creating the fantastic open-source E2B Code Interpreter project that inspired EdgeBox.
- Docker: For the powerful containerization technology.
- Electron: For making cross-platform desktop apps possible.
🔗 Related Projects
- E2B Code Interpreter - The original project that served as our foundation.
- FastMCP - An implementation of the Model Context Protocol (MCP).
📞 Support
- Issues: Report bugs and feature requests on GitHub Issues.
- Discussions: Join the conversation in GitHub Discussions.
<!-- end list -->
推荐服务器
Baidu Map
百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。
Playwright MCP Server
一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。
Magic Component Platform (MCP)
一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。
Audiense Insights MCP Server
通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。
VeyraX
一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。
graphlit-mcp-server
模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。
Kagi MCP Server
一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。
e2b-mcp-server
使用 MCP 通过 e2b 运行代码。
Neon MCP Server
用于与 Neon 管理 API 和数据库交互的 MCP 服务器
Exa MCP Server
模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。