screen-capture-mcp

screen-capture-mcp

An MCP server that enables Claude Code and other MCP clients to take screenshots of the full screen or specific windows, with support for repeated captures over intervals.

Category
访问服务器

README

screen-capture-mcp

npm version License: MIT

A quick and easy MCP server that gives Claude Code (or any MCP client) the ability to take screenshots of your screen or specific programs over any specified interval (e.g. "take a screenshot of Unreal Engine every 5 seconds for the next minute".) Useful when working on games, GUIs, or anything visual where Claude needs to see what you see.

Windows only — uses PowerShell and .NET for screen capture.

Why?

When you're working with Claude Code on something visual — a game, a UI, a 3D editor — Claude is blind. It can read your code but has no idea what the result actually looks like. You end up manually screenshotting, dragging images into the chat, and explaining what you're looking at.

This MCP server fixes that. Once installed, Claude can take screenshots on its own whenever it needs to see what's happening on screen.

Features

  • Full screen capture — captures your primary display
  • Window capture — capture a specific window by title (partial match)
  • Auto-resize — images are resized to 1280px wide to save tokens
  • Zero native dependencies — uses PowerShell/.NET built into Windows, no native compilation needed

Installation

Via npm (recommended)

npm install -g screen-capture-mcp

Then register it with Claude Code:

claude mcp add -s user -t stdio screen-capture-mcp -- screen-capture-mcp

Or add it manually to your ~/.claude.json:

{
  "mcpServers": {
    "screen-capture-mcp": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "screen-capture-mcp"]
    }
  }
}

From source

git clone https://github.com/kmoulder/screen-capture-mcp.git
cd screen-capture-mcp
npm install
npm run build
claude mcp add -s user -t stdio screen-capture-mcp -- node /path/to/screen-capture-mcp/dist/index.js

Restart Claude Code after registering.

Usage

Once registered, Claude Code can call the take_screenshot tool. You can ask for screenshots naturally:

"Take a screenshot"
"Take a screenshot of the Godot window"
"Show me what the game looks like right now"

Monitoring Over Time

Because Claude can call the tool repeatedly, you can ask it to watch your screen over time:

"Take a screenshot every 5 seconds for the next minute and describe what changes"
"Wait 30 seconds then take a screenshot"
"Watch the Unity window and let me know when the build finishes"
"Capture the game window every 10 seconds while I playtest — give me feedback on the UI"

This is especially useful for:

  • Playtesting feedback — run your game and get real-time observations from Claude as you play
  • Build monitoring — have Claude watch a long-running build or deployment and notify you when it's done
  • UI iteration — make changes in an editor and have Claude compare screenshots to track progress
  • Bug reproduction — ask Claude to capture screenshots while you reproduce a bug, then analyze the sequence

Bringing Visual Context Into Coding Tasks

You can also mix screenshots into normal development work:

"Look at the game window and then fix the player sprite so it faces the right direction"
"Take a screenshot of the editor, then update the CSS to match the mockup I have open"
"Check what the app looks like in the browser and fix any layout issues you see"

This closes the loop between writing code and seeing results — Claude can make a change, screenshot the result, and iterate.

Tool Schema

take_screenshot(window_title?: string)
Parameter Type Description
window_title string (optional) Window title to capture (partial match). Omit for full screen.

Returns a base64-encoded PNG image.

Privacy

Screenshots are captured and processed entirely on your local machine. Nothing is uploaded, saved to disk, or sent to any external service by this tool.

The captured image is:

  1. Taken locally via PowerShell/.NET
  2. Resized in-memory using sharp
  3. Passed directly to Claude Code via the MCP protocol as base64

No screenshots are written to your filesystem — they exist only in memory for the duration of the MCP tool call. The image data is sent to the Claude API as part of your conversation (the same as if you had dragged a screenshot into the chat yourself), but it is never stored or logged by this server.

If you want to verify this, the entire server is a single file — src/index.ts.

How It Works

  • Uses PowerShell with System.Drawing and System.Windows.Forms to capture the screen
  • For window-specific capture, uses user32.dll GetWindowRect via P/Invoke to find and capture the target window
  • Images are resized to 1280px wide using sharp before being returned to keep token usage reasonable
  • No temp files or disk writes — everything happens in memory

Requirements

  • Windows 10/11
  • Node.js 18+
  • PowerShell (included with Windows)

License

MIT

推荐服务器

Baidu Map

Baidu Map

百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。

官方
精选
JavaScript
Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。

官方
精选
本地
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。

官方
精选
本地
TypeScript
VeyraX

VeyraX

一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。

官方
精选
本地
graphlit-mcp-server

graphlit-mcp-server

模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。

官方
精选
TypeScript
Kagi MCP Server

Kagi MCP Server

一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。

官方
精选
Python
e2b-mcp-server

e2b-mcp-server

使用 MCP 通过 e2b 运行代码。

官方
精选
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
Exa MCP Server

Exa MCP Server

模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。

官方
精选