WinWright

WinWright

civyk-winwright is a Playwright-style MCP server for Windows desktop and browser automation. It provides 59 tools for controlling WPF, WinForms, Win32 apps, and Chrome/Edge via the Model Context Protocol. A unified automation layer connecting Claude and MCP-compatible agents to native Windows UI and web interfaces.

Category
访问服务器

README

WinWright

GitHub Release License Platform MCP

Windows automation server for the Model Context Protocol. ~59 consolidated tools for desktop (WPF, WinForms, Win32), browser (Chrome/Edge via CDP), and system management — all accessible to AI agents over MCP.

Describe tests in plain English — the AI agent does the rest

WinWright Demo

You write test cases in plain English. The AI agent uses WinWright's MCP tools to discover UI controls, perform actions, and record everything as a portable JSON script.

Replay recorded scripts — no AI agent needed

Run Script Demo

Once recorded, scripts run deterministically with winwright run — no AI agent, no LLM calls, no token costs. Results are the same every time.

If the UI layout changes, WinWright can self-heal broken selectors automatically (winwright heal). For larger UI redesigns, ask the AI agent to update the script — still faster than rewriting tests from scratch.

Why this matters:

  • Save AI costs — the agent records once, scripts replay for free
  • Deterministic results — every run produces identical, reproducible outcomes
  • Easy maintenance — self-healing selectors and AI-assisted script repair

Contents

Quick Start

Install, configure your MCP client, then ask the agent to do something:

"Launch Notepad, type 'Hello from WinWright', then read back what you typed."

The agent calls WinWright tools and returns results:

ww_launch    → { "processId": 12840, "mainWindowTitle": "Untitled - Notepad" }
ww_type      → { "success": true }
ww_get_value → { "value": "Hello from WinWright" }

Every tool returns structured JSON. The agent decides which tools to call and in what order — you describe the goal in plain language.

Install

Claude Code Plugin

From inside Claude Code, add the marketplace and install:

/plugin marketplace add civyk-official/civyk-winwright
/plugin install winwright@civyk-winwright

The plugin's install script downloads the latest binary automatically.

Note: WinWright has been submitted to the official Claude Code plugin directory and is pending review. Until approved, use the marketplace commands above to install.

Binary Download

Download from GitHub Releases:

Asset Architecture
winwright-*-win-x64.zip Intel/AMD 64-bit
winwright-*-win-arm64.zip ARM64 (Surface Pro, etc.)

MCP Client Configuration

Claude Code / VSCode (stdio)

{
  "servers": {
    "winwright": {
      "type": "stdio",
      "command": "C:/path/to/Civyk.WinWright.Mcp.exe",
      "args": ["mcp"]
    }
  }
}

Claude Code / VSCode (HTTP)

Start the server first: Civyk.WinWright.Mcp.exe serve --port 8765

{
  "servers": {
    "winwright": {
      "type": "http",
      "url": "http://localhost:8765/mcp"
    }
  }
}

Claude Desktop

{
  "mcpServers": {
    "winwright": {
      "command": "C:/path/to/Civyk.WinWright.Mcp.exe",
      "args": ["mcp"]
    }
  }
}

Use Cases

Each card links to a detailed walkthrough with real prompts, tool call parameters, and example output. Browse all guides in docs/use-cases/.

Scripted UI Test Automation for CI

Record an AI session once — the agent discovers the UI, performs actions, embeds assertions — then export a portable JSON script that replays in CI without an AI agent. Describe your app or paste your existing manual test suite; the agent scripts it automatically.

Autonomous Desktop Automation

Give an AI agent access to your desktop. It launches apps, moves data between them, fills forms, and takes screenshots for verification — no scripts to write or maintain.

Legacy App Data Extraction

Many enterprise apps have no API. If Windows UI Automation can see a control, WinWright can read its value. Extract data from apps that were never built for integration.

Scripted Desktop Automation for Repeated Tasks

Record a repetitive daily workflow once. Export as an RPA script and replay on demand — no AI agent required after the recording. Ideal for report exports, data imports, and any multi-step task that runs the same way every time.

AI-Powered UI Testing

An AI agent explores your WinForms or WPF app, finds elements, and asserts state. No brittle XPath selectors to maintain — the agent adapts when UI changes.

Bulk Data Validation

Drive an app through 50+ records automatically. Compare each displayed value against a reference table and get a structured pass/fail report with discrepancy details.

Cross-App Workflows

Automate workflows that span desktop apps and browser — read from an accounting app, submit to a web portal, screenshot the confirmation.

Application Health Monitoring

Verify a running app is alive and responsive — process running, connection status showing 'Connected', service healthy. Pair with Windows Task Scheduler for scheduled checks.

Remote Administration

Manage processes, services, registry, and scheduled tasks on remote machines over HTTP. Five-layer security: IP allowlist, Windows Negotiate auth, AD group authorization, rate limiting, and per-user session limits.

Accessibility Auditing

Traverse the full UIA element tree. Check that controls have names, buttons have labels, and keyboard paths exist. The AI agent generates a compliance report.

Dialog and Modal Handling

Detect unexpected confirmation dialogs, file-save prompts, and Win32 MessageBox popups after every click. Handle or dismiss them without breaking the automation flow.

Tools

~59 consolidated tools across five categories (merged from 110 via action/mode parameters):

Category Tools What it does
Desktop Automation ~30 Launch apps, click, type, read values, screenshots, tree navigation, dialogs, test case recording, CI script export (UIA3)
System ~12 Processes, registry, environment variables, file system, network, services, scheduled tasks
Browser 4 Chrome/Edge via CDP — sessions, pages, elements, advanced (eval/forms/dialogs). No Selenium dependency
AI Agent ~10 Snapshots, state diffing, event watching, action recording, ww_get_schema for tool discovery
Security Runtime permission guards with AD group overrides, JSONL audit logging

Each tool supports multiple actions via an action parameter (e.g., ww_service(action="list"), ww_service(action="start")), reducing the total tool count while maintaining full functionality.

Configuration

Create winwright.json next to the binary (or %APPDATA%\WinWright\winwright.json):

{
  "permissions": {
    "allowShell": false,
    "allowRegistryWrite": false,
    "allowProcessKill": false,
    "allowFileWrite": false,
    "allowServiceControl": false,
    "allowTaskScheduler": false,
    "allowEnvironmentWrite": false,
    "allowBrowserEval": false,
    "allowNetworkProbe": true
  },
  "audit": {
    "enabled": true,
    "logPath": "audit.jsonl"
  }
}

All dangerous operations are disabled by default. Enable only what you need.

CLI

winwright mcp                                    Start MCP server (stdio)
winwright serve --port N                         Start MCP server (HTTP, default 8765)
winwright run <script.json> [--format text|junit] [--output <file>]
                                                 Replay a recorded automation script
winwright heal <script.json> [--app <path>|--pid <n>] [--output <file>] [--min-confidence <0-1>]
                                                 Probe broken selectors against a live UI and repair them
winwright inspect <pid>                          Dump UIA element tree for a process
winwright doctor                                 Verify environment prerequisites

Requirements

  • Windows 10 or 11 (x64 or ARM64)
  • No .NET runtime needed for the binary download — it's self-contained

Who Is This For

Good fit:

  • QA engineers testing WinForms, WPF, or Win32 apps who want AI-assisted test creation
  • Developers building AI agents that need to interact with the Windows desktop
  • Teams extracting data from legacy enterprise apps that have no API
  • Anyone automating repetitive multi-app workflows on Windows

Not a good fit:

  • Linux or macOS automation — WinWright is Windows-only (UIA is a Windows API)
  • Web-only testing — use Playwright instead; WinWright's browser tools are for mixed desktop+browser workflows
  • High-throughput data pipelines — UIA reads controls one at a time; if you need bulk data transfer, a proper API or database connection is better

How It Compares

WinWright UiPath Power Automate Desktop Playwright
What it automates Desktop + browser + system Desktop + browser + system Desktop + browser + cloud Browser only
How you use it AI agent via MCP (natural language) Visual workflow designer Visual workflow designer Code (JS/Python/C#)
Desktop support WPF, WinForms, Win32 (UIA3) WPF, WinForms, Win32, Java, SAP WPF, WinForms, Win32 None
Browser support Chrome/Edge via CDP Chrome, Edge, Firefox Chrome, Edge, Firefox Chrome, Edge, Firefox, Safari
Selector model AI picks elements by name/type Visual selector recorder Visual selector recorder CSS/XPath selectors
Cost Free Licensed (per-user/bot) Free (desktop), licensed (cloud) Free
Setup Single binary, no runtime Full install + studio Windows store app npm install
Designed for AI agents and MCP clients Enterprise RPA Business user automation Developer testing

WinWright is not an RPA platform. It's a tool server that gives AI agents access to Windows. If you need a visual workflow builder or enterprise orchestration, UiPath or Power Automate are better choices. If you need browser-only testing, Playwright is more mature.

WinWright fits where those tools don't — when an AI agent needs to see and operate the Windows desktop, or when you need desktop + browser in one MCP session.

Support

Help keep this project alive and growing!

If WinWright has helped your development workflow, consider supporting its continued development. Your contribution helps with:

  • Ongoing maintenance and bug fixes
  • New feature development
  • Infrastructure costs

50% of all donations go directly to children's charities helping those in need. The remaining funds support project maintenance and feature upgrades.

Buy Me a Coffee Ko-fi

Every contribution, no matter the size, makes a difference.

License

Free to use for any purpose — personal, academic, commercial. See LICENSE for full terms. Attribution required when redistributing.


Built on Trust, Driven by ValueCivyk

推荐服务器

Baidu Map

Baidu Map

百度地图核心API现已全面兼容MCP协议,是国内首家兼容MCP协议的地图服务商。

官方
精选
JavaScript
Playwright MCP Server

Playwright MCP Server

一个模型上下文协议服务器,它使大型语言模型能够通过结构化的可访问性快照与网页进行交互,而无需视觉模型或屏幕截图。

官方
精选
TypeScript
Magic Component Platform (MCP)

Magic Component Platform (MCP)

一个由人工智能驱动的工具,可以从自然语言描述生成现代化的用户界面组件,并与流行的集成开发环境(IDE)集成,从而简化用户界面开发流程。

官方
精选
本地
TypeScript
Audiense Insights MCP Server

Audiense Insights MCP Server

通过模型上下文协议启用与 Audiense Insights 账户的交互,从而促进营销洞察和受众数据的提取和分析,包括人口统计信息、行为和影响者互动。

官方
精选
本地
TypeScript
VeyraX

VeyraX

一个单一的 MCP 工具,连接你所有喜爱的工具:Gmail、日历以及其他 40 多个工具。

官方
精选
本地
graphlit-mcp-server

graphlit-mcp-server

模型上下文协议 (MCP) 服务器实现了 MCP 客户端与 Graphlit 服务之间的集成。 除了网络爬取之外,还可以将任何内容(从 Slack 到 Gmail 再到播客订阅源)导入到 Graphlit 项目中,然后从 MCP 客户端检索相关内容。

官方
精选
TypeScript
Kagi MCP Server

Kagi MCP Server

一个 MCP 服务器,集成了 Kagi 搜索功能和 Claude AI,使 Claude 能够在回答需要最新信息的问题时执行实时网络搜索。

官方
精选
Python
e2b-mcp-server

e2b-mcp-server

使用 MCP 通过 e2b 运行代码。

官方
精选
Neon MCP Server

Neon MCP Server

用于与 Neon 管理 API 和数据库交互的 MCP 服务器

官方
精选
Exa MCP Server

Exa MCP Server

模型上下文协议(MCP)服务器允许像 Claude 这样的 AI 助手使用 Exa AI 搜索 API 进行网络搜索。这种设置允许 AI 模型以安全和受控的方式获取实时的网络信息。

官方
精选