io.github.TheoEwzZer/winsight

编码与调试

by theoewzzer

Windows 屏幕捕获 MCP Server,让 Claude Code 能直接看到你的桌面内容。

什么是 io.github.TheoEwzZer/winsight

Windows 屏幕捕获 MCP Server,让 Claude Code 能直接看到你的桌面内容。

README

WinSight MCP

mcp-name: io.github.TheoEwzZer/winsight

Windows Screen Capture MCP Server — give Claude Code eyes on your Windows desktop.

WinSight is an MCP server that lets Claude Code capture your screen, manage windows, and launch applications on Windows.

Features

  • Screenshot the full screen, a specific region, or a specific window
  • Window capture uses Win32 PrintWindow API — captures the real window content even when it's behind other windows
  • List and inspect open windows and monitors (title, position, size, state, resolution)
  • Control windows — move, resize, minimize, maximize, restore, and focus
  • Launch applications and wait for their windows

Requirements

  • Windows 10/11
  • Python 3.10+

Quick Start

Option 1: uvx (recommended)

No install needed — runs directly:

json
{
  "mcpServers": {
    "winsight": {
      "command": "uvx",
      "args": ["winsight-mcp"]
    }
  }
}

Add this to your project's .mcp.json or ~/.claude/claude_desktop_config.json.

Option 2: pip install

bash
pip install winsight-mcp

Then configure:

json
{
  "mcpServers": {
    "winsight": {
      "command": "winsight-mcp"
    }
  }
}

Option 3: From source

bash
git clone https://github.com/TheoEwzZer/WinSight-MCP.git
cd WinSight-MCP
uv sync
json
{
  "mcpServers": {
    "winsight": {
      "command": "uv",
      "args": ["--directory", "/path/to/WinSight-MCP", "run", "winsight-mcp"]
    }
  }
}

Tools

Screenshot

ToolDescription
take_screenshotCapture the full screen or a specific monitor
screenshot_windowCapture a specific window by title (works even if behind other windows)
screenshot_regionCapture a rectangular region of the screen

Window Management

ToolDescription
list_windowsList all visible windows with optional title filter
get_window_infoGet detailed info about a window (position, size, state)
focus_windowBring a window to the foreground
resize_windowResize a window to specific dimensions
move_windowMove a window to a specific position
minimize_windowMinimize a window to the taskbar
maximize_windowMaximize a window to fill the screen
restore_windowRestore a minimized or maximized window to its normal state
wait_for_windowWait for a window to appear (adaptive polling with timeout)

System

ToolDescription
list_monitorsList all monitors with resolution, position, and primary flag
open_applicationLaunch an application and optionally wait for its window

Examples

Once the MCP server is connected, you can ask Claude Code things like:

  • "Take a screenshot of my screen"
  • "List all open windows"
  • "Capture the Notepad window"
  • "Open calculator and take a screenshot of it"
  • "Focus the Chrome window"
  • "Resize the app window to 800x600 and take a screenshot"
  • "Move the window to the top-left corner"
  • "What monitors do I have?"

Testing

The project has 112 tests covering all modules. Tests use mocks for Win32 APIs so they run on any platform.

Running tests

bash
uv run pytest

Test structure

text
tests/
  conftest.py              # Shared fixtures and Win32 stubs
  test_types.py            # TypedDict definitions validation
  test_screenshot.py       # Screen/region/window capture (mss, Win32 DC)
  test_window_manager.py   # Window listing, find, focus, resize, move, min/max/restore
  test_process_manager.py  # Application launch and window polling
  test_server.py           # MCP tool registration and integration

Adding tests

  1. Put new tests in the matching test_<module>.py file
  2. Use the shared fixtures from conftest.py (sample_window_info, mcp_server, fake_png_bytes)
  3. Mock Win32 APIs with @patch("winsight_mcp.<module>.win32gui") — never call real Win32 functions in tests
  4. For server integration tests, use the _call helper to invoke tools and _text to extract string results

License

MIT

常见问题

io.github.TheoEwzZer/winsight 是什么?

Windows 屏幕捕获 MCP Server,让 Claude Code 能直接看到你的桌面内容。

相关 Skills

网页构建器

by anthropics

Universal
热门

面向复杂 claude.ai HTML artifact 开发,快速初始化 React + Tailwind CSS + shadcn/ui 项目并打包为单文件 HTML,适合需要状态管理、路由或多组件交互的页面。

在 claude.ai 里做复杂网页 Artifact 很省心,多组件、状态和路由都能顺手搭起来,React、Tailwind 与 shadcn/ui 组合效率高、成品也更精致。

编码与调试
未扫描114.1k

前端设计

by anthropics

Universal
热门

面向组件、页面、海报和 Web 应用开发,按鲜明视觉方向生成可直接落地的前端代码与高质感 UI,适合做 landing page、Dashboard 或美化现有界面,避开千篇一律的 AI 审美。

想把页面做得既能上线又有设计感,就用前端设计:组件到整站都能产出,难得的是能避开千篇一律的 AI 味。

编码与调试
未扫描114.1k

网页应用测试

by anthropics

Universal
热门

用 Playwright 为本地 Web 应用编写自动化测试,支持启动开发服务器、校验前端交互、排查 UI 异常、抓取截图与浏览器日志,适合调试动态页面和回归验证。

借助 Playwright 一站式验证本地 Web 应用前端功能,调 UI 时还能同步查看日志和截图,定位问题更快。

编码与调试
未扫描114.1k

相关 MCP Server

GitHub

编辑精选

by GitHub

热门

GitHub 是 MCP 官方参考服务器,让 Claude 直接读写你的代码仓库和 Issues。

这个参考服务器解决了开发者想让 AI 安全访问 GitHub 数据的问题,适合需要自动化代码审查或 Issue 管理的团队。但注意它只是参考实现,生产环境得自己加固安全。

编码与调试
83.4k

by Context7

热门

Context7 是实时拉取最新文档和代码示例的智能助手,让你告别过时资料。

它能解决开发者查找文档时信息滞后的问题,特别适合快速上手新库或跟进更新。不过,依赖外部源可能导致偶尔的数据延迟,建议结合官方文档使用。

编码与调试
52.2k

by tldraw

热门

tldraw 是让 AI 助手直接在无限画布上绘图和协作的 MCP 服务器。

这解决了 AI 只能输出文本、无法视觉化协作的痛点——想象让 Claude 帮你画流程图或白板讨论。最适合需要快速原型设计或头脑风暴的开发者。不过,目前它只是个基础连接器,你得自己搭建画布应用才能发挥全部潜力。

编码与调试
46.3k

评论