Server data from the Official MCP Registry
Windows desktop-control MCP server: screenshot, window mgmt, mouse/keyboard input, recording.
Windows desktop-control MCP server: screenshot, window mgmt, mouse/keyboard input, recording.
Valid MCP server (0 strong, 4 medium validity signals). No known CVEs in dependencies. Imported from the Official MCP Registry. 1 finding(s) downgraded by scanner intelligence.
13 files analyzed · 1 issue found
Security scores are indicators to help you make informed decisions, not guarantees. Always review permissions before connecting any MCP server.
This plugin requests these system permissions. Most are normal for its category.
Set these up before or after installing:
Environment variable: DESKTOP_MCP_ENABLE_WINDOW
Environment variable: DESKTOP_MCP_ENABLE_INPUT
Environment variable: DESKTOP_MCP_ENABLE_RECORD
Environment variable: DESKTOP_MCP_RATE_LIMIT_PER_MIN
Environment variable: DESKTOP_MCP_SCRATCH_DIR
Environment variable: DESKTOP_MCP_LIVE
Add this to your MCP configuration file:
{
"mcpServers": {
"io-github-jaimenbell-desktop-mcp": {
"args": [
"desktop-mcp"
],
"command": "uvx"
}
}
}From the project's GitHub README.
Windows desktop-control MCP server: screenshot, window management, mouse/keyboard input, and ffmpeg screen-recording -- built to the same standard as mcp-factory and rag-mcp (own pyproject, fastmcp server, honest README, real test suite). Config-gated tool groups, input disabled by default.
| Group | Tools | Default state |
|---|---|---|
observe | screenshot, list_windows, get_active_window, window_info | always on |
window | focus_window, move_resize_window, minimize_window, restore_window | env-gated, off unless DESKTOP_MCP_ENABLE_WINDOW=1 |
input | mouse_move, mouse_click, mouse_drag, mouse_scroll, key_press, hotkey, type_text | env-gated, OFF by default -- requires DESKTOP_MCP_ENABLE_INPUT=1 |
record | record_start, record_status, record_stop | env-gated, off unless DESKTOP_MCP_ENABLE_RECORD=1 |
A disabled group returns a structured policy_refusal error (never a silent
no-op, never a crash). Input actions are additionally rate-capped (default 60
actions/min, tunable via DESKTOP_MCP_RATE_LIMIT_PER_MIN) -- exceeding the
cap returns a structured rate_limited error.
This is defense-in-depth: harness-level permission prompts are the first gate, but the server itself refuses input/window/record actions unless its own env explicitly enables them, so a misconfigured or overly-permissive harness can't turn on capabilities the operator didn't opt into for this process.
Every claim below maps to the file that implements it and the test(s) that verify it -- no capability is asserted without a corresponding implementation + test.
| Claim | Implementation | Verified by |
|---|---|---|
| Capture a screenshot (monitor / region / window) as PNG | desktop_mcp/groups/observe.py::screenshot | tests/test_observe.py::TestScreenshot, live: tests/test_live_smoke.py::test_live_screenshot_real_png |
| Enumerate windows / get active window / look up by title | desktop_mcp/groups/observe.py | tests/test_observe.py::TestListWindows, TestGetActiveWindow, TestWindowInfo |
| Focus / move+resize / minimize / restore a window | desktop_mcp/groups/window.py | tests/test_window.py |
| Mouse move/click/drag/scroll, key press/hotkey, type text | desktop_mcp/groups/input_tools.py | tests/test_input.py (mocked pyautogui only -- see Limitations) |
| Screen recording via ffmpeg gdigrab, hard duration cap, graceful stop | desktop_mcp/groups/record.py | tests/test_record.py, live: tests/test_live_smoke.py::test_live_record_real_mp4 |
| Input group OFF by default, structured refusal when disabled | desktop_mcp/config.py::group_enabled, gated | tests/test_config.py::TestGroupEnabled, tests/test_input.py::TestGateDisabledByDefault |
| Rate cap on input actions (default 60/min) | desktop_mcp/config.py::TokenBucket, RateLimiterRegistry | tests/test_config.py::TestTokenBucket, tests/test_input.py::TestRateLimit |
| DPI-awareness bootstrap (per-monitor-v2) so mss/pyautogui coords agree on scaled displays | desktop_mcp/config.py::ensure_dpi_awareness | tests/test_config.py::TestDpiAwareness (idempotency only; visual coord-agreement is not automated -- see Limitations) |
| Coordinate validation against virtual-desktop bounds before any mouse action | desktop_mcp/groups/input_tools.py::_validate_point | tests/test_input.py (out-of-bounds cases) |
| Orphan-guard: a stale recorder from a crashed process gets killed before a new one starts | desktop_mcp/groups/record.py::_orphan_guard | tests/test_record.py::TestRecordStart::test_orphan_guard_kills_stale_recorder |
window_action_failed /
input_failed error naming UIPI, never a silent no-op -- but there is no
workaround short of running the server elevated, which this project does
not do or recommend.mss pixel coordinates and pyautogui point coordinates should agree on
scaled displays. This bootstrap is unit-tested for idempotency/no-crash
only -- actual coordinate agreement on a live multi-DPI multi-monitor setup
has not been automated-tested and should be spot-checked if you're
targeting a non-100%-scaled monitor.~/.claude.json's desktop-mcp entry -- DESKTOP_MCP_ENABLE_INPUT is not
set there. Enabling it is a deliberate per-registration operator choice,
not a code change.FAILSAFE=True is intentional: slamming the cursor
into a screen corner mid-action raises inside pyautogui and aborts the
call. This can interrupt an in-flight mouse_drag. Treated as acceptable
v1 behavior (see plan's Open questions) -- it's a deliberate human
kill-switch, not a bug.| Var | Effect | Default |
|---|---|---|
DESKTOP_MCP_ENABLE_WINDOW | enable the window tool group | unset (off) |
DESKTOP_MCP_ENABLE_INPUT | enable the input tool group | unset (off) |
DESKTOP_MCP_ENABLE_RECORD | enable the record tool group | unset (off) |
DESKTOP_MCP_RATE_LIMIT_PER_MIN | input-group rate cap | 60 |
DESKTOP_MCP_SCRATCH_DIR | where screenshots/recordings/pidfiles are written | %TEMP%\desktop-mcp-scratch |
DESKTOP_MCP_LIVE | 1 to run real-hardware smoke tests (see Testing) | unset (skip) |
// A tool call from the MCP host, illustrative -- not a shell command.
{"tool": "screenshot", "arguments": {"monitor": 0}}
// -> {"ok": true, "path": "C:\\Users\\...\\Temp\\desktop-mcp-scratch\\screenshot-....png", "w": 3840, "h": 1080, "monitor": 0}
{"tool": "record_start", "arguments": {"fps": 30, "max_duration_s": 30}}
// -> {"ok": true, "path": "...\\recording-....mp4", "pid": 12345, "fps": 30, "max_duration_s": 30}
{"tool": "record_stop", "arguments": {}}
// -> {"ok": true, "path": "...\\recording-....mp4", "bytes": 800560, "duration_s": 3.13}
// input group disabled (default):
{"tool": "mouse_click", "arguments": {"x": 500, "y": 500}}
// -> {"ok": false, "error": {"type": "policy_refusal", "group": "input", "required_env": "DESKTOP_MCP_ENABLE_INPUT", ...}}
# unit suite (mocked backends, no real screen/input/recording touched)
python -m pytest -q
# handshake check -- prints every registered tool name
python scripts/list_tools.py
# real-hardware smokes (real screenshot PNG, real ~3s screen recording;
# never input-injection -- see safety rails above)
DESKTOP_MCP_LIVE=1 python -m pytest -q -k live_screenshot
DESKTOP_MCP_LIVE=1 python -m pytest -q -k live_record
pip install -r requirements.txt # or: pip install .
# deps: fastmcp==3.4.2, mss==10.2.0, pyautogui==0.9.54, PyGetWindow==0.0.9, pywin32==312
# also requires ffmpeg + ffprobe on PATH for the record group
Registered in ~/.claude.json as desktop-mcp (stdio, system Python312,
observe+window+record groups enabled, input group absent from env).
Maintained by Jaimen Bell. For production MCP integrations, custom servers, or agent-reliability work, see jaimenbell.dev or sponsor ongoing maintenance via GitHub Sponsors.
mcp-name: io.github.jaimenbell/desktop-mcp
Be the first to review this server!
by Modelcontextprotocol · Developer Tools
Web content fetching and conversion for efficient LLM usage
by Modelcontextprotocol · Developer Tools
Read, search, and manipulate Git repositories programmatically
by Toleno · Developer Tools
Toleno Network MCP Server — Manage your Toleno mining account with Claude AI using natural language.