home / mcp / windows-mcp mcp server
Provides an open-source bridge for AI agents to control Windows desktop tasks via an MCP server.
Configuration
View docs{
"mcpServers": {
"cursortouch-windows-mcp-server": {
"command": "uv",
"args": [
"--directory",
"<path to the windows-mcp directory>",
"run",
"main.py"
]
}
}
}Windows-MCP is a lightweight, open-source server that connects AI agents to the Windows operating system. It enables agents to navigate files, control applications, interact with the UI, QA test, and automate a wide range of Windows tasks with real-time responsiveness.
You use Windows-MCP by connecting your MCP-compatible client to the local MCP server instance. The server exposes a set of tools that let your agent click, type, scroll, drag, move the mouse, press keyboard shortcuts,Wait, capture window and desktop state, launch apps, and run PowerShell commands. Your agent can then orchestrate multi-step workflows across Windows UI elements, automate QA tests, or perform routine maintenance tasks. Start by configuring the client to point at the local server and then issue high-level actions like “open File Explorer,” “type search text,” or “activate a window” to drive your automation flow.
Prerequisites you need before installing Windows-MCP are Python 3.13 or newer and the UV package manager from Astra. Install UV with one of these options.
Option A: Install UV directly with Python package manager.
pip install uvInstall Claude Desktop and then install the MCP packager, build the extension, and install it in Claude Desktop.
npm install -g @anthropic-ai/mcpbClone the Windows-MCP repository and build the Desktop Extension MCPB.
git clone https://github.com/CursorTouch/Windows-MCP.git
cd Windows-MCP
npx @anthropic-ai/mcpb packOpen Claude Desktop and install the extension through Settings > Extensions > Advance Settings > Install Extension, selecting the .mcpb file.
Install Perplexity Desktop, clone the Windows-MCP repository, then configure the connector to run the local MCP server.
git clone https://github.com/CursorTouch/Windows-MCP.git
cd Windows-MCPIn Perplexity Desktop, go to Settings > Connectors > Add Connector > Advanced. Use the following config to run the local server.
{
"name": "windows-mcp",
"command": "uv",
"args": [
"--directory",
"<path to the windows-mcp directory>",
"run",
"main.py"
]
}Install Gemini CLI, clone the repository, and add the MCP config to Gemini’s settings so it can start the local server.
npm install -g @google/gemini-cliEdit Gemini settings to include the Windows-MCP config.
{
"theme": "Default",
//MCP Server Config
"mcpServers": {
"windows-mcp": {
"command": "uv",
"args": [
"--directory",
"<path to the windows-mcp directory>",
"run",
"main.py"
]
}
}
}Install Qwen Code, clone the repository, and configure settings to run the local MCP server.
npm install -g @qwen-code/qwen-code@latestIn Qwen Code settings, add Windows-MCP as shown.
{
//MCP Server Config
"mcpServers": {
"windows-mcp": {
"command": "uv",
"args": [
"--directory",
"<path to the windows-mcp directory>",
"run",
"main.py"
]
}
}
}Install Codex CLI, clone the repository, and configure the Codex config to run the local MCP server.
npm install -g @openai/codexAdd the Windows-MCP config to Codex config.toml.
[mcp_servers.windows-mcp]
command="uv"
args=[
"--directory",
"<path to the windows-mcp directory>",
"run",
"main.py"
]Click on screen at specified coordinates to interact with UI elements.
Type text into a focused element, with optional clearing of existing text.
Scroll vertically or horizontally within a window or region.
Drag the cursor from one point to another.
Move the mouse pointer to a target location.
Simulate pressing keyboard shortcuts like Ctrl+C or Alt+Tab.
Pause execution for a defined duration.
Capture a snapshot of the system state including language, active apps, and UI components.
Launch, resize, and switch between applications from the start menu.
Execute PowerShell commands.
Scrape information from webpages or UI elements.