macOS GUI Control MCP Server
Provides full macOS GUI control via MCP, including mouse, keyboard, screen capture, and window management actions.
Configuration
{
  "mcpServers": {
    "akira-papa-macos-gui-control-mcp": {
      "command": "node",
      "args": [
        "/path/to/macOS-GUI-Control-MCP/server/index.js"
      ],
      "env": {
        "TIMEOUT": "30000",
        "LOG_OPERATIONS": "true"
      }
    }
  }
}

You can remotely control macOS input devices and screen capture using this MCP server. It lets you perform mouse and keyboard actions, capture screenshots, manage windows, run AppleScript or shell commands, and more, all through a consistent MCP interface. This is especially useful for automating GUI tasks or building assistive workflows on macOS.
You interact with the MCP server through an MCP client. Start the server as described in the installation steps, then reference the available actions by name from your client. Typical usage includes clicking at coordinates, typing text, taking screenshots, listing windows, activating apps, resizing windows, and running auxiliary commands. Commands are expressed using the tool names shown in the action list, and you can combine multiple actions in sequences to automate complex GUI tasks.
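As a rough illustration of the flow described above, the sketch below builds the JSON-RPC 2.0 `tools/call` requests an MCP client would send for a short click, type, and screenshot sequence. The tool names (`mouse_click`, `type_text`, `take_screenshot`) and argument keys (`x`, `y`, `text`) are hypothetical placeholders, not confirmed names from this server; check the server's action list or its `tools/list` response for the real ones.

```python
import itertools
import json

# Monotonic request ids: each JSON-RPC request should carry a unique id.
_ids = itertools.count(1)


def tool_call(name: str, arguments: dict) -> dict:
    """Build one MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


def build_sequence(steps: list[tuple[str, dict]]) -> list[dict]:
    """Turn (tool name, arguments) pairs into an ordered list of requests."""
    return [tool_call(name, args) for name, args in steps]


# Hypothetical tool names and argument keys, for illustration only.
requests = build_sequence([
    ("mouse_click", {"x": 400, "y": 300}),
    ("type_text", {"text": "hello"}),
    ("take_screenshot", {}),
])

for request in requests:
    print(json.dumps(request))
```

In practice an MCP client library sends these messages over the server's stdio transport after the initialization handshake; the sketch only shows the message shapes and their ordering.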
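Before installing the MCP server, you need Node.js and a working macOS environment with the necessary permissions granted. GUI automation on macOS typically needs Accessibility and Screen Recording access for the process that runs the server.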
Install the build toolchain and MCP packager, then package the MCP server for distribution.
Install dependencies and package the MCP server.
This server reads two environment variables: TIMEOUT, which controls timeouts, and LOG_OPERATIONS, which controls logging behavior. You can set these as shown in the MCP configuration example.
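To make the configuration step concrete, here is a small sketch that adds this server's entry to a client configuration without disturbing other servers. The function name `add_gui_control_server` and its parameters are made up for this example; the entry name, `command`, placeholder path, and variable names are taken from the configuration example. That example writes environment values as strings, so the sketch stringifies them. Using `"false"` to disable logging is an assumption.

```python
import json


def add_gui_control_server(config: dict, script_path: str,
                           timeout: int = 30000,
                           log_operations: bool = True) -> dict:
    """Return a copy of `config` with the GUI control server entry added."""
    servers = dict(config.get("mcpServers", {}))
    servers["akira-papa-macos-gui-control-mcp"] = {
        "command": "node",
        "args": [script_path],
        # Environment values are strings in the configuration example.
        "env": {
            "TIMEOUT": str(timeout),
            # "false" as the disabled value is an assumption.
            "LOG_OPERATIONS": "true" if log_operations else "false",
        },
    }
    return {**config, "mcpServers": servers}


# Placeholder path copied from the configuration example; replace with your own.
updated = add_gui_control_server(
    {"mcpServers": {}},
    "/path/to/macOS-GUI-Control-MCP/server/index.js",
)
print(json.dumps(updated, indent=2))
```

Returning a new dictionary rather than mutating the input keeps the original configuration available in case the result needs to be reviewed before it is written to disk.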
Available actions
Click at specified coordinates on the screen.
Perform a double-click at the given coordinates.
Right-click at the specified coordinates.
Move the cursor to the specified coordinates.
Drag the cursor from one coordinate to another.
Scroll the mouse wheel in up/down/left/right directions.
Retrieve the current cursor position.
Type a string of text into the focused input.
Press a single key, with optional modifier keys.
Execute a common keyboard shortcut like copy/paste/save.
Take a full-screen or region-based screenshot.
Get the current screen resolution.
Get the color value of a pixel at a coordinate.
List currently open windows.
Activate a specific application window.
Move a window to new coordinates.
Resize a window to specified dimensions.
Execute an AppleScript script.
Run a shell command and return its output.
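Several of the actions above take screen coordinates, so a client may want to check them against the reported screen resolution before sending a call. The following is a minimal sketch of such a check. `clamp_point` and `clamp_region` are hypothetical helper names, and the coordinate convention (origin at the top-left, pixel units) is an assumption rather than something documented here.

```python
def clamp_point(x: int, y: int, width: int, height: int) -> tuple[int, int]:
    """Clamp a point into the screen rectangle [0, width-1] x [0, height-1]."""
    return (min(max(x, 0), width - 1), min(max(y, 0), height - 1))


def clamp_region(x: int, y: int, w: int, h: int,
                 width: int, height: int) -> tuple[int, int, int, int]:
    """Clip a region to the screen; returns (x, y, w, h), where w or h may be 0."""
    left, top = max(x, 0), max(y, 0)
    right, bottom = min(x + w, width), min(y + h, height)
    return (left, top, max(right - left, 0), max(bottom - top, 0))


# Example with an assumed 1920x1080 screen.
print(clamp_point(2000, -5, 1920, 1080))
print(clamp_region(-10, 20, 100, 50, 1920, 1080))
```

A region that ends up with zero width or height lies entirely off-screen, which a client could treat as an error instead of requesting an empty screenshot.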