An MCP server that captures screenshots and performs OCR text recognition for Japanese and English, producing outputs in multiple formats.
Configuration
{
  "mcpServers": {
    "kazuph-mcp-screenshot": {
      "command": "npx",
      "args": [
        "-y",
        "@kazuph/mcp-screenshot"
      ],
      "env": {
        "OCR_API_URL": "http://localhost:8000"
      }
    }
  }
}
You run a specialized MCP server that captures screenshots and runs OCR to extract text in Japanese and English. It can output results in multiple formats and is designed to be invoked from an MCP client to automate screen capture and text recognition tasks.
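If you generate the client configuration from a script rather than editing it by hand, the entry shown in the Configuration section could be built roughly as follows. This is a minimal sketch: the interface names and the buildScreenshotServerConfig helper are hypothetical and not part of the package. Only the server key, command, arguments, and the OCR_API_URL variable come from the configuration itself.

```typescript
// Hypothetical types describing one entry in an MCP client's "mcpServers" map.
interface McpServerEntry {
  command: string;
  args: string[];
  env: Record<string, string>;
}

interface McpClientConfig {
  mcpServers: Record<string, McpServerEntry>;
}

// Hypothetical helper: rebuilds the documented entry, letting the caller
// override the OCR API base URL (defaults to the documented value).
function buildScreenshotServerConfig(
  ocrApiUrl: string = "http://localhost:8000",
): McpClientConfig {
  return {
    mcpServers: {
      "kazuph-mcp-screenshot": {
        command: "npx",
        args: ["-y", "@kazuph/mcp-screenshot"],
        env: { OCR_API_URL: ocrApiUrl },
      },
    },
  };
}

// Serialize with two-space indentation, ready to paste into a client config file.
const configJson: string = JSON.stringify(buildScreenshotServerConfig(), null, 2);
```

Passing a different URL, for example buildScreenshotServerConfig("http://ocr.example:9000"), changes only the env block and leaves the command and arguments untouched.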
To use this server from an MCP client, you start the server component and issue capture requests through the client. You can request a screenshot of the left half, right half, or the full screen, and you can choose the output format you want (JSON, Markdown, vertical, or horizontal). The OCR engine will handle recognition, prioritizing yomitoku when available and falling back to Tesseract.js if needed. When you issue a capture, you’ll receive the recognized text in the selected format.
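The engine priority described above (yomitoku first, Tesseract.js as a fallback) follows a common try-primary-then-fallback pattern. The sketch below illustrates that pattern only. It assumes that "unavailable" means the primary engine throws or rejects, which the server's real logic may define differently. The OcrEngine type, recognizeWithFallback, and both stand-in engines are hypothetical and call neither yomitoku nor Tesseract.js.

```typescript
// An OCR engine takes image bytes and resolves to the recognized text.
type OcrEngine = (image: Uint8Array) => Promise<string>;

interface OcrResult {
  text: string;
  engine: "primary" | "fallback";
}

// Try the primary engine first; if it throws or rejects, use the fallback.
// Errors from the fallback itself propagate to the caller.
async function recognizeWithFallback(
  image: Uint8Array,
  primary: OcrEngine,
  fallback: OcrEngine,
): Promise<OcrResult> {
  try {
    return { text: await primary(image), engine: "primary" };
  } catch {
    return { text: await fallback(image), engine: "fallback" };
  }
}

// Stand-in engines for illustration only.
const unavailableYomitoku: OcrEngine = async () => {
  throw new Error("OCR API unreachable");
};
const stubTesseract: OcrEngine = async () => "hello";
```

Returning which engine produced the text makes it easier to spot when the fallback is being used more often than expected, for instance because the OCR API is not running.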
Prerequisites: you need Node.js and npm installed on your system.
Install and run the MCP server package with npx, which ships with npm.
npx -y @kazuph/mcp-screenshot
The server relies on an OCR API endpoint for yomitoku. You supply the API base URL through an environment variable named OCR_API_URL. By default, this is set to http://localhost:8000.
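To make the default concrete, here is one way a program could resolve the OCR API base URL from its environment. It is a sketch under assumptions: resolveOcrApiUrl is a hypothetical helper, and trimming whitespace and trailing slashes is a convenience added here, not documented behavior of the server. Only the variable name and the default value come from the text above.

```typescript
// Documented default for the yomitoku OCR API base URL.
const DEFAULT_OCR_API_URL = "http://localhost:8000";

// Hypothetical helper. The environment is passed in (for example process.env
// in Node.js) so the function has no hidden dependencies.
function resolveOcrApiUrl(env: Record<string, string | undefined>): string {
  const raw = (env["OCR_API_URL"] ?? "").trim();
  // Treat an unset or blank variable as "use the default".
  const base = raw.length > 0 ? raw : DEFAULT_OCR_API_URL;
  // Drop trailing slashes so request paths can be appended consistently.
  return base.replace(/\/+$/, "");
}
```

In a Node.js script this would typically be called as resolveOcrApiUrl(process.env).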
Takes a screenshot and performs OCR. Options are region (left, right, or full) and format (json, markdown, vertical, or horizontal).
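A client that assembles these options may want to check them before sending a request. The sketch below models the two options as TypeScript union types and rejects anything else. The names CaptureOptions and parseCaptureOptions are hypothetical, and requiring both fields is an assumption of this sketch, since the text does not state which values the server uses when an option is omitted.

```typescript
// Allowed values, taken from the option description above.
const REGIONS = ["left", "right", "full"] as const;
const FORMATS = ["json", "markdown", "vertical", "horizontal"] as const;

type Region = (typeof REGIONS)[number];
type OutputFormat = (typeof FORMATS)[number];

interface CaptureOptions {
  region: Region;
  format: OutputFormat;
}

// Type guard: true when value is a string contained in the allowed list.
function isOneOf<T extends string>(allowed: readonly T[], value: unknown): value is T {
  return typeof value === "string" && (allowed as readonly string[]).indexOf(value) !== -1;
}

// Hypothetical validator: both fields are required in this sketch.
function parseCaptureOptions(input: { region: unknown; format: unknown }): CaptureOptions {
  const { region, format } = input;
  if (!isOneOf(REGIONS, region)) {
    throw new RangeError(`region must be one of: ${REGIONS.join(", ")}`);
  }
  if (!isOneOf(FORMATS, format)) {
    throw new RangeError(`format must be one of: ${FORMATS.join(", ")}`);
  }
  return { region, format };
}
```

For example, parseCaptureOptions({ region: "left", format: "markdown" }) returns the same pair with narrowed types, while a region such as "top" raises a RangeError listing the accepted values.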