home / mcp / pdf splitter mcp server
Provides random access to PDF contents by loading PDFs and extracting pages, searching text, and rendering pages.
Configuration
View docs{
"mcpServers": {
"espresso3389-pdf-splitter-mcp": {
"command": "bun",
"args": [
"run",
"/full/path/to/pdf-splitter-mcp/src/index.ts"
]
}
}
}PDF Splitter MCP Server provides random access to PDF contents, enabling selective extraction of pages, text, and images while minimizing processing and token costs. You integrate it with an MCP client to load PDFs, run page-level operations, and render pages as images for analysis, previews, or OCR workflows.
You enable and use the PDF Splitter MCP Server by connecting an MCP client that can talk to an MCP endpoint. Load a PDF to memory, then perform targeted actions such as extracting a single page, extracting a range, searching for text, listing images, or rendering pages as images. You can load multiple PDFs and switch between them using their IDs.
Typical usage flow:
To configure an MCP client for this server, provide a stdio connection that launches the MCP server locally. The following configuration is an example for Gemini CLI, using Bun to run the TypeScript entry point directly.
{
"mcpServers": {
"pdf_splitter": {
"command": "bun",
"args": ["run", "/full/path/to/pdf-splitter-mcp/src/index.ts"]
}
}
}Load a PDF file into memory (supports URLs) and return a PDF ID with its page count.
Extract content from a specific page using the loaded PDF ID and page number.
Extract content from a range of pages given a PDF ID, startPage, and endPage.
Search for text within the PDF with optional case sensitivity and regex support.
Retrieve metadata and information about a loaded PDF.
List all PDFs currently loaded in memory along with their IDs and page counts.
Extract the document outline or table of contents with page references.
List all images in the PDF with metadata such as page, index, dimensions, and format.
Extract images as base64 data and optionally save them to files with an output path pattern.
Extract a specific image by page and image index, returning base64 data or saving to a file.
Render a single PDF page as an image at a chosen DPI and format.
Render multiple pages as images in batch, with options for DPI, format, and output saving.