home / mcp / databeak mcp server
Model Context Protocol server providing 40+ specialized CSV data manipulation tools for AI assistants. Enables seamless data loading, transformation, analysis, validation, and export workflows with session management and auto-save capabilities.
Configuration
View docs{
"mcpServers": {
"jonpspri-databeak": {
"command": "uvx",
"args": [
"--from",
"git+https://github.com/jonpspri/databeak.git",
"databeak"
],
"env": {
"DATABEAK_MAX_ROWS": "YOUR_VALUE",
"DATABEAK_SESSION_TIMEOUT": "YOUR_VALUE",
"DATABEAK_MAX_MEMORY_USAGE_MB": "YOUR_VALUE",
"DATABEAK_URL_TIMEOUT_SECONDS": "YOUR_VALUE",
"DATABEAK_MAX_DOWNLOAD_SIZE_MB": "YOUR_VALUE",
"DATABEAK_HEALTH_MEMORY_THRESHOLD_MB": "YOUR_VALUE"
}
}
}
}DataBeak is an MCP server that lets you process and analyze CSV data through a flexible, AI-assisted interface. It supports loading data from URLs or string content, performing transformations, running analytics, validating data, and managing sessions securely in a stateless design. You interact with it via an MCP client, enabling powerful data workflows in natural language or structured prompts without exposing the underlying data handling details.
Set up your MCP client to connect to DataBeak using the provided runtime configuration. You will run the server through a standard MCP pattern that keeps the external context separate and supports multiple concurrent sessions. Use the following approach to start and interact with DataBeak in your client.
Prerequisites: you need a runtime capable of executing MCP server configurations. The fastest way to begin is to use the runtime that DataBeak provides for MCP integration.
1. Create a settings file for your MCP client and add the server configuration block shown below. This config uses a stdio runtime to load the MCP server from a remote source.
{
"mcpServers": {
"databeak": {
"command": "uvx",
"args": [
"--from",
"git+https://github.com/jonpspri/databeak.git",
"databeak"
]
}
}
}If you prefer HTTP-based access or a custom deployment, you can run DataBeak in HTTP mode and expose the MCP interface over a network port. The following command starts the server in HTTP mode and binds to all interfaces on port 8000.
uv run databeak --transport http --host 0.0.0.0 --port 8000Once DataBeak is running, you can ask your MCP client to load and analyze CSV data with simple prompts. For example you can load CSV content directly, or provide a URL for the data source, and then request operations such as removing duplicates or computing statistics. You can query for outliers in a numeric column and retrieve a data profile to understand data quality and distribution.
DataBeak is designed to be web-safe with no local file system access for hosted deployments. It supports asynchronous I/O and streaming downloads, with session isolation to support multi-user scenarios. Be mindful of predefined limits on downloads, memory, and rows, and adjust environment variables accordingly for production workloads.