home / mcp / streamsets mcp server
Provides an MCP server to manage and build StreamSets pipelines via Control Hub APIs.
Configuration
View docs{
"mcpServers": {
"bracta-streamsets-mcp-server": {
"command": "python",
"args": [
"/path/to/streamsets_server.py"
],
"env": {
"STREAMSETS_CRED_ID": "your-credential-id",
"PIPELINE_STORAGE_PATH": "/var/lib/mcp/pipelines",
"STREAMSETS_CRED_TOKEN": "your-auth-token",
"STREAMSETS_HOST_PREFIX": "https://your-instance.streamsets.com"
}
}
}
}You can run a StreamSets MCP Server to manage and build StreamSets pipelines through conversational interactions. It connects to StreamSets Control Hub APIs, letting you list, monitor, start, stop, and analyze pipelines and jobs, manage connections, and build pipelines via a guided, interactive experience. It also supports persistent pipeline-building sessions so you can continue across conversations and devices.
You will use an MCP client to connect to the StreamSets MCP Server and perform both read and write operations. Start by authenticating with your API credentials, then choose the action you want to perform: list or monitor jobs, browse pipelines, manage connections, view metrics, or begin an interactive pipeline building session. You can persist your pipeline-building session across conversations and reload your progress later.
Prerequisites and setup steps are as follows.
{
"step": "Install and run the MCP server locally"
}1) Prepare the environment and install Python dependencies.
pip install -r requirements.txt2) Configure environment variables for StreamSets access.
export STREAMSETS_HOST_PREFIX="https://your-instance.streamsets.com"
export STREAMSETS_CRED_ID="your-credential-id"
export STREAMSETS_CRED_TOKEN="your-auth-token"3) Start the MCP server in the foreground to test connectivity.
python streamsets_server.pyIf you prefer containerized deployment, use Docker to run the server with persistence for pipeline builders.
# Build the image
docker build -t streamsets-mcp-server .
# Create a persistent volume for pipeline builders
docker volume create streamsets-pipeline-data
# Run with persistence and environment variables
docker run --rm -it \
-e STREAMSETS_HOST_PREFIX="https://your-instance.streamsets.com" \
-e STREAMSETS_CRED_ID="your-credential-id" \
-e STREAMSETS_CRED_TOKEN="your-auth-token" \
-v streamsets-pipeline-data:/data \
streamsets-mcp-serverYou can connect Claude Desktop to the MCP Server either as a local development setup or in production with Docker persistence.
{
"mcpServers": {
"streamsets": {
"command": "python",
"args": ["/path/to/streamsets_server.py"],
"env": {
"STREAMSETS_HOST_PREFIX": "https://your-instance.streamsets.com",
"STREAMSETS_CRED_ID": "your-credential-id",
"STREAMSETS_CRED_TOKEN": "your-auth-token"
}
}
}
}Alternatively, run in production with Docker persistence.
{
"mcpServers": {
"streamsets": {
"command": "docker",
"args": [
"run", "--rm", "-i",
"-v", "streamsets-pipeline-data:/data",
"-e", "STREAMSETS_HOST_PREFIX=https://your-instance.streamsets.com",
"-e", "STREAMSETS_CRED_ID=your-credential-id",
"-e", "STREAMSETS_CRED_TOKEN=your-auth-token",
"streamsets-mcp-server"
]
}
}
}List all jobs for an organization and filter by status or other criteria.
Retrieve detailed information about a specific job by its ID.
Start a single job by its ID.
Stop a running job by its ID.
Start multiple jobs in one request by listing their IDs.
Search pipelines by name or query to locate specific configurations.
Fetch detailed information about a specific pipeline.
Export pipelines or commits for backup or migration.
Retrieve performance metrics for a given job.
Get a count of jobs by their current status.
Access metrics for executor components like collectors.
Retrieve security audit metrics and logs for an organization.