Replicate MCP Server
Provides access to Replicate models, predictions, and image handling via an MCP client.
Configuration
{
  "mcpServers": {
    "replicate_mcp": {
      "command": "mcp-replicate",
      "args": [],
      "env": {
        "REPLICATE_API_TOKEN": "YOUR_TOKEN"
      }
    }
  }
}
You can run Replicate models through a lightweight MCP server that exposes them as tools. The server lets you search models, run predictions, and handle generated images directly from your MCP client, which makes it useful for bringing Replicate into chat tools and automation workflows without leaving the client.
Connect your MCP client to the Replicate MCP server using the provided command and configuration. Once connected, you can browse models, run predictions with text or structured inputs, track and manage predictions, and view generated images in your browser. Use the available tools to search models, get detailed model information, create predictions, poll results, and manage image caches.
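As a rough illustration of the configuration step, the sketch below adds the Replicate server entry to an existing client configuration held as a Python dictionary. The function name `add_replicate_server` and the sample `other` server are hypothetical; only the `mcpServers` layout, the `mcp-replicate` command, and the `REPLICATE_API_TOKEN` variable come from the configuration shown on this page. Where your client stores its configuration file is not covered here.

```python
import json


def add_replicate_server(config: dict, token: str, name: str = "replicate") -> dict:
    """Return a copy of an MCP client config with the Replicate entry added.

    Hypothetical helper: it mirrors the JSON layout shown on this page.
    """
    if not token:
        raise ValueError("REPLICATE_API_TOKEN is empty")
    # Copy the top level and the server map so the caller's dict is not mutated.
    updated = dict(config)
    servers = dict(updated.get("mcpServers", {}))
    servers[name] = {
        "command": "mcp-replicate",
        "args": [],
        "env": {"REPLICATE_API_TOKEN": token},
    }
    updated["mcpServers"] = servers
    return updated


# Example: keep an existing (made-up) server and add the Replicate one.
existing = {"mcpServers": {"other": {"command": "other-server"}}}
print(json.dumps(add_replicate_server(existing, "YOUR_TOKEN"), indent=2))
```

Returning a new dictionary rather than editing in place makes it easier to compare the result with the original before writing anything back to disk.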
Prerequisites: Node.js version 18 or newer and a working internet connection.
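One way to check the Node.js prerequisite before installing is sketched below. It parses the text printed by `node --version` (for example `v18.17.0`) and compares the major version against 18. The helper names are illustrative and not part of the server.

```python
import re
import shutil
import subprocess
from typing import Optional

MIN_NODE_MAJOR = 18  # minimum version stated in the prerequisites


def node_major(version_text: str) -> int:
    """Extract the major version from text such as 'v18.17.0'."""
    match = re.match(r"v?(\d+)\.", version_text.strip())
    if match is None:
        raise ValueError(f"unrecognized Node.js version: {version_text!r}")
    return int(match.group(1))


def node_is_supported(version_text: str) -> bool:
    """True when the reported version meets the minimum major version."""
    return node_major(version_text) >= MIN_NODE_MAJOR


def installed_node_version() -> Optional[str]:
    """Return the output of `node --version`, or None if node is not on PATH."""
    node = shutil.which("node")
    if node is None:
        return None
    result = subprocess.run(
        [node, "--version"], capture_output=True, text=True, check=True
    )
    return result.stdout.strip()
```

A typical use would be to call `installed_node_version()` and, if it returns a string, pass it to `node_is_supported()` before running the install command.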
# Install the MCP server globally
npm install -g mcp-replicate
# Optional: run directly with npx without a global install
npx mcp-replicate
Configuration
You must provide a Replicate API token for the server to operate. There are two ways to supply this token:
{
  "mcpServers": {
    "replicate": {
      "command": "mcp-replicate",
      "env": {
        "REPLICATE_API_TOKEN": "your_token_here"
      }
    }
  }
}
Environment variable option
If you are using another MCP client, you can set the token as an environment variable in your shell before starting the server.
export REPLICATE_API_TOKEN=your_token_here
If tools are not visible or not functioning, verify that the MCP client is configured with the correct server settings and that the Replicate API token is valid. Restart both the MCP client and the server if needed, and check the server output for error messages.
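The first troubleshooting checks can be automated along the lines of the sketch below. It only inspects the local environment: whether `REPLICATE_API_TOKEN` is set, whether it still holds one of the placeholder values used on this page, and whether the `mcp-replicate` command is on `PATH`. It does not contact Replicate, so it cannot confirm that a token is actually valid. The function name `diagnose` is hypothetical.

```python
import shutil
from typing import List, Mapping

# Placeholder strings used in the configuration examples on this page.
PLACEHOLDER_TOKENS = {"your_token_here", "YOUR_TOKEN"}


def diagnose(env: Mapping[str, str], command: str = "mcp-replicate") -> List[str]:
    """Return human-readable problems found in the given environment."""
    problems = []
    token = env.get("REPLICATE_API_TOKEN", "").strip()
    if not token:
        problems.append("REPLICATE_API_TOKEN is not set")
    elif token in PLACEHOLDER_TOKENS:
        problems.append("REPLICATE_API_TOKEN still holds the placeholder value")
    # Look the command up on the PATH taken from the same environment mapping.
    if shutil.which(command, path=env.get("PATH")) is None:
        problems.append(f"{command} was not found on PATH")
    return problems
```

Calling `diagnose(os.environ)` (after `import os`) in the same shell that starts the server gives a quick list of things to fix before restarting the client.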
This server is an experimental integration aimed at enabling tool-based access to Replicate through MCP clients. It provides a set of model, prediction, and image handling capabilities and can be used with a variety of MCP clients that support standard stdio-based servers.
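To make "stdio-based" a little more concrete, the sketch below builds the first messages a generic MCP client would typically write to a server's standard input: an `initialize` request, an `initialized` notification, and a `tools/list` request, each as one line of JSON-RPC 2.0. This follows the general MCP protocol rather than anything specific to this server. The protocol version string, client name, and helper names are assumptions made for illustration.

```python
import json
from typing import List, Optional

# Assumption: one published MCP protocol revision; your client may use another.
PROTOCOL_VERSION = "2024-11-05"


def jsonrpc_request(request_id: int, method: str, params: Optional[dict] = None) -> str:
    """Serialize a JSON-RPC 2.0 request as a single newline-terminated line."""
    message = {"jsonrpc": "2.0", "id": request_id, "method": method}
    if params is not None:
        message["params"] = params
    return json.dumps(message) + "\n"


def handshake_messages(client_name: str = "example-client",
                       client_version: str = "0.0.1") -> List[str]:
    """Build the opening messages: initialize, initialized, tools/list."""
    initialize = jsonrpc_request(1, "initialize", {
        "protocolVersion": PROTOCOL_VERSION,
        "capabilities": {},
        "clientInfo": {"name": client_name, "version": client_version},
    })
    # Notifications carry no "id" because no response is expected.
    initialized = json.dumps(
        {"jsonrpc": "2.0", "method": "notifications/initialized"}
    ) + "\n"
    list_tools = jsonrpc_request(2, "tools/list")
    return [initialize, initialized, list_tools]
```

In practice an MCP client library handles this exchange for you; the sketch is only meant to show what travels over the pipe.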
Find models using semantic search across available collections.
Browse the list of available models.
Retrieve detailed information about a specific model, including versions.
Browse model collections to discover related assets.
Get details about a specific collection, including its models.
Run a model with your inputs to generate a prediction.
Run a prediction and wait until it completes, returning the result.
Check the status and result of a previously started prediction.
Cancel an ongoing prediction.
List your recent predictions for quick re-access.
Open a generated image in your browser.
Remove cached images to reclaim space or refresh visuals.
Check statistics for the image cache usage.
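The "run a prediction and wait until it completes" behavior listed above can be pictured as a polling loop like the one sketched here. The `fetch` callable is a hypothetical stand-in for whatever call returns the current prediction state, and the terminal status names (`succeeded`, `failed`, `canceled`) are assumed from Replicate's public API rather than taken from this server's code.

```python
import time
from typing import Callable

# Assumed terminal states of a Replicate prediction.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def wait_for_prediction(
    fetch: Callable[[], dict],
    interval: float = 1.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> dict:
    """Poll `fetch` until the prediction reaches a terminal status.

    Raises TimeoutError if no terminal status is seen before the deadline.
    """
    deadline = clock() + timeout
    while True:
        prediction = fetch()
        if prediction.get("status") in TERMINAL_STATUSES:
            return prediction
        if clock() >= deadline:
            raise TimeoutError("prediction did not finish in time")
        sleep(interval)
```

Passing `sleep` and `clock` as parameters keeps the loop easy to exercise without real delays, for example by feeding it a fixed sequence of states.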