Replicate
Replicate lets you run machine learning models in the cloud with a simple API. Thousands of open-source models are available, and you can run your own custom models at scale. Run image generation, language models, audio synthesis, video generation, and more with a few lines of code. Replicate makes AI accessible to every software engineer.
1 APIs
16 Features
Artificial IntelligenceMachine LearningImage GenerationLanguage ModelsModel Deployment
APIs
Replicate
Replicate lets you run machine learning models in the cloud with a simple REST API. Access thousands of open-source models for image generation, language modeling, audio synthes...
Features
T4 GPU at $0.000225/sec (cheapest)
L40S GPU at $0.000975/sec
A100 80GB at $0.00140/sec
H100 at $0.001525/sec (highest performance)
Pay only for execution time (per second)
Default 10 predictions/sec; can be raised to 100 on paid
Other endpoints: 60 req/sec
Public model library with thousands of models
Cog framework for packaging your own models
Deployments for low-latency inference (charges idle time)
Webhooks for prediction completion
OAuth 2.0 and API tokens
Streaming output for LLM models
Files input via signed URLs
Training service for fine-tuning
Trainings billed at hardware rate
Event Specifications
Replicate Streaming and Webhooks API
AsyncAPI definition for Replicate's event-driven surfaces: - Server-Sent Events (SSE) stream returned for predictions where the model supports streaming output. The stream URL i...
ASYNCAPISemantic Vocabularies
API Governance Rules
Resources
🔗
PostmanWorkspace
PostmanWorkspace
🔗
ArazzoWorkflows
ArazzoWorkflows
🔗
LinkedIn
LinkedIn
🔗
Website
Website
🔗
Documentation
Documentation
💰
Pricing
Pricing
📰
Blog
Blog
📄
ChangeLog
ChangeLog
📜
TermsOfService
TermsOfService
📜
PrivacyPolicy
PrivacyPolicy
📝
SignUp
SignUp
🔗
Login
Login
🔗
Playground
Playground
👥
GitHubOrganization
GitHubOrganization
📦
SDKs
SDKs
📦
Python SDK
Python SDK
📦
Node.js SDK
Node.js SDK
📦
Go SDK
Go SDK
📦
Swift SDK
Swift SDK
🔗
Cog
Cog
🟢
StatusPage
StatusPage
🔗
MCPServer
MCPServer
🔗
AgentSkill
AgentSkill
🔗
LLMsTxt
LLMsTxt