Replicate pricing - 2026
Replicate API starts at $0.0300/1M input tokens. Prices effective since May 27, 2026.
Prices effective since May 27, 2026. Verified May 27, 2026. Confirm at official pricing page before billing.
Cost calculator
Estimated monthly cost · 70% input / 30% output split
+34 more models not shown
Price history
Input price per 1M tokens - tracked from Apr 5, 2026
Prices scraped daily from official provider documentation. Chart shows input token pricing.
Replicate usage limits by plan
| Predictions | 50 predictions | Free credits for new users |
| Models | Unlimited | Public models only |
| Private | Not available | No private deployments |
| Gpu Access | Not available | Limited GPU on free |
| Predictions | Unlimited | Per-second GPU billing |
| Models | Unlimited | All public + private models |
| Private | Unlimited | Private model deployments |
| Gpu Access | Unlimited | A100, H100, T4 available |
| Predictions | Unlimited | Shared billing |
| Users | Unlimited | |
| Private | Unlimited | |
| Spend Limit | Unlimited | Configurable spend cap |
Replicate features and capabilities
Generation
| Streaming output | ✓ Yes | Stream tokens in real-time | |
| LLM inference | ✓ Yes | Llama, Mistral, etc. | |
| ML model hosting | ✓ Yes | Run open-source models | |
| Custom model deployment | ✓ Yes | Deploy your own Cog models | |
| Image generation | ✓ Yes | SDXL, Flux, etc. | |
| Video generation | ✓ Yes | Wan, AnimateDiff, etc. |
Integrations & API
| REST API | ✓ Yes | Simple prediction API | |
| Webhooks | ✓ Yes | Async prediction callbacks |
Related pages
About Replicate API pricing
Replicate API pricing is set by Replicate and billed per million tokens processed. Input tokens (your prompt) and output tokens (the response) are priced separately.
Prices on this page are sourced from official Replicate documentation and updated when Replicate announces pricing changes. Check the official Replicate pricing page for the most current rates.
Weekly AI pricing & uptime digest
Price drops, new model releases, and incident summaries - every Monday. Free.