Replicate pricing - 2026

Status: Operational

Replicate API starts at $0.0300/1M input tokens. Prices effective since May 27, 2026.

Prices shown are sourced from official Replicate documentation and updated automatically when changes are detected. Prices effective since May 27, 2026. Verified May 27, 2026. Always confirm current pricing at Replicate's official pricing page before making billing decisions. Tickerr is not affiliated with Replicate.

Cost calculator

Estimated monthly cost · 70% input / 30% output split

Model                          Estimated monthly cost
IBM Granite 3.3 8B Instruct    $0.096 (cheapest)
Meta Llama 3 8B                $0.110
Meta Llama 3 8B Instruct       $0.110
Mistral 7B Instruct v0.2       $0.110
Mistral 7B v0.1                $0.110
Meta Llama 2 7B                $0.110

+34 more models not shown

Price history

Input price per 1M tokens - tracked from Apr 5, 2026

Prices scraped daily from official provider documentation. Chart shows input token pricing.

Replicate usage limits by plan

Free (price: Free)
  Predictions: 50 predictions (free credits for new users)
  Models: Unlimited (public models only)
  Private: Not available (no private deployments)
  GPU access: Not available (limited GPU on free)

Pay-as-you-go (price: Free)
  Predictions: Unlimited (per-second GPU billing)
  Models: Unlimited (all public + private models)
  Private: Unlimited (private model deployments)
  GPU access: Unlimited (A100, H100, T4 available)

Team (price: $100/month)
  Predictions: Unlimited (shared billing)
  Users: Unlimited
  Private: Unlimited
  Spend limit: Unlimited (configurable spend cap)

Replicate features and capabilities

Generation

Streaming output: Yes (stream tokens in real time)
LLM inference: Yes (Llama, Mistral, etc.)
ML model hosting: Yes (run open-source models)
Custom model deployment: Yes (deploy your own Cog models)
Image generation: Yes (SDXL, Flux, etc.)
Video generation: Yes (Wan, AnimateDiff, etc.)

Integrations & API

REST API: Yes (simple prediction API)
Webhooks: Yes (async prediction callbacks)

About Replicate API pricing

Replicate API pricing is set by Replicate and billed per million tokens processed. Input tokens (your prompt) and output tokens (the response) are priced separately.
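
Because input and output tokens are priced separately, a monthly estimate depends on how traffic divides between the two. The sketch below shows one way to compute a blended cost using the 70% input / 30% output split from the cost calculator. The function name, the 1M-token monthly volume, and the output price are illustrative assumptions, not figures from this page. Only the $0.03 input price comes from the "starts at" rate quoted above.

```python
def estimate_monthly_cost(total_tokens, input_price_per_m, output_price_per_m,
                          input_share=0.70):
    """Estimate cost in USD for a monthly token volume.

    total_tokens       -- total tokens processed per month (input + output)
    input_price_per_m  -- USD per 1M input tokens
    output_price_per_m -- USD per 1M output tokens
    input_share        -- fraction of tokens that are input (0.70 = 70/30 split)
    """
    if not 0.0 <= input_share <= 1.0:
        raise ValueError("input_share must be between 0 and 1")
    if total_tokens < 0:
        raise ValueError("total_tokens must be non-negative")

    input_tokens = total_tokens * input_share
    output_tokens = total_tokens - input_tokens

    # Prices are quoted per million tokens, so divide by 1,000,000 at the end.
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical example: 1M tokens per month, $0.03 per 1M input tokens
# (the quoted starting rate) and an ASSUMED $0.25 per 1M output tokens.
example_cost = estimate_monthly_cost(
    total_tokens=1_000_000,
    input_price_per_m=0.03,
    output_price_per_m=0.25,  # placeholder, not taken from this page
)
print(f"Estimated monthly cost: ${example_cost:.3f}")
```

Replace the placeholder output price and volume with the rates listed on Replicate's official pricing page for the model in question before relying on the result.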

Prices on this page are sourced from official Replicate documentation and updated when Replicate announces pricing changes. Check the official Replicate pricing page for the most current rates.
