NEWTickerr MCP is live →
tickerr

Tickerr / Limits / Mistral

Mistral rate limits, context window & usage caps (2026)

Mistral AI API rate limits, context window and usage caps. Rate limits and context window sizes - Free and Pro plans compared.

Context Window

32000

tokens · ~24K words

Plans

2

tiers tracked

API Tiers

2

rate limit tiers

Mistral usage limits by plan

Rpm1 requests/minFree API tier
Tpm500000 tokens/minFree API tier
Context Window32000 tokens(~24K words)Mistral 7B Instruct
Max Tokens Output8192 tokensPer response
Models Available2 modelsMistral 7B, Mixtral 8x7B
Pro$14.99/month
View pricing →
Rpm60 requests/minPaid API tier
Tpm2000000 tokens/minPaid API tier
Context Window128000 tokens(~96K words)Mistral Large
Max Tokens Output8192 tokensPer response
Models Available5 modelsAll Mistral models

Mistral API rate limits by tier

API access uses a tiered rate limit system. Higher tiers unlock more requests per minute (RPM) and tokens per minute (TPM).

TierRPMTPM
Free1500,000
Paid202,000,000

RPM = requests per minute · TPM = tokens per minute. Limits shown are approximate and may vary by model.

What happens when you hit Mistral's limits?

⚠ Rate limit triggered: When you exceed Mistral's rate limits, new requests are temporarily blocked until your usage window resets. Consumer apps show an in-app warning; API integrations receive HTTP 429.
1Wait

Check the reset window - most limits refresh within 1–60 minutes

2Retry

Use exponential backoff: 1s → 2s → 4s up to 60s max

3Upgrade

If you hit limits regularly, upgrade your plan to increase caps

HTTP 429 · Retry-After header · exponential backoff · monitor x-ratelimit-remaining-requests

Mistral limit reset schedule

Per minute

API RPM limits - reset every 60 seconds

🕐

Per hour

Short rolling windows for message quotas

Per 5 hours

Common for consumer plan message limits

📅

Per day / month

Image gen credits and file storage caps

Exact reset period per limit type is shown in the "Notes" column of the plan table above. Mistral uses rolling-window resets - quotas refresh continuously, not at a fixed midnight cutoff.

More Mistral intelligence

Limits sourced from Mistral AI's official documentation. Updated when plan changes are announced.

Mistral limits - frequently asked questions

What is the Mistral message limit?

Mistral message limits vary by plan - see the full breakdown by tier in the table above.

Does Mistral have a file upload limit?

Yes, Mistral enforces file upload limits that vary by plan. See the detailed breakdown above.

When do Mistral limits reset?

Reset periods vary by limit type - many Mistral limits reset on a rolling window (e.g., per 5 hours or per 24 hours). Check the notes column in the table above for specific reset schedules.

What happens when you hit Mistral's rate limit?

Mistral will temporarily block new requests when you exceed your plan's limits. You may see an in-app message or receive an HTTP 429 response. Wait for the reset window to pass or upgrade your plan.

What is Mistral's context window?

Mistral's context window is 32000 tokens (~24K words). This is the maximum amount of text - including your conversation history - the model can process in a single request.