NEWTickerr MCP is live →
tickerr

Tickerr / Limits / Cohere Command R

Cohere Command R rate limits, context window & usage caps (2026)

Cohere API rate limits and context window. Rate limits and usage caps for Command R and Command R+ models.

Context Window

128000

tokens · ~96K words

Plans

2

tiers tracked

API Tiers

2

rate limit tiers

Cohere Command R usage limits by plan

Rate Limit20 requests/minFree tier rate limit
Context Window128000 tokens(~96K words)
Embedding Dimensions1024 dimensionsEmbed v3 dimensions
Supported Languages100+ languages
Pay-as-you-goFree
View pricing →
Rate Limit10000 requests/minProduction tier
Context Window128000 tokens(~96K words)

Cohere Command R API rate limits by tier

API access uses a tiered rate limit system. Higher tiers unlock more requests per minute (RPM) and tokens per minute (TPM).

TierRPMTPM
Trial10n/a
ProductionUnlimitedn/a

RPM = requests per minute · TPM = tokens per minute. Limits shown are approximate and may vary by model.

What happens when you hit Cohere Command R's limits?

⚠ Rate limit triggered: When you exceed Cohere Command R's rate limits, new requests are temporarily blocked until your usage window resets. Consumer apps show an in-app warning; API integrations receive HTTP 429.
1Wait

Check the reset window - most limits refresh within 1–60 minutes

2Retry

Use exponential backoff: 1s → 2s → 4s up to 60s max

3Upgrade

If you hit limits regularly, upgrade your plan to increase caps

HTTP 429 · Retry-After header · exponential backoff · monitor x-ratelimit-remaining-requests

Cohere Command R limit reset schedule

Per minute

API RPM limits - reset every 60 seconds

🕐

Per hour

Short rolling windows for message quotas

Per 5 hours

Common for consumer plan message limits

📅

Per day / month

Image gen credits and file storage caps

Exact reset period per limit type is shown in the "Notes" column of the plan table above. Cohere Command R uses rolling-window resets - quotas refresh continuously, not at a fixed midnight cutoff.

More Cohere Command R intelligence

Limits sourced from Cohere's official documentation. Updated when plan changes are announced.

Cohere Command R limits - frequently asked questions

What is the Cohere Command R message limit?

Cohere Command R message limits vary by plan - see the full breakdown by tier in the table above.

Does Cohere Command R have a file upload limit?

Yes, Cohere Command R enforces file upload limits that vary by plan. See the detailed breakdown above.

When do Cohere Command R limits reset?

Reset periods vary by limit type - many Cohere Command R limits reset on a rolling window (e.g., per 5 hours or per 24 hours). Check the notes column in the table above for specific reset schedules.

What happens when you hit Cohere Command R's rate limit?

Cohere Command R will temporarily block new requests when you exceed your plan's limits. You may see an in-app message or receive an HTTP 429 response. Wait for the reset window to pass or upgrade your plan.

What is Cohere Command R's context window?

Cohere Command R's context window is 128000 tokens (~96K words). This is the maximum amount of text - including your conversation history - the model can process in a single request.