Gemini rate limits, context window & usage caps (2026)
Gemini rate limits, context window and usage caps by plan. Gemini 1.5 Pro context window (1M tokens), message limits, file upload caps (2GB) - Free and Advanced plans compared.
Context Window
1000000
tokens · ~750K words
Plans
2
tiers tracked
API Tiers
2
rate limit tiers
Gemini usage limits by plan
| Messages / day | Limited | No published fixed limit |
| File Upload MB | 2048 MB | 2GB per file |
| Context Window | 1000000 tokens(~750K words) | Gemini 1.5 Pro context |
| Image Generation / day | Not available | Not available on Free |
| File Upload Size | 2048 MB | Per file |
| Gems Custom Ai | Not available | Not available on Free |
| Messages / day | Unlimited | No published cap |
| File Upload MB | 2048 MB | 2GB per file |
| Gems | Unlimited | Custom Gems available |
| Context Window | 1000000 tokens(~750K words) | Gemini 1.5 Pro context |
| File Upload Size | 2048 MB | Per file |
| Gems Custom Ai | Included | Create custom Gems |
| Image Generation / day | Unlimited | Imagen 3 included |
Gemini API rate limits by tier
API access uses a tiered rate limit system. Higher tiers unlock more requests per minute (RPM) and tokens per minute (TPM).
| Tier | RPM | TPM |
|---|---|---|
| Free (AI Studio) | 15 | 1,000,000 |
| Pay-as-you-go | 1,000 | 4,000,000 |
RPM = requests per minute · TPM = tokens per minute. Limits shown are approximate and may vary by model.
What happens when you hit Gemini's limits?
Check the reset window - most limits refresh within 1–60 minutes
Use exponential backoff: 1s → 2s → 4s up to 60s max
If you hit limits regularly, upgrade your plan to increase caps
Gemini limit reset schedule
⚡
Per minute
API RPM limits - reset every 60 seconds
🕐
Per hour
Short rolling windows for message quotas
⏱
Per 5 hours
Common for consumer plan message limits
📅
Per day / month
Image gen credits and file storage caps
Exact reset period per limit type is shown in the "Notes" column of the plan table above. Gemini uses rolling-window resets - quotas refresh continuously, not at a fixed midnight cutoff.
More Gemini intelligence
Live status →
Check if rate limit errors are due to an active outage
Pricing →
Compare Gemini plan costs and API token pricing
Free tier →
Compare free limits across all AI tools
Limits sourced from Google's official documentation. Updated when plan changes are announced.
Gemini limits - frequently asked questions
What is the Gemini message limit?
Gemini message limits vary by plan - see the full breakdown by tier in the table above.
Does Gemini have a file upload limit?
Yes, Gemini enforces file upload limits that vary by plan. See the detailed breakdown above.
When do Gemini limits reset?
Reset periods vary by limit type - many Gemini limits reset on a rolling window (e.g., per 5 hours or per 24 hours). Check the notes column in the table above for specific reset schedules.
What happens when you hit Gemini's rate limit?
Gemini Advanced shows a usage warning when approaching limits. The Gemini API returns HTTP 429 (RESOURCE_EXHAUSTED) with a retry delay. Free tier is capped at 15 RPM.
What is Gemini's context window?
Gemini's context window is 1000000 tokens (~750K words). This is the maximum amount of text - including your conversation history - the model can process in a single request.