What is Groq's uptime?

Groq has had 97.5% uptime over the last 90 days with 20 incidents recorded. Data sourced from Tickerr's automated monitoring at tickerr.ai/status/groq.

How do I check if Groq is having an outage?

Visit tickerr.ai/status/groq for real-time Groq status. Tickerr checks Groq every 5 minutes via automated monitoring. The page shows current status, response time, 90-day uptime chart, and full incident history. If Groq shows Operational but you have issues, the problem may be account-specific.

Why is Groq not working?

If Groq is not working, check the current status at tickerr.ai/status/groq. If status is Operational, the issue is likely account-specific or a local network problem. If status shows Down or Degraded, it is a confirmed widespread outage - check the incident history for details and estimated resolution.

What is Groq's current time-to-first-token (TTFT)?

Tickerr measures Groq API TTFT (time-to-first-token) every 5 minutes via authenticated API calls. Data appears after the first check runs. Visit tickerr.ai/status/groq for live readings.

Which Groq model has the fastest response time?

Based on 24-hour median TTFT, the fastest Groq model is llama-3.3-70b-versatile at 226ms TTFT. Tickerr measures all models every 5 minutes via authenticated API calls.

Tickerr / Status / Groq

Is Groq down right now?

Q: Is Groq down right now?

Groq is currently operational with a 337ms response time. Last checked by Tickerr 2026-07-11T23:15:36.352Z. 90-day uptime: 97.5%. Tickerr monitors Groq every 5 minutes.

Authenticated API inference - 2 models monitored · How we classify outages

Groq is currently operational - 337ms HTTP response. Last checked . 90-day uptime: 97.5%. Groq API: all 2 models responding - fastest TTFT 226ms.

Operational337ms response

Stay informed

HTTP uptime (90d)

97.5%

20 incidents (90d)

HTTP response now

337ms

HTTP p50 (7d)

385ms

median ping response

HTTP p95 (7d)

1606ms

tail ping response

API Inference Monitoring

Live · every 5 min

Best TTFT (p50)

226ms

time to first token

Best throughput

1579tok/s

output tokens/sec (24h avg)

Min success rate

100%

worst model (24h)

ModelStatusTTFT nowp50 TTFTp95 TTFT24h uptime

Llama 3.3 70B87 checks

223ms

226ms

449ms

100%

Llama 4 Scout87 checks

200ms

330ms

825ms

100%

P50 = typical speed. P95 = worst case 95% of the time. Measured by Tickerr's independent inference checks. Requires ≥10 checks to display.

TTFT over 24 hours

ⓘ Authenticated streaming API calls via native fetch. TTFT = milliseconds from request start to first streamed token chunk. Throughput = output tokens ÷ generation time. Checks run from Vercel us-east-1. Independent of the provider's official status page.

Agent monitoring active · 2 agents reporting · Powered by Tickerr MCP

HTTP endpoint response time (7 days)

p50 385ms·p95 1606ms

ⓘ HTTP response times to Groq's status endpoint - measures infrastructure availability, not API inference speed. For TTFT and model-level API status, see the Groq API Status section above.

90-day uptime

May 26 99.9%Jun 26 100%Jul 26 100%

97.5%

90-day uptime · HTTP + API inference

Apr 13Today

100%99–99.9%95–99% or API failures<95% or major API failures

HTTP pings + API inference checks · checked every 5 min · 90 days

Incident history

DegradedResolvedSynthetic probe

Jul 7, 2026

10:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 4.1× above the rolling p50 baseline (1600ms vs p50 391ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

34m

DegradedResolvedSynthetic probe

Jul 6, 2026

12:00 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 13× above the rolling p50 baseline (5376ms vs p50 415ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct

29m

DegradedResolvedSynthetic probe

Jul 4, 2026

09:15 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 5.3× above the rolling p50 baseline (1920ms vs p50 359ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

34m

DegradedResolvedSynthetic probe

Jul 3, 2026

10:30 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.1× above the rolling p50 baseline (886ms vs p50 419ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jul 2, 2026

03:30 PM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 6.1× above the rolling p50 baseline (1425ms vs p50 233ms). The service is responding but slower than n…

llama-3.3-70b-versatile

34m

Partial OutageResolvedOfficial

Jul 1, 2026

11:11 PM UTC

Data Center Failure Impacting Capacity ↗

Status: Identified We have identified a cooling system failure at one of our US Central data centers that is causing reduced capacity. The team is working on restoring capacity. Users continue to see …

1h 34m

DegradedResolvedSynthetic probe

Jul 1, 2026

12:00 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.2× above the rolling p50 baseline (831ms vs p50 379ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jul 1, 2026

06:15 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.1× above the rolling p50 baseline (1040ms vs p50 332ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jun 30, 2026

10:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.8× above the rolling p50 baseline (1107ms vs p50 392ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jun 29, 2026

09:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.5× above the rolling p50 baseline (1022ms vs p50 404ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jun 26, 2026

06:15 AM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 7.7× above the rolling p50 baseline (1838ms vs p50 240ms). The service is responding but slower than n…

llama-3.3-70b-versatile

30m

DegradedResolvedSynthetic probe

Jun 25, 2026

07:30 PM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 6.9× above the rolling p50 baseline (1661ms vs p50 241ms). The service is responding but slower than n…

llama-3.3-70b-versatile

30m

DegradedResolvedSynthetic probe

Jun 25, 2026

12:15 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.9× above the rolling p50 baseline (3633ms vs p50 941ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

34m

DegradedResolvedSynthetic probe

Jun 24, 2026

12:30 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.2× above the rolling p50 baseline (1331ms vs p50 606ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

34m

DegradedResolvedSynthetic probe

Jun 23, 2026

01:01 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 4.8× above the rolling p50 baseline (4263ms vs p50 889ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

29m

DegradedResolvedSynthetic probe

Jun 23, 2026

05:30 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.7× above the rolling p50 baseline (1922ms vs p50 524ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

49m

DegradedResolvedSynthetic probe

Jun 22, 2026

04:15 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 7.1× above the rolling p50 baseline (5046ms vs p50 710ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

30m

DegradedResolvedSynthetic probe

Jun 22, 2026

12:01 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.8× above the rolling p50 baseline (1512ms vs p50 394ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

33m

DegradedResolvedSynthetic probe

Jun 19, 2026

06:30 AM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 2.3× above the rolling p50 baseline (516ms vs p50 227ms). The service is responding but slower than no…

llama-3.3-70b-versatile

30m

DegradedResolvedSynthetic probe

Jun 18, 2026

01:00 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 5.2× above the rolling p50 baseline (2158ms vs p50 414ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct

29m

Groq message limits and upload caps

Message limits, file upload caps, rate limits by plan

API pricing

Cost per token by model

Compare tools

Side-by-side pricing & uptime

Token counter

Estimate API cost before you send

Groq API not working? Common error codes

If Groq's API is returning errors, the table below explains what each code means and how to fix it. If errors are widespread, check the live status above - a service incident will appear there within minutes.

Error	What it means & what to do
HTTP 429	Rate limit hit - check x-ratelimit-remaining headers; free tier is 30 RPM
HTTP 503	Service overloaded - Groq queues fill fast under heavy load; retry

Note: Tickerr monitors Groq's status endpoint, not individual API calls. An HTTP 429 or 500 in your app may be specific to your account tier - check the rate limits page for plan-specific thresholds.

About Groq status

Groq is an AI inference provider offering extremely fast LLM inference using custom LPU hardware. Groq downtime is rare due to their hardware architecture but can affect API users building latency-sensitive apps. Groq offers free and paid tiers with separate rate limits.

Is Groq down right now?

API Inference Monitoring

HTTP endpoint response time (7 days)

90-day uptime

Incident history

Related pages

Groq API not working? Common error codes

About Groq status