
Groq

Is Groq down right now?

Authenticated API inference - 2 models monitored · How we classify outages

Groq is currently operational - 215ms HTTP response. 90-day uptime: 78.5%. Groq API: both monitored models responding - fastest TTFT 194ms.

Operational · 215ms response

Groq status

HTTP uptime (90d): 78.5% · 16 incidents (90d)
HTTP response now: 215ms
HTTP p50 (7d): 368ms (median ping response)
HTTP p95 (7d): 1119ms (tail ping response)

API Inference Monitoring

Live · every 5 min

Best TTFT (p50): 194ms (time to first token)
Best throughput: 1508 tok/s (output tokens/sec, 24h avg)
Min success rate: 100% (worst model, 24h)

Model            Checks   TTFT now   p50 TTFT   p95 TTFT   24h uptime
Llama 3.3 70B    95       311ms      194ms      391ms      100%
Llama 4 Scout    95       604ms      349ms      811ms      100%

P50 = typical (median) speed. P95 = the latency that 95% of checks come in at or under. Measured by Tickerr's independent inference checks. Requires ≥10 checks to display.
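As an illustration of the P50/P95 figures above, here is a minimal Python sketch that computes them from a list of TTFT samples. It assumes a nearest-rank percentile, which may not be the method Tickerr uses; the names `percentile`, `summarize_ttft`, and `MIN_CHECKS` are hypothetical.

```python
import math

# The page states that at least 10 checks are required before values display.
MIN_CHECKS = 10


def percentile(samples, pct):
    """Nearest-rank percentile: the smallest sample such that at least
    pct percent of all samples are at or below it. pct is an integer 1..100."""
    if not samples:
        raise ValueError("no samples")
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def summarize_ttft(ttft_ms):
    """Return p50 and p95 for TTFT samples in milliseconds,
    or None when there are too few checks to report."""
    if len(ttft_ms) < MIN_CHECKS:
        return None
    return {"p50": percentile(ttft_ms, 50), "p95": percentile(ttft_ms, 95)}
```

With only a handful of samples the p95 is effectively the slowest check, which is one plausible reason to require a minimum number of checks before showing it.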

TTFT over 24 hours

ⓘ Authenticated streaming API calls via native fetch. TTFT = milliseconds from request start to first streamed token chunk. Throughput = output tokens ÷ generation time. Checks run from Vercel us-east-1. Independent of the provider's official status page.
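The two definitions above can be sketched in Python as follows. This is an illustration only: the `StreamTiming` record and both function names are hypothetical, and the sketch assumes "generation time" runs from the first streamed chunk to the last, which the page does not state.

```python
from dataclasses import dataclass


@dataclass
class StreamTiming:
    """Timestamps in seconds from one streaming call (e.g. time.monotonic())."""
    request_start: float   # when the request was sent
    first_chunk: float     # when the first streamed token chunk arrived
    last_chunk: float      # when the final chunk arrived
    output_tokens: int     # number of output tokens generated


def ttft_ms(t: StreamTiming) -> float:
    """TTFT = milliseconds from request start to first streamed token chunk."""
    return (t.first_chunk - t.request_start) * 1000


def throughput_tok_s(t: StreamTiming) -> float:
    """Throughput = output tokens / generation time.
    Assumption: generation time is measured first chunk to last chunk."""
    generation_time = t.last_chunk - t.first_chunk
    if generation_time <= 0:
        raise ValueError("generation time must be positive")
    return t.output_tokens / generation_time
```

For example, a call that starts at 10.0s, receives its first chunk at 10.25s, and finishes 1500 tokens at 11.25s has a TTFT of 250ms and a throughput of 1500 tok/s under these assumptions.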

Agent monitoring active · 13 agents reporting · Powered by Tickerr MCP

HTTP endpoint response time (7 days)

p50 368ms · p95 1119ms

HTTP response times to Groq's status endpoint - measures infrastructure availability, not API inference speed. For TTFT and model-level API status, see the Groq API Status section above.

90-day uptime

78.5% · 90-day uptime · HTTP + API inference
[Chart: daily uptime, Feb 27 to today. Labeled points: Apr 26: 28.6% · May 26: 99.9%]
Legend: 100% · 99–99.9% · 95–99% or API failures · <95% or major API failures
HTTP pings + API inference checks · checked every 5 min · 90 days

Incident history

Major Outage · Resolved · Synthetic probe
May 25, 2026
05:25 PM UTC
Groq service disruption detected
Duration: 5m

Degraded · Resolved · Synthetic probe
May 22, 2026
09:00 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.7× above the rolling p50 baseline (2255ms vs p50 603ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 29m

Degraded · Resolved · Synthetic probe
May 20, 2026
03:30 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 4.9× above the rolling p50 baseline (3116ms vs p50 635ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 34m

Degraded · Resolved · Synthetic probe
May 19, 2026
12:30 PM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 10.3× above the rolling p50 baseline (2508ms vs p50 243ms). The service is responding but slower than …

llama-3.3-70b-versatile
Duration: 34m

Degraded · Resolved · Synthetic probe
May 18, 2026
04:00 PM UTC

llama-3.3-70b-versatile API Latency Degraded

Independent monitoring detected elevated API latency for llama-3.3-70b-versatile. Current TTFT is 25.4× above the rolling p50 baseline (6589ms vs p50 259ms). The service is responding but slower than …

llama-3.3-70b-versatile
Duration: 34m

Degraded · Resolved · Synthetic probe
May 18, 2026
08:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.1× above the rolling p50 baseline (643ms vs p50 307ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 34m

Degraded · Resolved · Synthetic probe
May 18, 2026
03:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.4× above the rolling p50 baseline (633ms vs p50 269ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 30m

Degraded · Resolved · Synthetic probe
May 16, 2026
10:30 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.2× above the rolling p50 baseline (432ms vs p50 195ms). The service is responding …

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 34m

Degraded · Resolved · Synthetic probe
May 15, 2026
04:01 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 7.7× above the rolling p50 baseline (1897ms vs p50 245ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 33m

Degraded · Resolved · Synthetic probe
May 11, 2026
01:30 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 7.7× above the rolling p50 baseline (3827ms vs p50 495ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 34m

Degraded · Resolved · Synthetic probe
May 8, 2026
01:45 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.5× above the rolling p50 baseline (1157ms vs p50 472ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 30m

Degraded · Resolved · Synthetic probe
May 7, 2026
02:00 PM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3× above the rolling p50 baseline (2677ms vs p50 900ms). The service is responding b…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 30m

Degraded · Resolved · Synthetic probe
May 7, 2026
09:45 AM UTC

meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 2.9× above the rolling p50 baseline (1679ms vs p50 582ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 34m

Degraded · Resolved · Synthetic probe
May 5, 2026
12:30 PM UTC
meta-llama/llama-4-scout-17b-16e-instruct API Latency Degraded

Independent monitoring detected elevated API latency for meta-llama/llama-4-scout-17b-16e-instruct. Current TTFT is 3.6× above the rolling p50 baseline (1642ms vs p50 455ms). The service is responding…

meta-llama/llama-4-scout-17b-16e-instruct
Duration: 4m

Major Outage · Resolved · Synthetic probe
Apr 7, 2026
08:55 AM UTC
Service disruption detected
Duration: 10d 1h

Partial Outage · Resolved · Official
Mar 19, 2026
01:50 PM UTC
openai/gpt-oss-120b Performance Issue

Status: Resolved The issues affecting openai/gpt-oss-120b have been resolved. The model is operating normally. Actions were taken to cancel billing plans and restrict verification status of organizati…

Duration: 0m
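The latency incidents above all report the current TTFT as a multiple of a rolling p50 baseline. A minimal Python sketch of that comparison follows. The function names are hypothetical, and the alert threshold is an assumption: the page does not publish one, and `DEGRADED_RATIO = 2.0` is chosen only because the smallest ratio listed above is 2.1×.

```python
# Hypothetical threshold; Tickerr's real cutoff is not stated on this page.
DEGRADED_RATIO = 2.0


def latency_ratio(current_ttft_ms: float, baseline_p50_ms: float) -> float:
    """How many times slower the current TTFT is than the rolling p50 baseline."""
    if baseline_p50_ms <= 0:
        raise ValueError("baseline must be positive")
    return current_ttft_ms / baseline_p50_ms


def is_degraded(current_ttft_ms: float, baseline_p50_ms: float,
                threshold: float = DEGRADED_RATIO) -> bool:
    """Flag a check as degraded when the ratio reaches the assumed threshold."""
    return latency_ratio(current_ttft_ms, baseline_p50_ms) >= threshold
```

Applied to the May 22 incident, `latency_ratio(2255, 603)` rounds to 3.7, matching the reported "3.7× above the rolling p50 baseline".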

Groq API not working? Common error codes

If Groq's API is returning errors, the table below explains what each code means and how to fix it. If errors are widespread, check the live status above - a service incident will appear there within minutes.

Error      What it means & what to do
HTTP 429   Rate limit hit - check x-ratelimit-remaining headers; free tier is 30 RPM
HTTP 503   Service overloaded - Groq queues fill fast under heavy load; retry
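One way to act on the two codes in the table is a capped exponential backoff. The Python sketch below is an assumption-laden illustration, not Groq's recommended client logic: `retry_delay`, `rate_limit_remaining`, and `call_with_retries` are hypothetical helpers, `send` stands in for whatever function performs your request, header names are assumed to be lowercased, and honoring a numeric `retry-after` header is a general HTTP convention rather than something this page confirms for Groq.

```python
import time

RETRYABLE = {429, 503}  # the two codes listed in the table above


def rate_limit_remaining(headers):
    """Collect every header whose name starts with x-ratelimit-remaining."""
    return {k: v for k, v in headers.items()
            if k.startswith("x-ratelimit-remaining")}


def retry_delay(status, headers, attempt, base=1.0, cap=30.0):
    """Seconds to wait before retrying, or None if the status is not retryable."""
    if status not in RETRYABLE:
        return None
    retry_after = headers.get("retry-after")
    if retry_after is not None:
        try:
            return min(cap, float(retry_after))
        except ValueError:
            pass  # HTTP-date form is not handled in this sketch
    return min(cap, base * 2 ** attempt)  # 1s, 2s, 4s, ... capped


def call_with_retries(send, max_attempts=4, sleep=time.sleep):
    """Call send() until it succeeds, fails non-retryably, or attempts run out.
    send() must return a (status, headers, body) tuple."""
    for attempt in range(max_attempts):
        status, headers, body = send()
        delay = retry_delay(status, headers, attempt)
        if delay is None or attempt == max_attempts - 1:
            return status, body
        sleep(delay)
```

Passing `sleep` as a parameter keeps the helper easy to exercise without real waiting, and `rate_limit_remaining` can be logged on a 429 to see which limit was exhausted.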

Note: Tickerr's HTTP checks target Groq's status endpoint and its inference checks run under Tickerr's own credentials, so neither sees your individual API calls. An HTTP 429 or 500 in your app may be specific to your account tier - check the rate limits page for plan-specific thresholds.

About Groq status

Groq is an AI inference provider offering extremely fast LLM inference using custom LPU hardware. Groq downtime is rare due to their hardware architecture but can affect API users building latency-sensitive apps. Groq offers free and paid tiers with separate rate limits.