Is Cerebras down right now?
Authenticated API inference - 2 models monitored · How we classify outages
Cerebras is currently down - API inference failing (5+ consecutive checks) - 356ms HTTP response. Last checked . 90-day uptime: 99.9%. Cerebras API: 0/2 models up - check inference section below.
llama3.1-8b API Outage
HTTP uptime (90d)
99.9%
20 incidents (90d)
HTTP response now
356ms
HTTP p50 (7d)
675ms
median ping response
HTTP p95 (7d)
5468ms
tail ping response
API Inference Monitoring
Live · every 5 min
Best TTFT (p50)
—
time to first token
Best throughput
—
output tokens/sec (24h avg)
Min success rate
0%
worst model (24h)
P50 = typical (median) speed. P95 = the value that 95% of checks came in at or below, so it reflects near-worst-case speed. Measured by Tickerr's independent inference checks. Requires ≥10 checks to display.
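As a rough illustration of how p50 and p95 figures like these could be derived from a batch of checks, here is a minimal sketch. The nearest-rank method, the `percentile` helper, and the sample values are assumptions for illustration; the page does not say which percentile method Tickerr uses. Only the 10-check minimum comes from the page.

```python
import math
from typing import Optional, Sequence

MIN_CHECKS = 10  # the page states at least 10 checks are required to display


def percentile(samples: Sequence[float], pct: float) -> Optional[float]:
    """Nearest-rank percentile (an assumed method); None if too few checks."""
    if len(samples) < MIN_CHECKS:
        return None
    ordered = sorted(samples)
    # 1-based nearest rank: smallest rank covering pct percent of samples.
    rank = math.ceil(pct * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


# Made-up TTFT samples in milliseconds, not real measurements.
ttft_ms = [120, 130, 135, 140, 150, 160, 180, 200, 450, 2200]
p50 = percentile(ttft_ms, 50)
p95 = percentile(ttft_ms, 95)
print(p50, p95)
```

A single slow check barely moves p50 but dominates p95, which is why the two are reported side by side.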
TTFT over 24 hours
ⓘ Authenticated streaming API calls via native fetch. TTFT = milliseconds from request start to first streamed token chunk. Throughput = output tokens ÷ generation time. Checks run from Vercel us-east-1. Independent of the provider's official status page.
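The two definitions above can be sketched as follows. This is not Tickerr's implementation: `measure_stream`, `StreamMetrics`, and the injectable `clock` are hypothetical names, the stream is represented as a plain iterable of per-chunk token counts rather than a real API response, and "generation time" is assumed to run from the first chunk to the end of the stream.

```python
import time
from dataclasses import dataclass
from typing import Callable, Iterable, Optional


@dataclass
class StreamMetrics:
    ttft_ms: Optional[float]         # None if the stream produced no chunks
    output_tokens: int
    tokens_per_sec: Optional[float]  # None if generation time was zero


def measure_stream(
    chunk_token_counts: Iterable[int],
    clock: Callable[[], float] = time.perf_counter,
) -> StreamMetrics:
    start = clock()  # request start
    first_chunk_at: Optional[float] = None
    tokens = 0
    for count in chunk_token_counts:
        if first_chunk_at is None:
            first_chunk_at = clock()  # first streamed chunk arrived
        tokens += count
    if first_chunk_at is None:
        return StreamMetrics(None, 0, None)
    end = clock()
    ttft_ms = (first_chunk_at - start) * 1000
    generation_s = end - first_chunk_at  # assumed definition of generation time
    tps = tokens / generation_s if generation_s > 0 else None
    return StreamMetrics(ttft_ms, tokens, tps)


# Deterministic demo with a fake clock: start, first chunk, end of stream.
fake_times = iter([0.0, 0.25, 1.25])
metrics = measure_stream([10, 20, 20], clock=lambda: next(fake_times))
print(metrics)
```

Injecting the clock keeps the timing logic testable without a network call. In a real check the iterable would be the chunks of a streaming response.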
Agent monitoring active · 6 agents reporting · Powered by Tickerr MCP
HTTP endpoint response time (7 days)
p50 675ms · p95 5468ms
ⓘ HTTP response times to Cerebras's status endpoint - measures infrastructure availability, not API inference speed. For TTFT and model-level API status, see the Cerebras API Status section above.
90-day uptime
Incident history
Customer using AWS billing may experience availability issues. Exact reason is being investigated.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for qwen-3-235b-a22b-instruct-2507. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
Independent monitoring detected consecutive API failures for llama3.1-8b. Tickerr measures time-to-first-token (TTFT) via live streaming API calls every 5 minutes.
qwen-3-235b-a22b-instruct-2507 API Latency Degraded
Independent monitoring detected elevated API latency for qwen-3-235b-a22b-instruct-2507. Current TTFT is 7.9× above the rolling p50 baseline (3692ms vs p50 469ms). The service is responding but slower…
llama3.1-8b API Latency Degraded
Independent monitoring detected elevated API latency for llama3.1-8b. Current TTFT is 16.5× above the rolling p50 baseline (2225ms vs p50 135ms). The service is responding but slower than normal. Tick…
qwen-3-235b-a22b-instruct-2507 API Latency Degraded
Independent monitoring detected elevated API latency for qwen-3-235b-a22b-instruct-2507. Current TTFT is 3.5× above the rolling p50 baseline (1314ms vs p50 374ms). The service is responding but slower…
qwen-3-235b-a22b-instruct-2507 API Latency Degraded
Independent monitoring detected elevated API latency for qwen-3-235b-a22b-instruct-2507. Current TTFT is 3.5× above the rolling p50 baseline (954ms vs p50 274ms). The service is responding but slower …
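The multipliers quoted in the latency incidents above are consistent with dividing the current TTFT by the rolling p50 baseline. A small sketch of that arithmetic, using the figures from those four entries (the `latency_ratio` helper is a hypothetical name, and the page does not state the threshold at which a model is marked degraded):

```python
def latency_ratio(current_ttft_ms: float, baseline_p50_ms: float) -> float:
    """How many times slower the current TTFT is than the rolling p50."""
    if baseline_p50_ms <= 0:
        raise ValueError("baseline p50 must be positive")
    return current_ttft_ms / baseline_p50_ms


# (current TTFT ms, rolling p50 ms) pairs as quoted in the incident entries.
incidents = [(3692, 469), (2225, 135), (1314, 374), (954, 274)]
ratios = [round(latency_ratio(cur, base), 1) for cur, base in incidents]
print(ratios)
```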
About Cerebras status
This page tracks the live operational status of Cerebras by Cerebras Systems. We check Cerebras every 5 minutes and record the result. If Cerebras is down or experiencing an outage, it will be reflected here within minutes. Historical uptime data covers the last 90 days.
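The status banner describes the current outage as "5+ consecutive checks" failing. One way such a rule could work is sketched below, assuming "down" means the most recent checks have all failed at least five times in a row. The function names and the list-of-booleans representation are hypothetical, and the actual classification rules are not reproduced on this page.

```python
from typing import Sequence

FAILURE_THRESHOLD = 5  # taken from the "5+ consecutive checks" wording


def trailing_failures(results: Sequence[bool]) -> int:
    """Count failed checks at the end of the history (True = check passed)."""
    count = 0
    for ok in reversed(results):
        if ok:
            break
        count += 1
    return count


def is_down(results: Sequence[bool], threshold: int = FAILURE_THRESHOLD) -> bool:
    """Assumed rule: down once the latest `threshold` checks have all failed."""
    return trailing_failures(results) >= threshold


# Oldest check first, one entry per 5-minute check.
history = [True, True, False, False, False, False, False]
print(is_down(history))
```

With checks every 5 minutes, a five-failure threshold means an outage would be flagged roughly 20 to 25 minutes after the first failed check under this assumed rule.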