LLM model pricing (2026) - API cost comparison

Token costs for 256 large language models from 10 providers. Input, output, and cached pricing per 1M tokens. Live latency benchmarks for 8 models updated every 5 minutes.

Recently added / updated (last 90 days)

Provider	Model	Price	Date
Meta	Muse Spark 1.1	$1.25/1M	Jul 12
xAI	Grok 4.5 Latest	$2.00/1M	Jul 10
xAI	Grok 4.5	$2.00/1M	Jul 10
Mistral AI	Mistral Medium 2604	$1.50/1M	Jun 27
Mistral AI	Mistral Medium 2508	$0.40/1M	Jun 27
Mistral AI	Mistral Medium Latest	$1.50/1M	Jun 27
Cohere	Command R7b 12 2024	$0.04/1M	Jun 21

256 models

Model	Provider	Input /1M	Output /1M	TTFT p50	HTTP p50	Tok/s	Outages
Whisper (audio)	OpenAI	$0.0060	—	—	—	—	10× 21h
Command R7b 12 2024	Cohere	$0.04	$0.15	—	—	—	—
Qwen Turbo	Alibaba Cloud	$0.05	$0.20	—	—	—	—
Qwen Turbo Latest	Alibaba Cloud	$0.05	$0.20	—	—	—	—
gpt-5-nano	OpenAI	$0.05	$0.40	—	—	—	10× 21h
Mistral Small Latest	Mistral AI	$0.06	$0.18	371ms	385ms	188	—
Mistral Small 3.2 2506	Mistral AI	$0.06	$0.18	—	—	—	—
Mistral 7b Instruct	Perplexity	$0.07	$0.28	—	—	—	—
Mixtral 8x7b Instruct	Perplexity	$0.07	$0.28	—	—	—	—
Pplx 7b Chat	Perplexity	$0.07	$0.28	—	—	—	—
Sonar Small Chat	Perplexity	$0.07	$0.28	—	—	—	—
Gemini 2.0 Flash Lite	Google	$0.07	$0.30	—	—	—	12× 8h
Gemini 2.0 Flash Lite 001	Google	$0.07	$0.30	—	—	—	12× 8h
Gemini 2.5 Flash Lite Preview 09.2025	Google	$0.10	$0.40	—	—	—	12× 8h
Gemini 2.5 Flash Lite Preview 06.17	Google	$0.10	$0.40	—	—	—	12× 8h
Gemini 2.0 Flash	Google	$0.10	$0.40	—	—	—	12× 8h
Gemini 2.0 Flash 001	Google	$0.10	$0.40	—	—	—	12× 8h
Gemini 2.5 Flash Lite	Google	$0.10	$0.40	—	—	—	12× 8h
Gemini Flash Lite Latest	Google	$0.10	$0.40	—	—	—	12× 8h
gpt-4.1-nano	OpenAI	$0.10	$0.40	—	—	—	10× 21h
Devstral Small 2505	Mistral AI	$0.10	$0.30	—	—	—	—
Devstral Small 2507	Mistral AI	$0.10	$0.30	—	—	—	—
Devstral Small Latest	Mistral AI	$0.10	$0.30	—	—	—	—
Labs Devstral Small 2512	Mistral AI	$0.10	$0.30	—	—	—	—
Mistral Small	Mistral AI	$0.10	$0.30	—	—	—	—
Ministral 3 3b 2512	Mistral AI	$0.10	$0.10	—	—	—	—
Embed v3 English (embedding model, input only)	Cohere	$0.10	—	—	—	—	—
Deepseek V4 Flash	DeepSeek	$0.14	$0.28	—	—	—	—
Deepseek Coder	DeepSeek	$0.14	$0.28	—	—	—	—
Ministral 8b Latest	Mistral AI	$0.15	$0.15	—	—	—	—
Ministral 8b 2512	Mistral AI	$0.15	$0.15	—	—	—	—
Command R	Cohere	$0.15	$0.60	—	—	—	—
Command R 08 2024	Cohere	$0.15	$0.60	—	—	—	—
Qwen3 Next 80b A3b Instruct	Alibaba Cloud	$0.15	$1.20	—	—	—	—
Qwen3 Next 80b A3b Thinking	Alibaba Cloud	$0.15	$1.20	—	—	—	—
gpt-4o-mini	OpenAI	$0.15	$0.60	871ms	887ms	67	10× 21h
gpt-4o-mini-audio-preview	OpenAI	$0.15	$0.60	—	—	—	10× 21h
gpt-4o-mini-search-preview	OpenAI	$0.15	$0.60	—	—	—	10× 21h
Pixtral 12b 2409	Mistral AI	$0.15	$0.15	—	—	—	—
Ministral 3 8b 2512	Mistral AI	$0.15	$0.15	—	—	—	—
Command Light	Cohere	$0.15	$0.60	—	—	—	—
Command R	Cohere	$0.15	$0.60	—	—	—	—
Qwen3 Vl 32b Instruct	Alibaba Cloud	$0.16	$0.64	—	—	—	—
Qwen3 Vl 32b Thinking	Alibaba Cloud	$0.16	$2.87	—	—	—	—
gpt-5.4-nano	OpenAI	$0.20	$1.25	—	—	—	10× 21h
Ministral 3 14b 2512	Mistral AI	$0.20	$0.20	—	—	—	—
Llama 3.1 8b Instruct	Perplexity	$0.20	$0.20	—	—	—	—
Grok 4 Fast Reasoning	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok 4 Fast Non Reasoning	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok Code Fast	xAI	$0.20	$1.50	—	—	—	1× 1h
Grok Code Fast 1	xAI	$0.20	$1.50	—	—	—	1× 1h
Grok 4.1 Fast	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok 4.1 Fast Reasoning	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok 4.1 Fast Reasoning Latest	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok 4.1 Fast Non Reasoning	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok 4.1 Fast Non Reasoning Latest	xAI	$0.20	$0.50	—	—	—	1× 1h
Grok Code Fast 1 0825	xAI	$0.20	$1.50	—	—	—	1× 1h
Gemini 3.1 Flash Lite	Google	$0.25	$1.50	—	—	—	12× 8h
Claude 3 Haiku 20240307	Anthropic	$0.25	$1.25	—	—	—	15× 37h
Claude 3 Haiku	Anthropic	$0.25	$1.25	—	—	—	15× 37h
Gemini 3.1 Flash Lite Preview	Google	$0.25	$1.50	—	—	—	12× 8h
gpt-5.1-codex-mini	OpenAI	$0.25	$2.00	—	—	—	10× 21h
gpt-5-mini	OpenAI	$0.25	$2.00	—	—	—	10× 21h
Codestral Mamba Latest	Mistral AI	$0.25	$0.25	—	—	—	—
Mistral Tiny	Mistral AI	$0.25	$0.25	—	—	—	—
Open Codestral Mamba	Mistral AI	$0.25	$0.25	—	—	—	—
Open Mistral 7b	Mistral AI	$0.25	$0.25	—	—	—	—
Deepseek V3	DeepSeek	$0.27	$1.10	—	—	—	—
Deepseek Chat	DeepSeek	$0.28	$0.42	—	—	—	—
Deepseek Reasoner	DeepSeek	$0.28	$0.42	—	—	—	—
Deepseek V3.2	DeepSeek	$0.28	$0.40	—	—	—	—
Command Light	Cohere	$0.30	$0.60	—	—	—	—
Qwen Coder	Alibaba Cloud	$0.30	$1.50	—	—	—	—
Gemini Live 2.5 Flash Preview Native Audio 09.2025	Google	$0.30	$2.00	—	—	—	12× 8h
Gemini Robotics Er 1.5 Preview	Google	$0.30	$2.50	—	—	—	12× 8h
Gemini 2.5 Flash Preview 09.2025	Google	$0.30	$2.50	—	—	—	12× 8h
Gemini 2.5 Flash	Google	$0.30	$2.50	—	—	—	12× 8h
Gemini Flash Latest	Google	$0.30	$2.50	—	—	—	12× 8h
Codestral 2508	Mistral AI	$0.30	$0.90	—	—	—	—
Open Mistral Nemo	Mistral AI	$0.30	$0.30	—	—	—	—
Open Mistral Nemo 2407	Mistral AI	$0.30	$0.30	—	—	—	—
Grok 3 Mini Beta	xAI	$0.30	$0.50	—	—	—	1× 1h
Grok 3 Mini Latest	xAI	$0.30	$0.50	—	—	—	1× 1h
Gemini 2.5 Flash Native Audio Preview 09.2025	Google	$0.30	$2.50	—	—	—	12× 8h
Gemini 2.5 Flash Native Audio Preview 12.2025	Google	$0.30	$2.50	—	—	—	12× 8h
Gemini 2.5 Flash Native Audio Latest	Google	$0.30	$2.50	—	—	—	12× 8h
Codestral	Mistral AI	$0.30	$0.90	—	—	—	—
Gemini Gemma 2 27b It	Google	$0.35	$1.05	—	—	—	12× 8h
Gemini Gemma 2 9b It	Google	$0.35	$1.05	—	—	—	12× 8h
Codellama 34b Instruct	Perplexity	$0.35	$1.40	—	—	—	—
Mistral Medium 2508	Mistral AI	$0.40	$2.00	—	—	—	—
babbage-002	OpenAI	$0.40	$0.40	—	—	—	10× 21h
Qwen Plus	Alibaba Cloud	$0.40	$1.20	—	—	—	—
Qwen3 Vl 235b A22b Instruct	Alibaba Cloud	$0.40	$1.60	—	—	—	—
Qwen3 Vl 235b A22b Thinking	Alibaba Cloud	$0.40	$4.00	—	—	—	—
gpt-4.1-mini	OpenAI	$0.40	$1.60	—	—	—	10× 21h
Devstral Medium 2507	Mistral AI	$0.40	$2.00	—	—	—	—
Devstral Latest	Mistral AI	$0.40	$2.00	—	—	—	—
Devstral Medium Latest	Mistral AI	$0.40	$2.00	—	—	—	—
Devstral 2512	Mistral AI	$0.40	$2.00	—	—	—	—
Mistral Medium 2505	Mistral AI	$0.40	$2.00	—	—	—	—
Mistral Medium 3.1 2508	Mistral AI	$0.40	$2.00	—	—	—	—
Deepseek V4 Pro	DeepSeek	$0.43	$0.87	—	—	—	—
Gemini 3 Flash Preview	Google	$0.50	$3.00	—	—	—	12× 8h
gpt-3.5-turbo	OpenAI	$0.50	$1.50	—	—	—	10× 21h
gpt-3.5-turbo-0125	OpenAI	$0.50	$1.50	—	—	—	10× 21h
Magistral Small 1.2 2509	Mistral AI	$0.50	$1.50	—	—	—	—
Magistral Small 2506	Mistral AI	$0.50	$1.50	—	—	—	—
Magistral Small Latest	Mistral AI	$0.50	$1.50	—	—	—	—
Mistral Large Latest	Mistral AI	$0.50	$1.50	495ms	533ms	77	—
Mistral Large 3	Mistral AI	$0.50	$1.50	—	—	—	—
Mistral Large 2512	Mistral AI	$0.50	$1.50	—	—	—	—
Deepseek R1	DeepSeek	$0.55	$2.19	—	—	—	—
gpt-audio-mini	OpenAI	$0.60	$2.40	—	—	—	10× 21h
gpt-4o-mini-realtime-preview	OpenAI	$0.60	$2.40	—	—	—	10× 21h
gpt-realtime-mini	OpenAI	$0.60	$2.40	—	—	—	10× 21h
Sonar Medium Chat	Perplexity	$0.60	$1.80	—	—	—	—
Grok 3 Mini Fast	xAI	$0.60	$4.00	—	—	—	1× 1h
Grok 3 Mini Fast Beta	xAI	$0.60	$4.00	—	—	—	1× 1h
Grok 3 Mini Fast Latest	xAI	$0.60	$4.00	—	—	—	1× 1h
Open Mixtral 8x7b	Mistral AI	$0.70	$0.70	—	—	—	—
Llama 2 70b Chat	Perplexity	$0.70	$2.80	—	—	—	—
Codellama 70b Instruct	Perplexity	$0.70	$2.80	—	—	—	—
Pplx 70b Chat	Perplexity	$0.70	$2.80	—	—	—	—
gpt-5.4-mini	OpenAI	$0.75	$4.50	—	—	—	10× 21h
Gemini 3.1 Flash Live Preview	Google	$0.75	$4.50	—	—	—	12× 8h
Qwq Plus	Alibaba Cloud	$0.80	$2.40	—	—	—	—
Command Nightly	Cohere	$1.00	$2.00	—	—	—	—
Claude Haiku 4.5	Anthropic	$1.00	$5.00	655ms	711ms	106	15× 37h
command	Cohere	$1.00	$2.00	—	—	—	—
gpt-3.5-turbo-1106	OpenAI	$1.00	$2.00	—	—	—	10× 21h
Codestral 2405	Mistral AI	$1.00	$3.00	—	—	—	—
Codestral Latest	Mistral AI	$1.00	$3.00	—	—	—	—
Llama 3.1 70b Instruct	Perplexity	$1.00	$1.00	—	—	—	—
Sonar	Perplexity	$1.00	$1.00	—	—	—	—
Sonar Reasoning	Perplexity	$1.00	$5.00	—	—	—	—
o3-mini	OpenAI	$1.10	$4.40	—	—	—	10× 21h
o4-mini	OpenAI	$1.10	$4.40	—	—	—	10× 21h
Muse Spark 1.1	Meta	$1.25	$4.25	—	—	—	—
Grok 4.3	xAI	$1.25	$2.50	—	—	—	1× 1h
Grok 4.3 Latest	xAI	$1.25	$2.50	—	—	—	1× 1h
Gemini Pro Latest	Google	$1.25	$10.00	—	—	—	12× 8h
Gemini 2.5 Computer Use Preview 10.2025	Google	$1.25	$10.00	—	—	—	12× 8h
Gemini 2.5 Pro	Google	$1.25	$10.00	—	—	—	12× 8h
Gemini 2.5 Pro Preview Tts	Google	$1.25	$10.00	—	—	—	12× 8h
gpt-5	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5.1	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5.1-chat-latest	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5-chat	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5-chat-latest	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5-codex	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5.1-codex	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5.1-codex-max	OpenAI	$1.25	$10.00	—	—	—	10× 21h
gpt-5-search-api	OpenAI	$1.25	$10.00	—	—	—	10× 21h
Mistral Medium 2604	Mistral AI	$1.50	$7.50	—	—	—	—
Mistral Medium Latest	Mistral AI	$1.50	$7.50	—	—	—	—
Mistral Medium 3.5	Mistral AI	$1.50	$7.50	—	—	—	—
codex-mini-latest	OpenAI	$1.50	$6.00	—	—	—	10× 21h
gpt-3.5-turbo-instruct	OpenAI	$1.50	$2.00	—	—	—	10× 21h
gpt-3.5-turbo-instruct-0914	OpenAI	$1.50	$2.00	—	—	—	10× 21h
Qwen Max	Alibaba Cloud	$1.60	$6.40	—	—	—	—
gpt-5.2	OpenAI	$1.75	$14.00	—	—	—	10× 21h
gpt-5.2-chat-latest	OpenAI	$1.75	$14.00	—	—	—	10× 21h
gpt-5.3-chat-latest	OpenAI	$1.75	$14.00	—	—	—	10× 21h
gpt-5.2-codex	OpenAI	$1.75	$14.00	—	—	—	10× 21h
gpt-5.3-codex	OpenAI	$1.75	$14.00	—	—	—	10× 21h
Grok 4.5 Latest	xAI	$2.00	$6.00	—	—	—	1× 1h
Grok 4.5	xAI	$2.00	$6.00	—	—	—	1× 1h
davinci-002	OpenAI	$2.00	$2.00	—	—	—	10× 21h
Gemini 3 Pro Preview	Google	$2.00	$12.00	—	—	—	12× 8h
Gemini 3.1 Pro Preview	Google	$2.00	$12.00	—	—	—	12× 8h
Gemini 3.1 Pro Preview Customtools	Google	$2.00	$12.00	—	—	—	12× 8h
gpt-4.1	OpenAI	$2.00	$8.00	—	—	—	10× 21h
Magistral Medium 1.2 2509	Mistral AI	$2.00	$5.00	—	—	—	—
Magistral Medium 2506	Mistral AI	$2.00	$5.00	—	—	—	—
Magistral Medium 2509	Mistral AI	$2.00	$5.00	—	—	—	—
Magistral Medium Latest	Mistral AI	$2.00	$5.00	—	—	—	—
Mistral Large 2411	Mistral AI	$2.00	$6.00	—	—	—	—
Open Mixtral 8x22b	Mistral AI	$2.00	$6.00	—	—	—	—
Pixtral Large 2411	Mistral AI	$2.00	$6.00	—	—	—	—
Pixtral Large Latest	Mistral AI	$2.00	$6.00	—	—	—	—
o3	OpenAI	$2.00	$8.00	—	—	—	10× 21h
o4-mini-deep-research	OpenAI	$2.00	$8.00	—	—	—	10× 21h
Sonar Deep Research	Perplexity	$2.00	$8.00	—	—	—	—
Sonar Reasoning Pro	Perplexity	$2.00	$8.00	—	—	—	—
Grok 2 1212	xAI	$2.00	$10.00	—	—	—	1× 1h
Grok 4.20 Multi Agent Beta 0309	xAI	$2.00	$6.00	—	—	—	1× 1h
Grok 4.20 Beta 0309 Reasoning	xAI	$2.00	$6.00	—	—	—	1× 1h
Grok 4.20 Beta 0309 Non Reasoning	xAI	$2.00	$6.00	—	—	—	1× 1h
Grok 4.20.0309 Reasoning	xAI	$2.00	$6.00	—	—	—	1× 1h
Grok 2	xAI	$2.00	$10.00	—	—	—	1× 1h
Grok 2 Latest	xAI	$2.00	$10.00	—	—	—	1× 1h
Grok 2 Vision	xAI	$2.00	$10.00	—	—	—	1× 1h
Grok 2 Vision 1212	xAI	$2.00	$10.00	—	—	—	1× 1h
Grok 2 Vision Latest	xAI	$2.00	$10.00	—	—	—	1× 1h
Mistral Large	Mistral AI	$2.00	$6.00	—	—	—	—
Command R Plus	Cohere	$2.50	$10.00	—	—	—	—
Command R Plus 08 2024	Cohere	$2.50	$10.00	292ms	312ms	53	—
Command A 03 2025	Cohere	$2.50	$10.00	—	—	—	—
gpt-4o	OpenAI	$2.50	$10.00	—	—	—	10× 21h
gpt-4o-audio-preview	OpenAI	$2.50	$10.00	—	—	—	10× 21h
gpt-audio	OpenAI	$2.50	$10.00	—	—	—	10× 21h
gpt-audio-1.5	OpenAI	$2.50	$10.00	—	—	—	10× 21h
gpt-4o-search-preview	OpenAI	$2.50	$10.00	—	—	—	10× 21h
gpt-5.4	OpenAI	$2.50	$15.00	—	—	—	10× 21h
Command R+	Cohere	$2.50	$10.00	—	—	—	—
Mistral Medium 2312	Mistral AI	$2.70	$8.10	—	—	—	—
Mistral Medium	Mistral AI	$2.70	$8.10	—	—	—	—
Claude Sonnet 4	Anthropic	$3.00	$15.00	—	—	—	15× 37h
Claude 4 Sonnet	Anthropic	$3.00	$15.00	—	—	—	15× 37h
Claude 3 7 Sonnet 20250219	Anthropic	$3.00	$15.00	—	—	—	15× 37h
Claude 3.7 Sonnet	Anthropic	$3.00	$15.00	—	—	—	15× 37h
Claude Sonnet 4.5	Anthropic	$3.00	$15.00	—	—	—	15× 37h
Claude Sonnet 4.6	Anthropic	$3.00	$15.00	1047ms	1148ms	57	15× 37h
gpt-3.5-turbo-16k	OpenAI	$3.00	$4.00	—	—	—	10× 21h
Mistral Large 2407	Mistral AI	$3.00	$9.00	—	—	—	—
Sonar Pro	Perplexity	$3.00	$15.00	—	—	—	—
Grok 3 Beta	xAI	$3.00	$15.00	—	—	—	1× 1h
Grok 3 Latest	xAI	$3.00	$15.00	—	—	—	1× 1h
Grok 4	xAI	$3.00	$15.00	—	—	—	1× 1h
Grok 4 Latest	xAI	$3.00	$15.00	—	—	—	1× 1h
Grok 4 0709	xAI	$3.00	$15.00	—	—	—	1× 1h
gpt-realtime-2	OpenAI	$4.00	$16.00	—	—	—	10× 21h
gpt-realtime	OpenAI	$4.00	$16.00	—	—	—	10× 21h
gpt-realtime-1.5	OpenAI	$4.00	$16.00	—	—	—	10× 21h
Mistral Large 2402	Mistral AI	$4.00	$12.00	—	—	—	—
chatgpt-4o-latest	OpenAI	$5.00	$15.00	—	—	—	10× 21h
Claude Opus 4.5	Anthropic	$5.00	$25.00	—	—	—	15× 37h
Claude Opus 4.6	Anthropic	$5.00	$25.00	2129ms	2220ms	59	15× 37h
Claude Opus 4.7	Anthropic	$5.00	$25.00	1395ms	1473ms	65	15× 37h
gpt-4o-realtime-preview	OpenAI	$5.00	$20.00	—	—	—	10× 21h
gpt-5.5	OpenAI	$5.00	$30.00	—	—	—	10× 21h
Grok 3 Fast Beta	xAI	$5.00	$25.00	—	—	—	1× 1h
Grok 3 Fast Latest	xAI	$5.00	$25.00	—	—	—	1× 1h
Grok Beta	xAI	$5.00	$15.00	—	—	—	1× 1h
Grok Vision Beta	xAI	$5.00	$15.00	—	—	—	1× 1h
gpt-4-0125-preview	OpenAI	$10.00	$30.00	—	—	—	10× 21h
gpt-4-1106-preview	OpenAI	$10.00	$30.00	—	—	—	10× 21h
gpt-4-turbo	OpenAI	$10.00	$30.00	—	—	—	10× 21h
gpt-4-turbo-preview	OpenAI	$10.00	$30.00	—	—	—	10× 21h
o3-deep-research	OpenAI	$10.00	$40.00	—	—	—	10× 21h
Claude Opus 4	Anthropic	$15.00	$75.00	—	—	—	15× 37h
Claude 4 Opus	Anthropic	$15.00	$75.00	—	—	—	15× 37h
Claude 3 Opus 20240229	Anthropic	$15.00	$75.00	—	—	—	15× 37h
Claude 3 Opus	Anthropic	$15.00	$75.00	—	—	—	15× 37h
Claude Opus 4.1	Anthropic	$15.00	$75.00	—	—	—	15× 37h
gpt-5-pro	OpenAI	$15.00	$120.00	—	—	—	10× 21h
o1	OpenAI	$15.00	$60.00	—	—	—	10× 21h
o3-pro	OpenAI	$20.00	$80.00	—	—	—	10× 21h
gpt-5.2-pro	OpenAI	$21.00	$168.00	—	—	—	10× 21h
gpt-4	OpenAI	$30.00	$60.00	—	—	—	10× 21h
gpt-4-0314	OpenAI	$30.00	$60.00	—	—	—	10× 21h
gpt-4-0613	OpenAI	$30.00	$60.00	—	—	—	10× 21h
gpt-5.5-pro	OpenAI	$30.00	$180.00	—	—	—	10× 21h
gpt-5.4-pro	OpenAI	$30.00	$180.00	—	—	—	10× 21h
o1-pro	OpenAI	$150.00	$600.00	—	—	—	10× 21h

TTFT = time-to-first-token · HTTP = end-to-end response time · Tok/s = generation speed · Outages = count × total downtime over the last 7 days.

Frequently asked questions

How often is pricing updated?

Prices are updated within 24 hours of a provider announcement. We pull from official documentation and pricing pages.

What is TTFT p50?

TTFT (time-to-first-token) is how long the API takes to start streaming. p50 is the median - half of checks were faster, half slower. Measured over 7 days of live checks.
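As a rough illustration of what a p50 figure summarizes, the sketch below takes the median of a handful of TTFT samples. The sample values and the `p50` helper are hypothetical; the site does not publish its probe data or its aggregation code.

```python
import statistics


def p50(samples_ms):
    """Return the median (p50) of latency samples given in milliseconds."""
    if not samples_ms:
        raise ValueError("need at least one sample")
    return statistics.median(samples_ms)


# Hypothetical TTFT measurements from five probes, in milliseconds.
ttft_samples = [310.0, 355.0, 371.0, 402.0, 1250.0]

# Two probes were faster than the median and two were slower.
ttft_p50 = p50(ttft_samples)
ttft_mean = statistics.mean(ttft_samples)
```

Here the median is 371.0 ms while the mean is pulled up to 537.6 ms by the single slow probe, which is one reason a median is a common choice for latency summaries.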

What is HTTP p50?

The median end-to-end round-trip time for a complete API response, measured from our monitoring infrastructure. Higher than TTFT since it includes the full generation time.

What does Tok/s mean?

Tokens per second - the 7-day median generation speed. Higher is faster. Only available for models we actively probe.
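One plausible way to derive a tokens-per-second figure from a single probe is sketched below. The formula (output tokens divided by the time between the first token and the end of the response) is an assumption, since the page does not state how generation speed is computed, and `tokens_per_second` is a hypothetical helper.

```python
def tokens_per_second(output_tokens, ttft_s, total_s):
    """Generation speed for one probe.

    Assumption: generation time is the total response time minus the
    time-to-first-token. The site's exact formula is not documented.
    """
    generation_s = total_s - ttft_s
    if generation_s <= 0:
        raise ValueError("total time must exceed TTFT")
    return output_tokens / generation_s


# Hypothetical probe: 200 output tokens, first token at 0.5 s, finished at 2.5 s.
speed = tokens_per_second(200, ttft_s=0.5, total_s=2.5)
```

With these made-up numbers the result is 100.0 tokens per second; a 7-day figure would then be the median of many such per-probe values.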

How are outages counted?

An outage is an incident lasting 15+ minutes where the API returns errors or is unreachable. Count and total downtime shown for the last 7 days.
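The counting rule above can be sketched as follows. The incident timestamps are invented for illustration, and `summarize_outages` is a hypothetical helper rather than the site's actual implementation; only the 15-minute threshold and the 7-day window come from the text.

```python
from datetime import datetime, timedelta

MIN_OUTAGE = timedelta(minutes=15)  # threshold stated in the FAQ answer


def summarize_outages(incidents, now, window=timedelta(days=7)):
    """Count incidents of 15+ minutes that began inside the window and sum their downtime."""
    cutoff = now - window
    qualifying = [
        end - start
        for start, end in incidents
        if start >= cutoff and end - start >= MIN_OUTAGE
    ]
    return len(qualifying), sum(qualifying, timedelta())


# Hypothetical incidents as (start, end) pairs.
now = datetime(2026, 7, 14, 12, 0)
incidents = [
    (datetime(2026, 7, 13, 9, 0), datetime(2026, 7, 13, 9, 45)),   # 45 min: counts
    (datetime(2026, 7, 12, 22, 0), datetime(2026, 7, 12, 22, 5)),  # 5 min: too short
    (datetime(2026, 7, 10, 3, 0), datetime(2026, 7, 10, 4, 30)),   # 90 min: counts
    (datetime(2026, 7, 1, 8, 0), datetime(2026, 7, 1, 10, 0)),     # outside the window
]
count, downtime = summarize_outages(incidents, now)
```

For this sample data the summary is 2 outages totalling 2 hours 15 minutes, the same count-plus-duration shape as the Outages column.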

What does "cached" pricing mean?

Some providers (Anthropic, OpenAI) offer discounted rates for prompt cache hits - repeated prefixes that the API has already processed. Not all models support this.
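To make the effect of cached pricing concrete, here is a minimal sketch of the input-side cost of one request when part of the prompt is a cache hit. The $3.00 input rate is the Claude Sonnet 4.6 figure from the table; the $0.30 cached rate and the token counts are hypothetical, because cached rates are not listed on this page.

```python
def input_cost_usd(fresh_tokens, cached_tokens, input_per_m, cached_per_m):
    """Input cost of one request, with rates given in USD per 1M tokens."""
    return (fresh_tokens * input_per_m + cached_tokens * cached_per_m) / 1_000_000


INPUT_PER_M = 3.00   # from the table (Claude Sonnet 4.6)
CACHED_PER_M = 0.30  # hypothetical cached rate, for illustration only

# A 10,000-token prompt where 8,000 tokens are a previously processed prefix.
with_cache = input_cost_usd(2_000, 8_000, INPUT_PER_M, CACHED_PER_M)
without_cache = input_cost_usd(10_000, 0, INPUT_PER_M, CACHED_PER_M)
```

Under these assumptions the request costs about $0.0084 in input tokens with the cache hit, against $0.03 without it.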

Prices are per 1M tokens in USD and are sourced from official provider documentation. Cached pricing applies to prompt cache hits where supported. Latency benchmarks come from automated API probes run every 5 minutes.
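Because every price is quoted per 1M tokens, comparing models for a given workload is simple arithmetic. The sketch below uses three rows copied from the table; the workload size and the `workload_cost_usd` helper are hypothetical.

```python
# (input USD per 1M tokens, output USD per 1M tokens), copied from the table.
PRICES = {
    "gpt-4o-mini": (0.15, 0.60),
    "Mistral Large Latest": (0.50, 1.50),
    "Claude Haiku 4.5": (1.00, 5.00),
}


def workload_cost_usd(model, input_tokens, output_tokens):
    """Total cost in USD for a workload, ignoring any cached-token discount."""
    input_per_m, output_per_m = PRICES[model]
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


# Hypothetical workload: 1,000,000 input tokens and 200,000 output tokens.
costs = {name: workload_cost_usd(name, 1_000_000, 200_000) for name in PRICES}
cheapest_first = sorted(costs, key=costs.get)
```

For this workload the totals come to roughly $0.27, $0.80, and $2.00 respectively. Output-heavy workloads shift the ranking toward models with a low output rate, so it helps to plug in a realistic input/output split.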