OpenAI compatible API. Attested gateway. Public status.

Baseten

Baseten models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway

1 URLbase_url migration

100smodels and routes

0prompt logs by default

`baseten`

No provider claim

All providers

Provider	Baseten
Models	11 public models
Prepaid routes	11
BYOK routes	11
Zero data retention	not claimed
Confidential compute	not claimed
Provider E2EE	not claimed
Policy note	No provider-ZDR claim is tracked here. Baseten's inference and security documentation are linked for users who need to review API data handling. Policy source

Measured performance

259 samples

Continuously sampled across Baseten's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT	7260 ms
Throughput	82 tok/s
Uptime	97.68%

Model	p50 TTFT	p50 TTFB	Throughput	Uptime	Config excluded	Samples
z-ai/glm-5.2	5469 ms	5468 ms	78 tok/s	93.33%	—	75
moonshotai/kimi-k2.7-code	7140 ms	7139 ms	85 tok/s	100.00%	—	45
deepseek/deepseek-v4-pro	7260 ms	7259 ms	82 tok/s	100.00%	—	44
z-ai/glm-5	8012 ms	8011 ms	—	91.67%	—	12
z-ai/glm-5.1	8163 ms	8162 ms	—	100.00%	—	11
nvidia/nemotron-120b-a12b	8751 ms	8751 ms	—	100.00%	—	12
z-ai/glm-4.7	8923 ms	8923 ms	—	100.00%	—	9
moonshotai/kimi-k2.5	9280 ms	9279 ms	—	100.00%	—	19
openai/gpt-oss-120b	10628 ms	10627 ms	—	100.00%	—	15
moonshotai/kimi-k2.6	11430 ms	11429 ms	—	100.00%	—	7
nvidia/nvidia-nemotron-3-ultra-550b-a55b	12384 ms	12383 ms	—	100.00%	—	10

Full provider & model leaderboard.

Provider models

Models served by Baseten.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model	AI IQ	Context	Endpoints	Prompt	Completion	Routes
`deepseek/deepseek-v4-pro` DeepSeek: DeepSeek V4 Pro benchmarks performance api	IQ 109#28	1,048,576	2	$1.914/1M	$3.828/1M	prepaid BYOK
`moonshotai/kimi-k2.5` MoonshotAI: Kimi K2.5 benchmarks performance api	IQ 109#29	262,144	2	$0.66/1M	$3.3/1M	prepaid BYOK
`moonshotai/kimi-k2.6` MoonshotAI: Kimi K2.6 benchmarks performance api	IQ 117#11	262,144	2	$1.045/1M	$4.4/1M	prepaid BYOK
`moonshotai/kimi-k2.7-code` MoonshotAI: Kimi K2.7 Code benchmarks performance api	IQ 116#13	262,144	2	$1.045/1M	$4.4/1M	prepaid BYOK
`nvidia/nemotron-120b-a12b` Nemotron 120B A12B benchmarks performance api	—	202,800	2	$0.33/1M	$0.825/1M	prepaid BYOK
`nvidia/nvidia-nemotron-3-ultra-550b-a55b` NVIDIA Nemotron 3 Ultra 550B A55B benchmarks performance api	—	202,800	2	$0.66/1M	$2.64/1M	prepaid BYOK
`openai/gpt-oss-120b` OpenAI: gpt-oss-120b benchmarks performance api	IQ 95#59	131,072	2	$0.11/1M	$0.55/1M	prepaid BYOK
`z-ai/glm-4.7` Z.ai: GLM 4.7 benchmarks performance api	IQ 102#46	202,752	2	$0.66/1M	$2.42/1M	prepaid BYOK
`z-ai/glm-5` Z.ai: GLM 5 benchmarks performance api	IQ 107#34	204,800	2	$1.045/1M	$3.465/1M	prepaid BYOK
`z-ai/glm-5.1` Z.ai: GLM 5.1 benchmarks performance api	IQ 113#19	202,752	2	$1.43/1M	$4.73/1M	prepaid BYOK
`z-ai/glm-5.2` GLM 5.2 benchmarks performance api	IQ 117#10	1,048,576	2	$1.54/1M	$4.84/1M	prepaid BYOK