OpenAI compatible API. Attested gateway. Public status.

Baseten

Baseten models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

baseten

No provider claim

All providers

ProviderBaseten
Models11 public models
Prepaid routes11
BYOK routes11
Zero data retentionnot claimed
Confidential computenot claimed
Provider E2EEnot claimed
Policy noteNo provider-ZDR claim is tracked here. Baseten's inference and security documentation are linked for users who need to review API data handling.
Policy source

Measured performance

259 samples

Continuously sampled across Baseten's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT7260 ms
Throughput82 tok/s
Uptime97.68%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
z-ai/glm-5.2 5469 ms 5468 ms 78 tok/s 93.33% 75
moonshotai/kimi-k2.7-code 7140 ms 7139 ms 85 tok/s 100.00% 45
deepseek/deepseek-v4-pro 7260 ms 7259 ms 82 tok/s 100.00% 44
z-ai/glm-5 8012 ms 8011 ms 91.67% 12
z-ai/glm-5.1 8163 ms 8162 ms 100.00% 11
nvidia/nemotron-120b-a12b 8751 ms 8751 ms 100.00% 12
z-ai/glm-4.7 8923 ms 8923 ms 100.00% 9
moonshotai/kimi-k2.5 9280 ms 9279 ms 100.00% 19
openai/gpt-oss-120b 10628 ms 10627 ms 100.00% 15
moonshotai/kimi-k2.6 11430 ms 11429 ms 100.00% 7
nvidia/nvidia-nemotron-3-ultra-550b-a55b 12384 ms 12383 ms 100.00% 10

Full provider & model leaderboard.

Provider models

Models served by Baseten.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model AI IQ Context Endpoints Prompt Completion Routes
deepseek/deepseek-v4-pro
DeepSeek: DeepSeek V4 Pro
IQ 109#28 1,048,576 2 $1.914/1M $3.828/1M prepaid BYOK
moonshotai/kimi-k2.5
MoonshotAI: Kimi K2.5
IQ 109#29 262,144 2 $0.66/1M $3.3/1M prepaid BYOK
moonshotai/kimi-k2.6
MoonshotAI: Kimi K2.6
IQ 117#11 262,144 2 $1.045/1M $4.4/1M prepaid BYOK
moonshotai/kimi-k2.7-code
MoonshotAI: Kimi K2.7 Code
IQ 116#13 262,144 2 $1.045/1M $4.4/1M prepaid BYOK
nvidia/nemotron-120b-a12b
Nemotron 120B A12B
202,800 2 $0.33/1M $0.825/1M prepaid BYOK
nvidia/nvidia-nemotron-3-ultra-550b-a55b
NVIDIA Nemotron 3 Ultra 550B A55B
202,800 2 $0.66/1M $2.64/1M prepaid BYOK
openai/gpt-oss-120b
OpenAI: gpt-oss-120b
IQ 95#59 131,072 2 $0.11/1M $0.55/1M prepaid BYOK
z-ai/glm-4.7
Z.ai: GLM 4.7
IQ 102#46 202,752 2 $0.66/1M $2.42/1M prepaid BYOK
z-ai/glm-5
Z.ai: GLM 5
IQ 107#34 204,800 2 $1.045/1M $3.465/1M prepaid BYOK
z-ai/glm-5.1
Z.ai: GLM 5.1
IQ 113#19 202,752 2 $1.43/1M $4.73/1M prepaid BYOK
z-ai/glm-5.2
GLM 5.2
IQ 117#10 1,048,576 2 $1.54/1M $4.84/1M prepaid BYOK

Sign in

Choose a sign in method.