OpenAI compatible API. Attested gateway. Public status.
Baseten
Baseten models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
baseten
No provider claim
| Provider | Baseten |
|---|---|
| Models | 11 public models |
| Prepaid routes | 11 |
| BYOK routes | 11 |
| Zero data retention | not claimed |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | No provider-ZDR claim is tracked here. Baseten's inference and security documentation are linked for users who need to review API data handling. Policy source |
Measured performance
259 samplesContinuously sampled across Baseten's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 7260 ms |
|---|---|
| Throughput | 82 tok/s |
| Uptime | 97.68% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| z-ai/glm-5.2 | 5469 ms | 5468 ms | 78 tok/s | 93.33% | — | 75 |
| moonshotai/kimi-k2.7-code | 7140 ms | 7139 ms | 85 tok/s | 100.00% | — | 45 |
| deepseek/deepseek-v4-pro | 7260 ms | 7259 ms | 82 tok/s | 100.00% | — | 44 |
| z-ai/glm-5 | 8012 ms | 8011 ms | — | 91.67% | — | 12 |
| z-ai/glm-5.1 | 8163 ms | 8162 ms | — | 100.00% | — | 11 |
| nvidia/nemotron-120b-a12b | 8751 ms | 8751 ms | — | 100.00% | — | 12 |
| z-ai/glm-4.7 | 8923 ms | 8923 ms | — | 100.00% | — | 9 |
| moonshotai/kimi-k2.5 | 9280 ms | 9279 ms | — | 100.00% | — | 19 |
| openai/gpt-oss-120b | 10628 ms | 10627 ms | — | 100.00% | — | 15 |
| moonshotai/kimi-k2.6 | 11430 ms | 11429 ms | — | 100.00% | — | 7 |
| nvidia/nvidia-nemotron-3-ultra-550b-a55b | 12384 ms | 12383 ms | — | 100.00% | — | 10 |
Provider models
Models served by Baseten.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
deepseek/deepseek-v4-proDeepSeek: DeepSeek V4 Pro |
IQ 109#28 | 1,048,576 | 2 | $1.914/1M | $3.828/1M | prepaid BYOK |
moonshotai/kimi-k2.5MoonshotAI: Kimi K2.5 |
IQ 109#29 | 262,144 | 2 | $0.66/1M | $3.3/1M | prepaid BYOK |
moonshotai/kimi-k2.6MoonshotAI: Kimi K2.6 |
IQ 117#11 | 262,144 | 2 | $1.045/1M | $4.4/1M | prepaid BYOK |
moonshotai/kimi-k2.7-codeMoonshotAI: Kimi K2.7 Code |
IQ 116#13 | 262,144 | 2 | $1.045/1M | $4.4/1M | prepaid BYOK |
nvidia/nemotron-120b-a12bNemotron 120B A12B |
— | 202,800 | 2 | $0.33/1M | $0.825/1M | prepaid BYOK |
nvidia/nvidia-nemotron-3-ultra-550b-a55bNVIDIA Nemotron 3 Ultra 550B A55B |
— | 202,800 | 2 | $0.66/1M | $2.64/1M | prepaid BYOK |
openai/gpt-oss-120bOpenAI: gpt-oss-120b |
IQ 95#59 | 131,072 | 2 | $0.11/1M | $0.55/1M | prepaid BYOK |
z-ai/glm-4.7Z.ai: GLM 4.7 |
IQ 102#46 | 202,752 | 2 | $0.66/1M | $2.42/1M | prepaid BYOK |
z-ai/glm-5Z.ai: GLM 5 |
IQ 107#34 | 204,800 | 2 | $1.045/1M | $3.465/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
IQ 113#19 | 202,752 | 2 | $1.43/1M | $4.73/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.54/1M | $4.84/1M | prepaid BYOK |