OpenAI compatible API. Attested gateway. Public status.
Crusoe
Crusoe models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
crusoe
No provider claim
| Provider | Crusoe |
|---|---|
| Models | 15 public models |
| Prepaid routes | 15 |
| BYOK routes | 15 |
| Zero data retention | not claimed |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | No provider-ZDR claim is tracked here. Crusoe's Managed Inference docs and pricing/catalog pages are linked for model and API data-handling review. Policy source |
Measured performance
228 samplesContinuously sampled across Crusoe's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 5471 ms |
|---|---|
| Throughput | — |
| Uptime | 91.23% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| moonshotai/kimi-k2.6 | 1836 ms | 1836 ms | — | 33.33% | — | 18 |
| yutori/n1.5 | 2136 ms | 2135 ms | — | 100.00% | — | 20 |
| z-ai/glm-5.2 | 2393 ms | 2392 ms | — | 100.00% | — | 12 |
| nvidia/nemotron-3-ultra-550b | 3307 ms | 3307 ms | — | 92.31% | — | 13 |
| google/gemma-4-31b-it | 4449 ms | 4449 ms | — | 94.74% | — | 19 |
| nvidia/nemotron-3-super-120b-a12b | 4613 ms | 4612 ms | — | 100.00% | 14 probe_config_error |
3 |
| deepseek/deepseek-v3-0324 | 4807 ms | 4806 ms | — | 100.00% | — | 21 |
| meta-llama/llama-3.3-70b-instruct | 5471 ms | 5471 ms | — | 100.00% | — | 22 |
| deepseek/deepseek-v4-pro | 5620 ms | 5619 ms | — | 88.89% | — | 18 |
| openai/gpt-oss-120b | 5623 ms | 5623 ms | — | 100.00% | — | 20 |
| qwen/qwen3-235b-a22b-2507 | 6109 ms | 6108 ms | — | 94.74% | — | 19 |
| z-ai/glm-5.1 | 6129 ms | 6129 ms | — | 100.00% | — | 16 |
| deepseek/deepseek-v4-flash | 6831 ms | 6830 ms | — | 100.00% | — | 14 |
| nvidia/nemotron-3-nano-omni-reasoning-30b-a3b | 10150 ms | 10149 ms | — | 90.91% | — | 11 |
| nvidia/nemotron-3-nano-30b-a3b | — | — | — | 0.00% | 17 probe_config_error |
2 |
Provider models
Models served by Crusoe.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
deepseek/deepseek-v3-0324DeepSeek V3 0324 |
— | 163,840 | 2 | $0.55/1M | $1.65/1M | prepaid BYOK |
deepseek/deepseek-v4-flashDeepSeek: DeepSeek V4 Flash |
IQ 104#38 | 1,048,576 | 2 | $0.154/1M | $0.308/1M | prepaid BYOK |
deepseek/deepseek-v4-proDeepSeek: DeepSeek V4 Pro |
IQ 109#28 | 1,048,576 | 2 | $1.914/1M | $3.828/1M | prepaid BYOK |
google/gemma-4-31b-itGoogle: Gemma 4 31B |
IQ 98#52 | 262,144 | 2 | $0.154/1M | $0.44/1M | prepaid BYOK |
meta-llama/llama-3.3-70b-instructMeta: Llama 3.3 70B Instruct |
— | 131,072 | 2 | $0.275/1M | $0.825/1M | prepaid BYOK |
moonshotai/kimi-k2.6MoonshotAI: Kimi K2.6 |
IQ 117#11 | 262,144 | 2 | $0.77/1M | $3.85/1M | prepaid BYOK |
nvidia/nemotron-3-nano-30b-a3bnvidia/NVIDIA-Nemotron-3-Nano-30B-A3B |
— | 262,144 | 2 | $0.055/1M | $0.22/1M | prepaid BYOK |
nvidia/nemotron-3-nano-omni-reasoning-30b-a3bnvidia/Nemotron-3-Nano-Omni-Reasoning-30B-A3B |
— | 262,144 | 2 | $0.33/1M | $2.013/1M | prepaid BYOK |
nvidia/nemotron-3-super-120b-a12bnemotron 3 super 120b a12b |
— | 131,072 | 2 | $0.33/1M | $2.64/1M | prepaid BYOK |
nvidia/nemotron-3-ultra-550bnvidia/NVIDIA-Nemotron-3-Ultra-550B |
— | 262,144 | 2 | $1.1/1M | $3.52/1M | prepaid BYOK |
openai/gpt-oss-120bOpenAI: gpt-oss-120b |
IQ 95#59 | 131,072 | 2 | $0.055/1M | $0.275/1M | prepaid BYOK |
qwen/qwen3-235b-a22b-2507Qwen: Qwen3 235B A22B Instruct 2507 |
— | 262,144 | 2 | $0.242/1M | $0.88/1M | prepaid BYOK |
yutori/n1.5yutori/n1.5 |
— | 128,000 | 2 | $1.65/1M | $5.5/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
IQ 113#19 | 202,752 | 2 | $1.32/1M | $4.84/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.54/1M | $4.84/1M | prepaid BYOK |