OpenAI compatible API. Attested gateway. Public status.
DeepInfra
DeepInfra models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
deepinfra
No logs
| Provider | DeepInfra |
|---|---|
| Models | 9 public models |
| Prepaid routes | 9 |
| BYOK routes | 9 |
| Zero data retention | yes |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | Tracked as provider ZDR — DeepInfra documents memory-only handling with no storage of API content and no training on submitted API data. (Exception: requests to Google/Anthropic-backed models inherit those vendors' policies.) Policy source |
Measured performance
259 samplesContinuously sampled across DeepInfra's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 6291 ms |
|---|---|
| Throughput | — |
| Uptime | 98.07% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| qwen/qwen3.5-27b | 2420 ms | 2419 ms | — | 100.00% | — | 30 |
| google/gemma-4-26b-a4b-it | 2973 ms | 2972 ms | — | 100.00% | — | 26 |
| google/gemma-3-12b-it | 3437 ms | 3436 ms | — | 97.06% | — | 34 |
| google/gemma-4-31b-it | 5227 ms | 5226 ms | — | 100.00% | — | 34 |
| google/gemma-3-27b-it | 6291 ms | 6290 ms | — | 97.06% | — | 34 |
| google/gemma-3-4b-it | 6502 ms | 6501 ms | — | 97.22% | — | 36 |
| meta-llama/llama-3.1-70b-instruct | 8765 ms | 8764 ms | — | 100.00% | — | 35 |
| z-ai/glm-5.2 | 10005 ms | 10005 ms | — | 93.10% | — | 29 |
| Qwen/Qwen3-Embedding-8B | — | — | — | 100.00% | — | 1 |
Provider models
Models served by DeepInfra.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
Qwen/Qwen3-Embedding-8BQwen3 Embedding 8B |
— | 32,000 | 2 | $0.011/1M | selected route | prepaid BYOK |
google/gemma-3-12b-itGoogle: Gemma 3 12B |
— | 131,072 | 2 | $0.055/1M | $0.165/1M | prepaid BYOK |
google/gemma-3-27b-itGoogle: Gemma 3 27B |
— | 131,072 | 2 | $0.088/1M | $0.176/1M | prepaid BYOK |
google/gemma-3-4b-itGoogle: Gemma 3 4B |
— | 131,072 | 2 | $0.055/1M | $0.11/1M | prepaid BYOK |
google/gemma-4-26b-a4b-itGoogle: Gemma 4 26B A4B |
IQ 94#63 | 262,144 | 2 | $0.077/1M | $0.374/1M | prepaid BYOK |
google/gemma-4-31b-itGoogle: Gemma 4 31B |
IQ 98#52 | 262,144 | 2 | $0.143/1M | $0.418/1M | prepaid BYOK |
meta-llama/llama-3.1-70b-instructMeta: Llama 3.1 70B Instruct |
— | 131,072 | 2 | $0.44/1M | $0.44/1M | prepaid BYOK |
qwen/qwen3.5-27bQwen: Qwen3.5-27B |
— | 262,144 | 2 | $0.286/1M | $2.86/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.32/1M | $4.62/1M | prepaid BYOK |