OpenAI compatible API. Attested gateway. Public status.

DeepInfra

DeepInfra models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

deepinfra

No logs

All providers

ProviderDeepInfra
Models9 public models
Prepaid routes9
BYOK routes9
Zero data retentionyes
Confidential computenot claimed
Provider E2EEnot claimed
Policy noteTracked as provider ZDR — DeepInfra documents memory-only handling with no storage of API content and no training on submitted API data. (Exception: requests to Google/Anthropic-backed models inherit those vendors' policies.)
Policy source

Measured performance

259 samples

Continuously sampled across DeepInfra's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT6291 ms
Throughput
Uptime98.07%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
qwen/qwen3.5-27b 2420 ms 2419 ms 100.00% 30
google/gemma-4-26b-a4b-it 2973 ms 2972 ms 100.00% 26
google/gemma-3-12b-it 3437 ms 3436 ms 97.06% 34
google/gemma-4-31b-it 5227 ms 5226 ms 100.00% 34
google/gemma-3-27b-it 6291 ms 6290 ms 97.06% 34
google/gemma-3-4b-it 6502 ms 6501 ms 97.22% 36
meta-llama/llama-3.1-70b-instruct 8765 ms 8764 ms 100.00% 35
z-ai/glm-5.2 10005 ms 10005 ms 93.10% 29
Qwen/Qwen3-Embedding-8B 100.00% 1

Full provider & model leaderboard.

Provider models

Models served by DeepInfra.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model AI IQ Context Endpoints Prompt Completion Routes
Qwen/Qwen3-Embedding-8B
Qwen3 Embedding 8B
32,000 2 $0.011/1M selected route prepaid BYOK
google/gemma-3-12b-it
Google: Gemma 3 12B
131,072 2 $0.055/1M $0.165/1M prepaid BYOK
google/gemma-3-27b-it
Google: Gemma 3 27B
131,072 2 $0.088/1M $0.176/1M prepaid BYOK
google/gemma-3-4b-it
Google: Gemma 3 4B
131,072 2 $0.055/1M $0.11/1M prepaid BYOK
google/gemma-4-26b-a4b-it
Google: Gemma 4 26B A4B
IQ 94#63 262,144 2 $0.077/1M $0.374/1M prepaid BYOK
google/gemma-4-31b-it
Google: Gemma 4 31B
IQ 98#52 262,144 2 $0.143/1M $0.418/1M prepaid BYOK
meta-llama/llama-3.1-70b-instruct
Meta: Llama 3.1 70B Instruct
131,072 2 $0.44/1M $0.44/1M prepaid BYOK
qwen/qwen3.5-27b
Qwen: Qwen3.5-27B
262,144 2 $0.286/1M $2.86/1M prepaid BYOK
z-ai/glm-5.2
GLM 5.2
IQ 117#10 1,048,576 2 $1.32/1M $4.62/1M prepaid BYOK

Sign in

Choose a sign in method.