OpenAI compatible API. Attested gateway. Public status.

Phala

Phala models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

phala

Confidential

All providers

ProviderPhala
Models19 public models
Prepaid routes19
BYOK routes19
Zero data retentionyes
Confidential computeyes
Provider E2EEyes
Policy noteTracked as a confidential AI provider with provider-side attestation and encrypted prompt transport.
Policy source

Measured performance

259 samples

Continuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT7868 ms
Throughput5 tok/s
Uptime88.42%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
z-ai/glm-4.7 3226 ms 3225 ms 100.00% 9
qwen/qwen2.5-vl-72b-instruct 3258 ms 3257 ms 100.00% 10
qwen/qwen3-vl-30b-a3b-instruct 3460 ms 3459 ms 100.00% 10
z-ai/glm-5.2 4078 ms 4077 ms 83.33% 12
openai/gpt-oss-120b 4514 ms 4513 ms 100.00% 16
moonshotai/kimi-k2.6 4632 ms 4631 ms 100.00% 10
z-ai/glm-5.1 4854 ms 4853 ms 75.00% 8
qwen/qwen3-30b-a3b-instruct-2507 5141 ms 5140 ms 100.00% 19
google/gemma-3-27b-it 5855 ms 5855 ms 84.62% 13
minimax/minimax-m2.5 6043 ms 6043 ms 100.00% 13
deepseek/deepseek-chat-v3.1 7868 ms 7867 ms 100.00% 11
z-ai/glm-5 9044 ms 9044 ms 94.44% 18
deepseek/deepseek-v3.2 9377 ms 9376 ms 100.00% 7
openai/gpt-oss-20b 9516 ms 9516 ms 95.24% 21
qwen/qwen3.5-27b 10421 ms 10420 ms 94.44% 18
moonshotai/kimi-k2.5 11252 ms 11251 ms 79.17% 24
qwen/qwen-2.5-7b-instruct 11687 ms 11687 ms 5 tok/s 100.00% 9
qwen/qwen3.5-397b-a17b 12219 ms 12218 ms 83.33% 18
z-ai/glm-4.7-flash 0.00% 13

Full provider & model leaderboard.

Provider models

Models served by Phala.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model AI IQ Context Endpoints Prompt Completion Routes
deepseek/deepseek-chat-v3.1
DeepSeek: DeepSeek V3.1
163,840 2 $1.155/1M $3.41/1M prepaid BYOK
deepseek/deepseek-v3.2
DeepSeek: DeepSeek V3.2
IQ 101#47 163,840 2 $0.352/1M $0.528/1M prepaid BYOK
google/gemma-3-27b-it
Google: Gemma 3 27B
131,072 2 $0.121/1M $0.44/1M prepaid BYOK
minimax/minimax-m2.5
MiniMax: MiniMax M2.5
IQ 103#43 204,800 2 $0.22/1M $1.518/1M prepaid BYOK
moonshotai/kimi-k2.5
MoonshotAI: Kimi K2.5
IQ 109#29 262,144 2 $0.66/1M $3.3/1M prepaid BYOK
moonshotai/kimi-k2.6
MoonshotAI: Kimi K2.6
IQ 117#11 262,144 2 $1.199/1M $5.06/1M prepaid BYOK
openai/gpt-oss-120b
OpenAI: gpt-oss-120b
IQ 95#59 131,072 2 $0.165/1M $0.66/1M prepaid BYOK
openai/gpt-oss-20b
OpenAI: gpt-oss-20b
IQ 92#69 131,072 2 $0.044/1M $0.165/1M prepaid BYOK
qwen/qwen-2.5-7b-instruct
Qwen: Qwen2.5 7B Instruct
131,072 2 $0.044/1M $0.11/1M prepaid BYOK
qwen/qwen2.5-vl-72b-instruct
Qwen: Qwen2.5 VL 72B Instruct
131,072 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3-30b-a3b-instruct-2507
Qwen: Qwen3 30B A3B Instruct 2507
131,072 2 $0.165/1M $0.605/1M prepaid BYOK
qwen/qwen3-vl-30b-a3b-instruct
Qwen: Qwen3 VL 30B A3B Instruct
262,144 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3.5-27b
Qwen: Qwen3.5-27B
262,144 2 $0.33/1M $2.64/1M prepaid BYOK
qwen/qwen3.5-397b-a17b
Qwen: Qwen3.5 397B A17B
262,144 2 $0.605/1M $3.85/1M prepaid BYOK
z-ai/glm-4.7
Z.ai: GLM 4.7
IQ 102#46 202,752 2 $0.935/1M $3.63/1M prepaid BYOK
z-ai/glm-4.7-flash
Z.ai: GLM 4.7 Flash
202,752 2 $0.11/1M $0.473/1M prepaid BYOK
z-ai/glm-5
Z.ai: GLM 5
IQ 107#34 204,800 2 $1.32/1M $3.85/1M prepaid BYOK
z-ai/glm-5.1
Z.ai: GLM 5.1
IQ 113#19 202,752 2 $1.331/1M $4.62/1M prepaid BYOK
z-ai/glm-5.2
GLM 5.2
IQ 117#10 1,048,576 2 $1.54/1M $4.84/1M prepaid BYOK

Sign in

Choose a sign in method.