OpenAI compatible API. Attested gateway. Public status.

Phala

Phala models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway

1 URLbase_url migration

100smodels and routes

0prompt logs by default

`phala`

Confidential

All providers

Provider	Phala
Models	19 public models
Prepaid routes	19
BYOK routes	19
Zero data retention	yes
Confidential compute	yes
Provider E2EE	yes
Policy note	Tracked as a confidential AI provider with provider-side attestation and encrypted prompt transport. Policy source

Measured performance

259 samples

Continuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT	7868 ms
Throughput	5 tok/s
Uptime	88.42%

Model	p50 TTFT	p50 TTFB	Throughput	Uptime	Config excluded	Samples
z-ai/glm-4.7	3226 ms	3225 ms	—	100.00%	—	9
qwen/qwen2.5-vl-72b-instruct	3258 ms	3257 ms	—	100.00%	—	10
qwen/qwen3-vl-30b-a3b-instruct	3460 ms	3459 ms	—	100.00%	—	10
z-ai/glm-5.2	4078 ms	4077 ms	—	83.33%	—	12
openai/gpt-oss-120b	4514 ms	4513 ms	—	100.00%	—	16
moonshotai/kimi-k2.6	4632 ms	4631 ms	—	100.00%	—	10
z-ai/glm-5.1	4854 ms	4853 ms	—	75.00%	—	8
qwen/qwen3-30b-a3b-instruct-2507	5141 ms	5140 ms	—	100.00%	—	19
google/gemma-3-27b-it	5855 ms	5855 ms	—	84.62%	—	13
minimax/minimax-m2.5	6043 ms	6043 ms	—	100.00%	—	13
deepseek/deepseek-chat-v3.1	7868 ms	7867 ms	—	100.00%	—	11
z-ai/glm-5	9044 ms	9044 ms	—	94.44%	—	18
deepseek/deepseek-v3.2	9377 ms	9376 ms	—	100.00%	—	7
openai/gpt-oss-20b	9516 ms	9516 ms	—	95.24%	—	21
qwen/qwen3.5-27b	10421 ms	10420 ms	—	94.44%	—	18
moonshotai/kimi-k2.5	11252 ms	11251 ms	—	79.17%	—	24
qwen/qwen-2.5-7b-instruct	11687 ms	11687 ms	5 tok/s	100.00%	—	9
qwen/qwen3.5-397b-a17b	12219 ms	12218 ms	—	83.33%	—	18
z-ai/glm-4.7-flash	—	—	—	0.00%	—	13

Full provider & model leaderboard.

Provider models

Models served by Phala.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model	AI IQ	Context	Endpoints	Prompt	Completion	Routes
`deepseek/deepseek-chat-v3.1` DeepSeek: DeepSeek V3.1 benchmarks performance api	—	163,840	2	$1.155/1M	$3.41/1M	prepaid BYOK
`deepseek/deepseek-v3.2` DeepSeek: DeepSeek V3.2 benchmarks performance api	IQ 101#47	163,840	2	$0.352/1M	$0.528/1M	prepaid BYOK
`google/gemma-3-27b-it` Google: Gemma 3 27B benchmarks performance api	—	131,072	2	$0.121/1M	$0.44/1M	prepaid BYOK
`minimax/minimax-m2.5` MiniMax: MiniMax M2.5 benchmarks performance api	IQ 103#43	204,800	2	$0.22/1M	$1.518/1M	prepaid BYOK
`moonshotai/kimi-k2.5` MoonshotAI: Kimi K2.5 benchmarks performance api	IQ 109#29	262,144	2	$0.66/1M	$3.3/1M	prepaid BYOK
`moonshotai/kimi-k2.6` MoonshotAI: Kimi K2.6 benchmarks performance api	IQ 117#11	262,144	2	$1.199/1M	$5.06/1M	prepaid BYOK
`openai/gpt-oss-120b` OpenAI: gpt-oss-120b benchmarks performance api	IQ 95#59	131,072	2	$0.165/1M	$0.66/1M	prepaid BYOK
`openai/gpt-oss-20b` OpenAI: gpt-oss-20b benchmarks performance api	IQ 92#69	131,072	2	$0.044/1M	$0.165/1M	prepaid BYOK
`qwen/qwen-2.5-7b-instruct` Qwen: Qwen2.5 7B Instruct benchmarks performance api	—	131,072	2	$0.044/1M	$0.11/1M	prepaid BYOK
`qwen/qwen2.5-vl-72b-instruct` Qwen: Qwen2.5 VL 72B Instruct benchmarks performance api	—	131,072	2	$0.22/1M	$0.77/1M	prepaid BYOK
`qwen/qwen3-30b-a3b-instruct-2507` Qwen: Qwen3 30B A3B Instruct 2507 benchmarks performance api	—	131,072	2	$0.165/1M	$0.605/1M	prepaid BYOK
`qwen/qwen3-vl-30b-a3b-instruct` Qwen: Qwen3 VL 30B A3B Instruct benchmarks performance api	—	262,144	2	$0.22/1M	$0.77/1M	prepaid BYOK
`qwen/qwen3.5-27b` Qwen: Qwen3.5-27B benchmarks performance api	—	262,144	2	$0.33/1M	$2.64/1M	prepaid BYOK
`qwen/qwen3.5-397b-a17b` Qwen: Qwen3.5 397B A17B benchmarks performance api	—	262,144	2	$0.605/1M	$3.85/1M	prepaid BYOK
`z-ai/glm-4.7` Z.ai: GLM 4.7 benchmarks performance api	IQ 102#46	202,752	2	$0.935/1M	$3.63/1M	prepaid BYOK
`z-ai/glm-4.7-flash` Z.ai: GLM 4.7 Flash benchmarks performance api	—	202,752	2	$0.11/1M	$0.473/1M	prepaid BYOK
`z-ai/glm-5` Z.ai: GLM 5 benchmarks performance api	IQ 107#34	204,800	2	$1.32/1M	$3.85/1M	prepaid BYOK
`z-ai/glm-5.1` Z.ai: GLM 5.1 benchmarks performance api	IQ 113#19	202,752	2	$1.331/1M	$4.62/1M	prepaid BYOK
`z-ai/glm-5.2` GLM 5.2 benchmarks performance api	IQ 117#10	1,048,576	2	$1.54/1M	$4.84/1M	prepaid BYOK