OpenAI compatible API. Attested gateway. Public status.
Phala
Phala models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
phala
Confidential
| Provider | Phala |
|---|---|
| Models | 19 public models |
| Prepaid routes | 19 |
| BYOK routes | 19 |
| Zero data retention | yes |
| Confidential compute | yes |
| Provider E2EE | yes |
| Policy note | Tracked as a confidential AI provider with provider-side attestation and encrypted prompt transport. Policy source |
Measured performance
259 samplesContinuously sampled across Phala's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 7868 ms |
|---|---|
| Throughput | 5 tok/s |
| Uptime | 88.42% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| z-ai/glm-4.7 | 3226 ms | 3225 ms | — | 100.00% | — | 9 |
| qwen/qwen2.5-vl-72b-instruct | 3258 ms | 3257 ms | — | 100.00% | — | 10 |
| qwen/qwen3-vl-30b-a3b-instruct | 3460 ms | 3459 ms | — | 100.00% | — | 10 |
| z-ai/glm-5.2 | 4078 ms | 4077 ms | — | 83.33% | — | 12 |
| openai/gpt-oss-120b | 4514 ms | 4513 ms | — | 100.00% | — | 16 |
| moonshotai/kimi-k2.6 | 4632 ms | 4631 ms | — | 100.00% | — | 10 |
| z-ai/glm-5.1 | 4854 ms | 4853 ms | — | 75.00% | — | 8 |
| qwen/qwen3-30b-a3b-instruct-2507 | 5141 ms | 5140 ms | — | 100.00% | — | 19 |
| google/gemma-3-27b-it | 5855 ms | 5855 ms | — | 84.62% | — | 13 |
| minimax/minimax-m2.5 | 6043 ms | 6043 ms | — | 100.00% | — | 13 |
| deepseek/deepseek-chat-v3.1 | 7868 ms | 7867 ms | — | 100.00% | — | 11 |
| z-ai/glm-5 | 9044 ms | 9044 ms | — | 94.44% | — | 18 |
| deepseek/deepseek-v3.2 | 9377 ms | 9376 ms | — | 100.00% | — | 7 |
| openai/gpt-oss-20b | 9516 ms | 9516 ms | — | 95.24% | — | 21 |
| qwen/qwen3.5-27b | 10421 ms | 10420 ms | — | 94.44% | — | 18 |
| moonshotai/kimi-k2.5 | 11252 ms | 11251 ms | — | 79.17% | — | 24 |
| qwen/qwen-2.5-7b-instruct | 11687 ms | 11687 ms | 5 tok/s | 100.00% | — | 9 |
| qwen/qwen3.5-397b-a17b | 12219 ms | 12218 ms | — | 83.33% | — | 18 |
| z-ai/glm-4.7-flash | — | — | — | 0.00% | — | 13 |
Provider models
Models served by Phala.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
deepseek/deepseek-chat-v3.1DeepSeek: DeepSeek V3.1 |
— | 163,840 | 2 | $1.155/1M | $3.41/1M | prepaid BYOK |
deepseek/deepseek-v3.2DeepSeek: DeepSeek V3.2 |
IQ 101#47 | 163,840 | 2 | $0.352/1M | $0.528/1M | prepaid BYOK |
google/gemma-3-27b-itGoogle: Gemma 3 27B |
— | 131,072 | 2 | $0.121/1M | $0.44/1M | prepaid BYOK |
minimax/minimax-m2.5MiniMax: MiniMax M2.5 |
IQ 103#43 | 204,800 | 2 | $0.22/1M | $1.518/1M | prepaid BYOK |
moonshotai/kimi-k2.5MoonshotAI: Kimi K2.5 |
IQ 109#29 | 262,144 | 2 | $0.66/1M | $3.3/1M | prepaid BYOK |
moonshotai/kimi-k2.6MoonshotAI: Kimi K2.6 |
IQ 117#11 | 262,144 | 2 | $1.199/1M | $5.06/1M | prepaid BYOK |
openai/gpt-oss-120bOpenAI: gpt-oss-120b |
IQ 95#59 | 131,072 | 2 | $0.165/1M | $0.66/1M | prepaid BYOK |
openai/gpt-oss-20bOpenAI: gpt-oss-20b |
IQ 92#69 | 131,072 | 2 | $0.044/1M | $0.165/1M | prepaid BYOK |
qwen/qwen-2.5-7b-instructQwen: Qwen2.5 7B Instruct |
— | 131,072 | 2 | $0.044/1M | $0.11/1M | prepaid BYOK |
qwen/qwen2.5-vl-72b-instructQwen: Qwen2.5 VL 72B Instruct |
— | 131,072 | 2 | $0.22/1M | $0.77/1M | prepaid BYOK |
qwen/qwen3-30b-a3b-instruct-2507Qwen: Qwen3 30B A3B Instruct 2507 |
— | 131,072 | 2 | $0.165/1M | $0.605/1M | prepaid BYOK |
qwen/qwen3-vl-30b-a3b-instructQwen: Qwen3 VL 30B A3B Instruct |
— | 262,144 | 2 | $0.22/1M | $0.77/1M | prepaid BYOK |
qwen/qwen3.5-27bQwen: Qwen3.5-27B |
— | 262,144 | 2 | $0.33/1M | $2.64/1M | prepaid BYOK |
qwen/qwen3.5-397b-a17bQwen: Qwen3.5 397B A17B |
— | 262,144 | 2 | $0.605/1M | $3.85/1M | prepaid BYOK |
z-ai/glm-4.7Z.ai: GLM 4.7 |
IQ 102#46 | 202,752 | 2 | $0.935/1M | $3.63/1M | prepaid BYOK |
z-ai/glm-4.7-flashZ.ai: GLM 4.7 Flash |
— | 202,752 | 2 | $0.11/1M | $0.473/1M | prepaid BYOK |
z-ai/glm-5Z.ai: GLM 5 |
IQ 107#34 | 204,800 | 2 | $1.32/1M | $3.85/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
IQ 113#19 | 202,752 | 2 | $1.331/1M | $4.62/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.54/1M | $4.84/1M | prepaid BYOK |