OpenAI compatible API. Attested gateway. Public status.
GMI Cloud
GMI Cloud models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
gmi
No provider claim
| Provider | GMI Cloud |
|---|---|
| Models | 9 public models |
| Prepaid routes | 4 |
| BYOK routes | 9 |
| Zero data retention | not claimed |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | GMI runs isolated/VPC GPU inference, but that is network isolation, NOT an attested TEE — so no confidential-compute, zero-retention, or E2EE claim is marked. Retention/training terms are unverified (the published policy page is JavaScript-only and would not render). Policy source |
Measured performance
259 samplesContinuously sampled across GMI Cloud's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 7567 ms |
|---|---|
| Throughput | — |
| Uptime | 87.26% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| z-ai/glm-5 | 6613 ms | 6613 ms | — | 55.93% | — | 59 |
| deepseek/deepseek-v4-pro | 7567 ms | 7566 ms | — | 100.00% | — | 74 |
| z-ai/glm-5.2 | 7959 ms | 7959 ms | — | 93.55% | — | 62 |
| z-ai/glm-5.1 | 8951 ms | 8950 ms | — | 95.31% | — | 64 |
Provider models
Models served by GMI Cloud.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
anthropic/claude-opus-4.7Anthropic: Claude Opus 4.7 |
IQ 127#4 | 1,000,000 | 1 | $5.5/1M | $27.5/1M | BYOK |
deepseek/deepseek-v4-proDeepSeek: DeepSeek V4 Pro |
IQ 109#28 | 1,048,576 | 2 | $1.5312/1M | $3.0624/1M | prepaid BYOK |
google/gemma-4-26b-a4b-itGoogle: Gemma 4 26B A4B |
IQ 94#63 | 262,144 | 1 | $0.143/1M | $0.44/1M | BYOK |
google/gemma-4-31b-itGoogle: Gemma 4 31B |
IQ 98#52 | 262,144 | 1 | $0.154/1M | $0.44/1M | BYOK |
openai/gpt-5.4-nanoOpenAI: GPT-5.4 Nano |
IQ 105#36 | 400,000 | 1 | $0.22/1M | $1.375/1M | BYOK |
openai/gpt-5.5OpenAI: GPT-5.5 |
IQ 129#3 | 1,050,000 | 1 | $5.5/1M | $33/1M | BYOK |
z-ai/glm-5Z.ai: GLM 5 |
IQ 107#34 | 204,800 | 2 | $0.66/1M | $2.112/1M | prepaid BYOK |
z-ai/glm-5.1Z.ai: GLM 5.1 |
IQ 113#19 | 202,752 | 2 | $1.078/1M | $3.388/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.078/1M | $3.388/1M | prepaid BYOK |