OpenAI compatible API. Attested gateway. Public status.
Novita AI
Novita AI models on TrustedRouter with prices, routes, policy notes, and source links.
1 URLbase_url migration
100smodels and routes
0prompt logs by default
novita
No provider claim
| Provider | Novita AI |
|---|---|
| Models | 101 public models |
| Prepaid routes | 83 |
| BYOK routes | 101 |
| Zero data retention | not claimed |
| Confidential compute | not claimed |
| Provider E2EE | not claimed |
| Policy note | No provider-ZDR claim is tracked here. Novita's privacy policy says personal information is not used for model training; customer-content processing is governed by customer agreements. Policy source |
Measured performance
259 samplesContinuously sampled across Novita AI's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.
| p50 TTFT | 7094 ms |
|---|---|
| Throughput | — |
| Uptime | 88.03% |
| Model | p50 TTFT | p50 TTFB | Throughput | Uptime | Config excluded | Samples |
|---|---|---|---|---|---|---|
| mistralai/mistral-nemo | 1225 ms | 1225 ms | — | 100.00% | — | 3 |
| qwen/qwen3.5-35b-a3b | 1329 ms | 1329 ms | — | 100.00% | — | 2 |
| qwen/qwen3-235b-a22b-thinking-2507 | 1444 ms | 1443 ms | — | 100.00% | — | 2 |
| qwen/qwen3-coder-480b-a35b-instruct | 1468 ms | 1468 ms | — | 75.00% | — | 4 |
| qwen/qwen-2.5-72b-instruct | 1487 ms | 1487 ms | — | 100.00% | — | 2 |
| deepseek/deepseek-r1-turbo | 1507 ms | 1506 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.7-flash | 1545 ms | 1545 ms | — | 66.67% | — | 3 |
| deepseek/deepseek-v3.2-exp | 1616 ms | 1615 ms | — | 100.00% | — | 3 |
| qwen/qwen3-235b-a22b-instruct-2507 | 1787 ms | 1786 ms | — | 100.00% | — | 2 |
| inclusionai/ling-2.6-1t | 1826 ms | 1825 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-ocr | 1895 ms | 1894 ms | — | 100.00% | — | 3 |
| qwen/qwen3-max | 1909 ms | 1908 ms | — | 100.00% | — | 2 |
| qwen/qwen3-coder-30b-a3b-instruct | 2010 ms | 2009 ms | — | 100.00% | — | 2 |
| deepseek/deepseek-v3.1-terminus | 2013 ms | 2012 ms | — | 100.00% | — | 2 |
| deepseek/deepseek-v3.2 | 2074 ms | 2073 ms | — | 100.00% | — | 2 |
| meta-llama/llama-3.1-8b-instruct | 2077 ms | 2076 ms | — | 100.00% | — | 2 |
| deepseek/deepseek-r1-distill-llama-70b | 2140 ms | 2139 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-v4-pro | 2233 ms | 2232 ms | — | 100.00% | — | 2 |
| openai/gpt-oss-120b | 2291 ms | 2290 ms | — | 100.00% | — | 4 |
| kwaipilot/kat-coder-pro | 2391 ms | 2390 ms | — | 100.00% | — | 3 |
| minimax/minimax-m2.5-highspeed | 2865 ms | 2864 ms | — | 100.00% | — | 2 |
| qwen/qwen3-vl-30b-a3b-instruct | 2936 ms | 2936 ms | — | 100.00% | — | 4 |
| minimax/minimax-m2.7 | 2970 ms | 2969 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.6 | 3318 ms | 3318 ms | — | 100.00% | — | 2 |
| sao10k/l3-8b-lunaris | 3563 ms | 3563 ms | — | 100.00% | — | 2 |
| qwen/qwen3-next-80b-a3b-instruct | 3590 ms | 3589 ms | — | 100.00% | — | 4 |
| moonshotai/kimi-k2-0905 | 3695 ms | 3695 ms | — | 100.00% | — | 3 |
| meta-llama/llama-4-maverick-17b-128e-instruct-fp8 | 3908 ms | 3908 ms | — | 100.00% | — | 4 |
| zai-org/glm-5 | 3954 ms | 3953 ms | — | 100.00% | — | 4 |
| microsoft/wizardlm-2-8x22b | 3991 ms | 3991 ms | — | 100.00% | — | 2 |
| zai-org/glm-4.7 | 4630 ms | 4629 ms | — | 100.00% | — | 6 |
| moonshotai/kimi-k2.6 | 5094 ms | 5093 ms | — | 100.00% | — | 4 |
| qwen/qwen3.5-397b-a17b | 5208 ms | 5208 ms | — | 100.00% | — | 6 |
| sao10k/l31-70b-euryale-v2.2 | 5219 ms | 5219 ms | — | 100.00% | — | 2 |
| qwen/qwen3-coder-next | 5285 ms | 5285 ms | — | 100.00% | — | 1 |
| deepseek/deepseek-v3-0324 | 5544 ms | 5544 ms | — | 100.00% | — | 3 |
| moonshotai/kimi-k2-instruct | 5592 ms | 5591 ms | — | 100.00% | — | 3 |
| qwen/qwen3-omni-30b-a3b-instruct | 5758 ms | 5757 ms | — | 100.00% | — | 3 |
| inclusionai/ring-2.6-1t | 5943 ms | 5943 ms | — | 100.00% | — | 3 |
| qwen/qwen3-235b-a22b-fp8 | 7094 ms | 7094 ms | — | 87.50% | — | 8 |
| zai-org/glm-4.5-air | 7286 ms | 7285 ms | — | 100.00% | — | 4 |
| minimax/minimax-m2.1 | 7310 ms | 7310 ms | — | 100.00% | — | 4 |
| zai-org/glm-4.5v | 7364 ms | 7364 ms | — | 100.00% | — | 3 |
| xiaomimimo/mimo-v2.5-pro | 7456 ms | 7455 ms | — | 100.00% | — | 3 |
| qwen/qwen3-vl-235b-a22b-thinking | 7518 ms | 7518 ms | — | 100.00% | — | 4 |
| google/gemma-3-27b-it | 7540 ms | 7540 ms | — | 100.00% | — | 3 |
| minimax/minimax-m2.5 | 7556 ms | 7555 ms | — | 100.00% | — | 3 |
| qwen/qwen3-vl-235b-a22b-instruct | 7793 ms | 7792 ms | — | 100.00% | — | 5 |
| qwen/qwen3.6-35b-a3b | 7842 ms | 7841 ms | — | 100.00% | — | 3 |
| inclusionai/ling-2.6-flash | 7944 ms | 7944 ms | — | 75.00% | — | 4 |
| minimax/minimax-m2 | 8099 ms | 8098 ms | — | 100.00% | — | 4 |
| qwen/qwen3.6-27b | 8290 ms | 8290 ms | — | 100.00% | — | 2 |
| minimaxai/minimax-m1-80k | 8423 ms | 8422 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-v3-turbo | 8522 ms | 8522 ms | — | 100.00% | — | 6 |
| deepseek/deepseek-v4-flash | 8623 ms | 8623 ms | — | 100.00% | — | 7 |
| zai-org/glm-5.1 | 8654 ms | 8653 ms | — | 100.00% | — | 2 |
| qwen/qwen-mt-plus | 8719 ms | 8718 ms | — | 100.00% | — | 6 |
| meta-llama/llama-4-scout-17b-16e-instruct | 9171 ms | 9170 ms | — | 100.00% | — | 5 |
| zai-org/autoglm-phone-9b-multilingual | 9359 ms | 9358 ms | — | 100.00% | — | 2 |
| baidu/ernie-4.5-vl-424b-a47b | 9394 ms | 9393 ms | — | 100.00% | — | 6 |
| google/gemma-4-26b-a4b-it | 10034 ms | 10033 ms | — | 100.00% | — | 3 |
| zai-org/glm-4.6v | 10058 ms | 10058 ms | — | 100.00% | — | 6 |
| qwen/qwen3.5-122b-a10b | 10070 ms | 10070 ms | — | 100.00% | — | 6 |
| moonshotai/kimi-k2.5 | 10409 ms | 10408 ms | — | 100.00% | — | 1 |
| deepseek/deepseek-ocr-2 | 10569 ms | 10568 ms | — | 100.00% | — | 3 |
| deepseek/deepseek-r1-0528 | 11805 ms | 11804 ms | — | 100.00% | — | 3 |
| openai/gpt-oss-20b | 14010 ms | 14009 ms | — | 100.00% | — | 1 |
| qwen/qwen3-omni-30b-a3b-thinking | 14980 ms | 14979 ms | — | 100.00% | — | 1 |
| deepseek/deepseek-v3.1 | 14982 ms | 14982 ms | — | 100.00% | — | 2 |
| moonshotai/kimi-k2.7-code | 15450 ms | 15449 ms | — | 100.00% | — | 1 |
| Sao10K/L3-8B-Stheno-v3.2 | 16492 ms | 16491 ms | — | 100.00% | — | 5 |
| zai-org/glm-4.5 | — | — | — | 0.00% | — | 2 |
| baidu/ernie-4.5-21B-a3b | — | — | — | 0.00% | — | 4 |
| google/gemma-3-12b-it | — | — | — | 0.00% | — | 4 |
| meta-llama/llama-3-70b-instruct | — | — | — | 0.00% | — | 2 |
| moonshotai/kimi-k2-thinking | — | — | — | 0.00% | — | 4 |
| meta-llama/llama-3.3-70b-instruct | — | — | — | 0.00% | — | 3 |
| elephant | — | — | — | 0.00% | — | 3 |
| baichuan/baichuan-m2-32b | — | — | — | 0.00% | — | 2 |
| baidu/ernie-4.5-vl-28b-a3b | — | — | — | 0.00% | — | 3 |
Provider models
Models served by Novita AI.
Each row links to pricing, provider, benchmark, and API pages for the model.
| Model | AI IQ | Context | Endpoints | Prompt | Completion | Routes |
|---|---|---|---|---|---|---|
Sao10K/L3-8B-Stheno-v3.2L3 8B Stheno V3.2 |
— | 8,192 | 2 | $0.055/1M | $0.055/1M | prepaid BYOK |
baichuan/baichuan-m2-32bBaiChuan M2 32B |
— | 131,072 | 2 | $0.077/1M | $0.077/1M | prepaid BYOK |
baidu/ernie-4.5-21B-a3bERNIE 4.5 21B A3B |
— | 120,000 | 2 | $0.077/1M | $0.308/1M | prepaid BYOK |
baidu/ernie-4.5-21B-a3b-thinkingERNIE-4.5-21B-A3B-Thinking |
— | 131,072 | 1 | $0.077/1M | $0.308/1M | BYOK |
baidu/ernie-4.5-300b-a47b-paddleERNIE 4.5 300B A47B |
— | 123,000 | 1 | $0.308/1M | $1.21/1M | BYOK |
baidu/ernie-4.5-vl-28b-a3bERNIE 4.5 VL 28B A3B |
— | 30,000 | 2 | $0.154/1M | $0.616/1M | prepaid BYOK |
baidu/ernie-4.5-vl-28b-a3b-thinkingERNIE-4.5-VL-28B-A3B-Thinking |
— | 131,072 | 1 | $0.429/1M | $0.429/1M | BYOK |
baidu/ernie-4.5-vl-424b-a47bERNIE 4.5 VL 424B A47B |
— | 123,000 | 2 | $0.462/1M | $1.375/1M | prepaid BYOK |
deepseek/deepseek-ocrDeepSeek-OCR |
— | 8,192 | 2 | $0.033/1M | $0.033/1M | prepaid BYOK |
deepseek/deepseek-ocr-2DeepSeek-OCR 2 |
— | 8,192 | 2 | $0.033/1M | $0.033/1M | prepaid BYOK |
deepseek/deepseek-prover-v2-671bDeepseek Prover V2 671B |
— | 160,000 | 1 | $0.77/1M | $2.75/1M | BYOK |
deepseek/deepseek-r1-0528DeepSeek R1 0528 |
— | 163,840 | 2 | $0.77/1M | $2.75/1M | prepaid BYOK |
deepseek/deepseek-r1-0528-qwen3-8bDeepSeek R1 0528 Qwen3 8B |
— | 128,000 | 1 | $0.066/1M | $0.099/1M | BYOK |
deepseek/deepseek-r1-distill-llama-70bDeepSeek R1 Distill LLama 70B |
— | 8,192 | 2 | $0.88/1M | $0.88/1M | prepaid BYOK |
deepseek/deepseek-r1-turboDeepSeek R1 (Turbo) |
— | 64,000 | 2 | $0.77/1M | $2.75/1M | prepaid BYOK |
deepseek/deepseek-v3-0324DeepSeek V3 0324 |
— | 163,840 | 2 | $0.297/1M | $1.232/1M | prepaid BYOK |
deepseek/deepseek-v3-turboDeepSeek V3 (Turbo) |
— | 64,000 | 2 | $0.44/1M | $1.43/1M | prepaid BYOK |
deepseek/deepseek-v3.1DeepSeek V3.1 |
IQ 92#67 | 131,072 | 2 | $0.297/1M | $1.1/1M | prepaid BYOK |
deepseek/deepseek-v3.1-terminusDeepseek V3.1 Terminus |
— | 131,072 | 2 | $0.297/1M | $1.1/1M | prepaid BYOK |
deepseek/deepseek-v3.2DeepSeek: DeepSeek V3.2 |
IQ 101#47 | 163,840 | 2 | $0.2959/1M | $0.44/1M | prepaid BYOK |
deepseek/deepseek-v3.2-expDeepseek V3.2 Exp |
— | 163,840 | 2 | $0.297/1M | $0.451/1M | prepaid BYOK |
deepseek/deepseek-v4-flashDeepSeek: DeepSeek V4 Flash |
IQ 104#38 | 1,048,576 | 2 | $0.154/1M | $0.308/1M | prepaid BYOK |
deepseek/deepseek-v4-proDeepSeek: DeepSeek V4 Pro |
IQ 109#28 | 1,048,576 | 2 | $1.859/1M | $3.718/1M | prepaid BYOK |
elephantLing-2.6-flash |
— | 262,144 | 2 | $0.11/1M | $0.33/1M | prepaid BYOK |
google/gemma-3-12b-itGoogle: Gemma 3 12B |
— | 131,072 | 2 | $0.055/1M | $0.11/1M | prepaid BYOK |
google/gemma-3-27b-itGoogle: Gemma 3 27B |
— | 131,072 | 2 | $0.1309/1M | $0.22/1M | prepaid BYOK |
google/gemma-4-26b-a4b-itGoogle: Gemma 4 26B A4B |
IQ 94#63 | 262,144 | 2 | $0.143/1M | $0.44/1M | prepaid BYOK |
google/gemma-4-31b-itGoogle: Gemma 4 31B |
IQ 98#52 | 262,144 | 2 | $0.154/1M | $0.44/1M | prepaid BYOK |
gryphe/mythomax-l2-13bMythomax L2 13B |
— | 4,096 | 1 | $0.099/1M | $0.099/1M | BYOK |
inclusionai/ling-2.6-1tLing-2.6-1T |
— | 262,144 | 2 | $0.33/1M | $2.75/1M | prepaid BYOK |
inclusionai/ling-2.6-flashLing-2.6-flash |
— | 262,144 | 2 | $0.11/1M | $0.33/1M | prepaid BYOK |
inclusionai/ring-2.6-1tRing-2.6-1T |
IQ 103#45 | 262,144 | 2 | $0.01/1M | $0.01/1M | prepaid BYOK |
kwaipilot/kat-coder-proKat Coder Pro |
— | 256,000 | 2 | $0.33/1M | $1.32/1M | prepaid BYOK |
meta-llama/llama-3-70b-instructLlama3 70B Instruct |
— | 8,192 | 2 | $0.561/1M | $0.814/1M | prepaid BYOK |
meta-llama/llama-3-8b-instructLlama 3 8B Instruct |
— | 8,192 | 1 | $0.044/1M | $0.044/1M | BYOK |
meta-llama/llama-3.1-8b-instructMeta: Llama 3.1 8B Instruct |
— | 131,072 | 2 | $0.022/1M | $0.055/1M | prepaid BYOK |
meta-llama/llama-3.2-3b-instructLlama 3.2 3B Instruct |
— | 32,768 | 1 | $0.033/1M | $0.055/1M | BYOK |
meta-llama/llama-3.3-70b-instructMeta: Llama 3.3 70B Instruct |
— | 131,072 | 2 | $0.1485/1M | $0.44/1M | prepaid BYOK |
meta-llama/llama-4-maverick-17b-128e-instruct-fp8Llama 4 Maverick Instruct |
— | 1,048,576 | 2 | $0.297/1M | $0.935/1M | prepaid BYOK |
meta-llama/llama-4-scout-17b-16e-instructLlama 4 Scout Instruct |
— | 131,072 | 2 | $0.198/1M | $0.649/1M | prepaid BYOK |
microsoft/wizardlm-2-8x22bWizardlm 2 8x22B |
— | 65,535 | 2 | $0.682/1M | $0.682/1M | prepaid BYOK |
minimax/minimax-m2MiniMax-M2 |
— | 204,800 | 2 | $0.33/1M | $1.32/1M | prepaid BYOK |
minimax/minimax-m2.1Minimax M2.1 |
IQ 100#51 | 204,800 | 2 | $0.33/1M | $1.32/1M | prepaid BYOK |
minimax/minimax-m2.5MiniMax: MiniMax M2.5 |
IQ 103#43 | 204,800 | 2 | $0.33/1M | $1.32/1M | prepaid BYOK |
minimax/minimax-m2.5-highspeedMiniMax M2.5-highspeed |
— | 204,800 | 2 | $0.66/1M | $2.64/1M | prepaid BYOK |
minimax/minimax-m2.7MiniMax M2.7 |
IQ 105#37 | 204,800 | 2 | $0.33/1M | $1.32/1M | prepaid BYOK |
minimaxai/minimax-m1-80kMiniMax M1 |
— | 1,000,000 | 2 | $0.605/1M | $2.42/1M | prepaid BYOK |
mistralai/mistral-nemoMistral: Mistral Nemo |
— | 131,072 | 2 | $0.044/1M | $0.187/1M | prepaid BYOK |
moonshotai/kimi-k2-0905Kimi K2 0905 |
— | 262,144 | 2 | $0.66/1M | $2.75/1M | prepaid BYOK |
moonshotai/kimi-k2-instructKimi K2 Instruct |
IQ 92#70 | 131,072 | 2 | $0.627/1M | $2.53/1M | prepaid BYOK |
moonshotai/kimi-k2-thinkingKimi K2 Thinking |
IQ 92#70 | 262,144 | 2 | $0.66/1M | $2.75/1M | prepaid BYOK |
moonshotai/kimi-k2.5MoonshotAI: Kimi K2.5 |
IQ 109#29 | 262,144 | 2 | $0.66/1M | $3.3/1M | prepaid BYOK |
moonshotai/kimi-k2.6MoonshotAI: Kimi K2.6 |
IQ 117#11 | 262,144 | 2 | $0.88/1M | $3.74/1M | prepaid BYOK |
moonshotai/kimi-k2.7-codeMoonshotAI: Kimi K2.7 Code |
IQ 116#13 | 262,144 | 2 | $1.045/1M | $4.4/1M | prepaid BYOK |
nousresearch/hermes-2-pro-llama-3-8bHermes 2 Pro Llama 3 8B |
— | 8,192 | 1 | $0.154/1M | $0.154/1M | BYOK |
openai/gpt-oss-120bOpenAI: gpt-oss-120b |
IQ 95#59 | 131,072 | 2 | $0.055/1M | $0.275/1M | prepaid BYOK |
openai/gpt-oss-20bOpenAI: gpt-oss-20b |
IQ 92#69 | 131,072 | 2 | $0.044/1M | $0.165/1M | prepaid BYOK |
paddlepaddle/paddleocr-vlPaddleOCR-VL |
— | 16,384 | 1 | $0.022/1M | $0.022/1M | BYOK |
qwen/qwen-2.5-72b-instructQwen2.5 72B Instruct |
— | 131,072 | 2 | $0.418/1M | $0.44/1M | prepaid BYOK |
qwen/qwen-mt-plusQwen MT Plus |
— | 16,384 | 2 | $0.275/1M | $0.825/1M | prepaid BYOK |
qwen/qwen2.5-7b-instructQwen2.5 7B Instruct |
— | 32,000 | 1 | $0.077/1M | $0.077/1M | BYOK |
qwen/qwen2.5-vl-72b-instructQwen: Qwen2.5 VL 72B Instruct |
— | 131,072 | 1 | $0.88/1M | $0.88/1M | BYOK |
qwen/qwen3-235b-a22b-fp8Qwen3 235B A22B |
— | 40,960 | 2 | $0.22/1M | $0.88/1M | prepaid BYOK |
qwen/qwen3-235b-a22b-instruct-2507Qwen3 235B A22B Instruct 2507 |
— | 131,072 | 2 | $0.099/1M | $0.638/1M | prepaid BYOK |
qwen/qwen3-235b-a22b-thinking-2507Qwen: Qwen3 235B A22B Thinking 2507 |
— | 262,144 | 2 | $0.33/1M | $3.3/1M | prepaid BYOK |
qwen/qwen3-30b-a3b-fp8Qwen3 30B A3B |
— | 40,960 | 1 | $0.099/1M | $0.495/1M | BYOK |
qwen/qwen3-32b-fp8Qwen3 32B |
— | 40,960 | 1 | $0.11/1M | $0.495/1M | BYOK |
qwen/qwen3-4b-fp8Qwen3 4B |
— | 128,000 | 1 | $0.033/1M | $0.033/1M | BYOK |
qwen/qwen3-8b-fp8Qwen3 8B |
— | 128,000 | 1 | $0.0385/1M | $0.1518/1M | BYOK |
qwen/qwen3-coder-30b-a3b-instructQwen3 Coder 30b A3B Instruct |
— | 160,000 | 2 | $0.077/1M | $0.297/1M | prepaid BYOK |
qwen/qwen3-coder-480b-a35b-instructQwen3 Coder 480B A35B Instruct |
— | 262,144 | 2 | $0.418/1M | $1.705/1M | prepaid BYOK |
qwen/qwen3-coder-nextQwen: Qwen3 Coder Next |
— | 262,144 | 2 | $0.22/1M | $1.65/1M | prepaid BYOK |
qwen/qwen3-maxQwen3 Max |
— | 262,144 | 2 | $2.321/1M | $9.295/1M | prepaid BYOK |
qwen/qwen3-next-80b-a3b-instructQwen: Qwen3 Next 80B A3B Instruct |
— | 262,144 | 2 | $0.165/1M | $1.65/1M | prepaid BYOK |
qwen/qwen3-omni-30b-a3b-instructQwen3 Omni 30B A3B Instruct |
— | 65,536 | 2 | $0.01/1M | $0.01/1M | prepaid BYOK |
qwen/qwen3-omni-30b-a3b-thinkingQwen3 Omni 30B A3B Thinking |
— | 65,536 | 2 | $0.01/1M | $0.01/1M | prepaid BYOK |
qwen/qwen3-vl-235b-a22b-instructQwen: Qwen3 VL 235B A22B Instruct |
— | 262,144 | 2 | $0.33/1M | $1.65/1M | prepaid BYOK |
qwen/qwen3-vl-235b-a22b-thinkingQwen3 VL 235B A22B Thinking |
— | 131,072 | 2 | $1.078/1M | $4.345/1M | prepaid BYOK |
qwen/qwen3-vl-30b-a3b-instructQwen: Qwen3 VL 30B A3B Instruct |
— | 262,144 | 2 | $0.22/1M | $0.77/1M | prepaid BYOK |
qwen/qwen3.5-122b-a10bQwen3.5-122B-A10B |
— | 262,144 | 2 | $0.44/1M | $3.52/1M | prepaid BYOK |
qwen/qwen3.5-27bQwen: Qwen3.5-27B |
— | 262,144 | 2 | $0.33/1M | $2.64/1M | prepaid BYOK |
qwen/qwen3.5-35b-a3bQwen: Qwen3.5-35B-A3B |
— | 262,144 | 2 | $0.275/1M | $2.2/1M | prepaid BYOK |
qwen/qwen3.5-397b-a17bQwen: Qwen3.5 397B A17B |
— | 262,144 | 2 | $0.66/1M | $3.96/1M | prepaid BYOK |
qwen/qwen3.6-27bQwen: Qwen3.6 27B |
IQ 104#41 | 262,144 | 2 | $0.66/1M | $3.96/1M | prepaid BYOK |
qwen/qwen3.6-35b-a3bQwen: Qwen3.6 35B A3B |
IQ 96#56 | 262,144 | 2 | $0.2728/1M | $1.6335/1M | prepaid BYOK |
sao10k/l3-70b-euryale-v2.1L3 70B Euryale V2.1 |
— | 8,192 | 1 | $1.628/1M | $1.628/1M | BYOK |
sao10k/l3-8b-lunarisSao10k L3 8B Lunaris |
— | 8,192 | 2 | $0.055/1M | $0.055/1M | prepaid BYOK |
sao10k/l31-70b-euryale-v2.2L31 70B Euryale V2.2 |
— | 8,192 | 2 | $1.628/1M | $1.628/1M | prepaid BYOK |
xiaomimimo/mimo-v2-flashXiaomiMiMo/MiMo-V2-Flash |
IQ 101#48 | 262,144 | 1 | $0.11/1M | $0.33/1M | BYOK |
xiaomimimo/mimo-v2.5-proXiaomiMiMo/MiMo-V2.5-Pro |
IQ 112#22 | 1,048,576 | 2 | $2.2/1M | $6.6/1M | prepaid BYOK |
z-ai/glm-5.2GLM 5.2 |
IQ 117#10 | 1,048,576 | 2 | $1.54/1M | $4.84/1M | prepaid BYOK |
zai-org/autoglm-phone-9b-multilingualAutoGLM-Phone-9B-Multilingual |
— | 65,536 | 2 | $0.0385/1M | $0.1518/1M | prepaid BYOK |
zai-org/glm-4.5GLM-4.5 |
— | 131,072 | 2 | $0.66/1M | $2.42/1M | prepaid BYOK |
zai-org/glm-4.5-airzai-org/glm-4.5-air |
— | 131,072 | 2 | $0.143/1M | $0.935/1M | prepaid BYOK |
zai-org/glm-4.5vGLM 4.5V |
— | 65,536 | 2 | $0.66/1M | $1.98/1M | prepaid BYOK |
zai-org/glm-4.6GLM 4.6 |
— | 204,800 | 2 | $0.605/1M | $2.42/1M | prepaid BYOK |
zai-org/glm-4.6vGLM 4.6V |
— | 131,072 | 2 | $0.33/1M | $0.99/1M | prepaid BYOK |
zai-org/glm-4.7GLM-4.7 |
IQ 102#46 | 204,800 | 2 | $0.66/1M | $2.42/1M | prepaid BYOK |
zai-org/glm-4.7-flashGLM-4.7-Flash |
— | 200,000 | 2 | $0.077/1M | $0.44/1M | prepaid BYOK |
zai-org/glm-5GLM-5 |
IQ 107#34 | 202,800 | 2 | $1.1/1M | $3.52/1M | prepaid BYOK |
zai-org/glm-5.1GLM-5.1 |
IQ 113#19 | 204,800 | 2 | $1.518/1M | $4.84/1M | prepaid BYOK |