OpenAI compatible API. Attested gateway. Public status.

Novita AI

Novita AI models on TrustedRouter with prices, routes, policy notes, and source links.

Verify gateway
1 URLbase_url migration
100smodels and routes
0prompt logs by default

novita

No provider claim

All providers

ProviderNovita AI
Models101 public models
Prepaid routes83
BYOK routes101
Zero data retentionnot claimed
Confidential computenot claimed
Provider E2EEnot claimed
Policy noteNo provider-ZDR claim is tracked here. Novita's privacy policy says personal information is not used for model training; customer-content processing is governed by customer agreements.
Policy source

Measured performance

259 samples

Continuously sampled across Novita AI's routed models — p50 TTFT, throughput, and success rate. Unsupported route and probe-configuration rows are separated from provider downtime. No prompt or output content stored.

p50 TTFT7094 ms
Throughput
Uptime88.03%
Modelp50 TTFTp50 TTFBThroughputUptimeConfig excludedSamples
mistralai/mistral-nemo 1225 ms 1225 ms 100.00% 3
qwen/qwen3.5-35b-a3b 1329 ms 1329 ms 100.00% 2
qwen/qwen3-235b-a22b-thinking-2507 1444 ms 1443 ms 100.00% 2
qwen/qwen3-coder-480b-a35b-instruct 1468 ms 1468 ms 75.00% 4
qwen/qwen-2.5-72b-instruct 1487 ms 1487 ms 100.00% 2
deepseek/deepseek-r1-turbo 1507 ms 1506 ms 100.00% 3
zai-org/glm-4.7-flash 1545 ms 1545 ms 66.67% 3
deepseek/deepseek-v3.2-exp 1616 ms 1615 ms 100.00% 3
qwen/qwen3-235b-a22b-instruct-2507 1787 ms 1786 ms 100.00% 2
inclusionai/ling-2.6-1t 1826 ms 1825 ms 100.00% 3
deepseek/deepseek-ocr 1895 ms 1894 ms 100.00% 3
qwen/qwen3-max 1909 ms 1908 ms 100.00% 2
qwen/qwen3-coder-30b-a3b-instruct 2010 ms 2009 ms 100.00% 2
deepseek/deepseek-v3.1-terminus 2013 ms 2012 ms 100.00% 2
deepseek/deepseek-v3.2 2074 ms 2073 ms 100.00% 2
meta-llama/llama-3.1-8b-instruct 2077 ms 2076 ms 100.00% 2
deepseek/deepseek-r1-distill-llama-70b 2140 ms 2139 ms 100.00% 3
deepseek/deepseek-v4-pro 2233 ms 2232 ms 100.00% 2
openai/gpt-oss-120b 2291 ms 2290 ms 100.00% 4
kwaipilot/kat-coder-pro 2391 ms 2390 ms 100.00% 3
minimax/minimax-m2.5-highspeed 2865 ms 2864 ms 100.00% 2
qwen/qwen3-vl-30b-a3b-instruct 2936 ms 2936 ms 100.00% 4
minimax/minimax-m2.7 2970 ms 2969 ms 100.00% 3
zai-org/glm-4.6 3318 ms 3318 ms 100.00% 2
sao10k/l3-8b-lunaris 3563 ms 3563 ms 100.00% 2
qwen/qwen3-next-80b-a3b-instruct 3590 ms 3589 ms 100.00% 4
moonshotai/kimi-k2-0905 3695 ms 3695 ms 100.00% 3
meta-llama/llama-4-maverick-17b-128e-instruct-fp8 3908 ms 3908 ms 100.00% 4
zai-org/glm-5 3954 ms 3953 ms 100.00% 4
microsoft/wizardlm-2-8x22b 3991 ms 3991 ms 100.00% 2
zai-org/glm-4.7 4630 ms 4629 ms 100.00% 6
moonshotai/kimi-k2.6 5094 ms 5093 ms 100.00% 4
qwen/qwen3.5-397b-a17b 5208 ms 5208 ms 100.00% 6
sao10k/l31-70b-euryale-v2.2 5219 ms 5219 ms 100.00% 2
qwen/qwen3-coder-next 5285 ms 5285 ms 100.00% 1
deepseek/deepseek-v3-0324 5544 ms 5544 ms 100.00% 3
moonshotai/kimi-k2-instruct 5592 ms 5591 ms 100.00% 3
qwen/qwen3-omni-30b-a3b-instruct 5758 ms 5757 ms 100.00% 3
inclusionai/ring-2.6-1t 5943 ms 5943 ms 100.00% 3
qwen/qwen3-235b-a22b-fp8 7094 ms 7094 ms 87.50% 8
zai-org/glm-4.5-air 7286 ms 7285 ms 100.00% 4
minimax/minimax-m2.1 7310 ms 7310 ms 100.00% 4
zai-org/glm-4.5v 7364 ms 7364 ms 100.00% 3
xiaomimimo/mimo-v2.5-pro 7456 ms 7455 ms 100.00% 3
qwen/qwen3-vl-235b-a22b-thinking 7518 ms 7518 ms 100.00% 4
google/gemma-3-27b-it 7540 ms 7540 ms 100.00% 3
minimax/minimax-m2.5 7556 ms 7555 ms 100.00% 3
qwen/qwen3-vl-235b-a22b-instruct 7793 ms 7792 ms 100.00% 5
qwen/qwen3.6-35b-a3b 7842 ms 7841 ms 100.00% 3
inclusionai/ling-2.6-flash 7944 ms 7944 ms 75.00% 4
minimax/minimax-m2 8099 ms 8098 ms 100.00% 4
qwen/qwen3.6-27b 8290 ms 8290 ms 100.00% 2
minimaxai/minimax-m1-80k 8423 ms 8422 ms 100.00% 3
deepseek/deepseek-v3-turbo 8522 ms 8522 ms 100.00% 6
deepseek/deepseek-v4-flash 8623 ms 8623 ms 100.00% 7
zai-org/glm-5.1 8654 ms 8653 ms 100.00% 2
qwen/qwen-mt-plus 8719 ms 8718 ms 100.00% 6
meta-llama/llama-4-scout-17b-16e-instruct 9171 ms 9170 ms 100.00% 5
zai-org/autoglm-phone-9b-multilingual 9359 ms 9358 ms 100.00% 2
baidu/ernie-4.5-vl-424b-a47b 9394 ms 9393 ms 100.00% 6
google/gemma-4-26b-a4b-it 10034 ms 10033 ms 100.00% 3
zai-org/glm-4.6v 10058 ms 10058 ms 100.00% 6
qwen/qwen3.5-122b-a10b 10070 ms 10070 ms 100.00% 6
moonshotai/kimi-k2.5 10409 ms 10408 ms 100.00% 1
deepseek/deepseek-ocr-2 10569 ms 10568 ms 100.00% 3
deepseek/deepseek-r1-0528 11805 ms 11804 ms 100.00% 3
openai/gpt-oss-20b 14010 ms 14009 ms 100.00% 1
qwen/qwen3-omni-30b-a3b-thinking 14980 ms 14979 ms 100.00% 1
deepseek/deepseek-v3.1 14982 ms 14982 ms 100.00% 2
moonshotai/kimi-k2.7-code 15450 ms 15449 ms 100.00% 1
Sao10K/L3-8B-Stheno-v3.2 16492 ms 16491 ms 100.00% 5
zai-org/glm-4.5 0.00% 2
baidu/ernie-4.5-21B-a3b 0.00% 4
google/gemma-3-12b-it 0.00% 4
meta-llama/llama-3-70b-instruct 0.00% 2
moonshotai/kimi-k2-thinking 0.00% 4
meta-llama/llama-3.3-70b-instruct 0.00% 3
elephant 0.00% 3
baichuan/baichuan-m2-32b 0.00% 2
baidu/ernie-4.5-vl-28b-a3b 0.00% 3

Full provider & model leaderboard.

Provider models

Models served by Novita AI.

Each row links to pricing, provider, benchmark, and API pages for the model.

Model AI IQ Context Endpoints Prompt Completion Routes
Sao10K/L3-8B-Stheno-v3.2
L3 8B Stheno V3.2
8,192 2 $0.055/1M $0.055/1M prepaid BYOK
baichuan/baichuan-m2-32b
BaiChuan M2 32B
131,072 2 $0.077/1M $0.077/1M prepaid BYOK
baidu/ernie-4.5-21B-a3b
ERNIE 4.5 21B A3B
120,000 2 $0.077/1M $0.308/1M prepaid BYOK
baidu/ernie-4.5-21B-a3b-thinking
ERNIE-4.5-21B-A3B-Thinking
131,072 1 $0.077/1M $0.308/1M BYOK
baidu/ernie-4.5-300b-a47b-paddle
ERNIE 4.5 300B A47B
123,000 1 $0.308/1M $1.21/1M BYOK
baidu/ernie-4.5-vl-28b-a3b
ERNIE 4.5 VL 28B A3B
30,000 2 $0.154/1M $0.616/1M prepaid BYOK
baidu/ernie-4.5-vl-28b-a3b-thinking
ERNIE-4.5-VL-28B-A3B-Thinking
131,072 1 $0.429/1M $0.429/1M BYOK
baidu/ernie-4.5-vl-424b-a47b
ERNIE 4.5 VL 424B A47B
123,000 2 $0.462/1M $1.375/1M prepaid BYOK
deepseek/deepseek-ocr
DeepSeek-OCR
8,192 2 $0.033/1M $0.033/1M prepaid BYOK
deepseek/deepseek-ocr-2
DeepSeek-OCR 2
8,192 2 $0.033/1M $0.033/1M prepaid BYOK
deepseek/deepseek-prover-v2-671b
Deepseek Prover V2 671B
160,000 1 $0.77/1M $2.75/1M BYOK
deepseek/deepseek-r1-0528
DeepSeek R1 0528
163,840 2 $0.77/1M $2.75/1M prepaid BYOK
deepseek/deepseek-r1-0528-qwen3-8b
DeepSeek R1 0528 Qwen3 8B
128,000 1 $0.066/1M $0.099/1M BYOK
deepseek/deepseek-r1-distill-llama-70b
DeepSeek R1 Distill LLama 70B
8,192 2 $0.88/1M $0.88/1M prepaid BYOK
deepseek/deepseek-r1-turbo
DeepSeek R1 (Turbo)
64,000 2 $0.77/1M $2.75/1M prepaid BYOK
deepseek/deepseek-v3-0324
DeepSeek V3 0324
163,840 2 $0.297/1M $1.232/1M prepaid BYOK
deepseek/deepseek-v3-turbo
DeepSeek V3 (Turbo)
64,000 2 $0.44/1M $1.43/1M prepaid BYOK
deepseek/deepseek-v3.1
DeepSeek V3.1
IQ 92#67 131,072 2 $0.297/1M $1.1/1M prepaid BYOK
deepseek/deepseek-v3.1-terminus
Deepseek V3.1 Terminus
131,072 2 $0.297/1M $1.1/1M prepaid BYOK
deepseek/deepseek-v3.2
DeepSeek: DeepSeek V3.2
IQ 101#47 163,840 2 $0.2959/1M $0.44/1M prepaid BYOK
deepseek/deepseek-v3.2-exp
Deepseek V3.2 Exp
163,840 2 $0.297/1M $0.451/1M prepaid BYOK
deepseek/deepseek-v4-flash
DeepSeek: DeepSeek V4 Flash
IQ 104#38 1,048,576 2 $0.154/1M $0.308/1M prepaid BYOK
deepseek/deepseek-v4-pro
DeepSeek: DeepSeek V4 Pro
IQ 109#28 1,048,576 2 $1.859/1M $3.718/1M prepaid BYOK
elephant
Ling-2.6-flash
262,144 2 $0.11/1M $0.33/1M prepaid BYOK
google/gemma-3-12b-it
Google: Gemma 3 12B
131,072 2 $0.055/1M $0.11/1M prepaid BYOK
google/gemma-3-27b-it
Google: Gemma 3 27B
131,072 2 $0.1309/1M $0.22/1M prepaid BYOK
google/gemma-4-26b-a4b-it
Google: Gemma 4 26B A4B
IQ 94#63 262,144 2 $0.143/1M $0.44/1M prepaid BYOK
google/gemma-4-31b-it
Google: Gemma 4 31B
IQ 98#52 262,144 2 $0.154/1M $0.44/1M prepaid BYOK
gryphe/mythomax-l2-13b
Mythomax L2 13B
4,096 1 $0.099/1M $0.099/1M BYOK
inclusionai/ling-2.6-1t
Ling-2.6-1T
262,144 2 $0.33/1M $2.75/1M prepaid BYOK
inclusionai/ling-2.6-flash
Ling-2.6-flash
262,144 2 $0.11/1M $0.33/1M prepaid BYOK
inclusionai/ring-2.6-1t
Ring-2.6-1T
IQ 103#45 262,144 2 $0.01/1M $0.01/1M prepaid BYOK
kwaipilot/kat-coder-pro
Kat Coder Pro
256,000 2 $0.33/1M $1.32/1M prepaid BYOK
meta-llama/llama-3-70b-instruct
Llama3 70B Instruct
8,192 2 $0.561/1M $0.814/1M prepaid BYOK
meta-llama/llama-3-8b-instruct
Llama 3 8B Instruct
8,192 1 $0.044/1M $0.044/1M BYOK
meta-llama/llama-3.1-8b-instruct
Meta: Llama 3.1 8B Instruct
131,072 2 $0.022/1M $0.055/1M prepaid BYOK
meta-llama/llama-3.2-3b-instruct
Llama 3.2 3B Instruct
32,768 1 $0.033/1M $0.055/1M BYOK
meta-llama/llama-3.3-70b-instruct
Meta: Llama 3.3 70B Instruct
131,072 2 $0.1485/1M $0.44/1M prepaid BYOK
meta-llama/llama-4-maverick-17b-128e-instruct-fp8
Llama 4 Maverick Instruct
1,048,576 2 $0.297/1M $0.935/1M prepaid BYOK
meta-llama/llama-4-scout-17b-16e-instruct
Llama 4 Scout Instruct
131,072 2 $0.198/1M $0.649/1M prepaid BYOK
microsoft/wizardlm-2-8x22b
Wizardlm 2 8x22B
65,535 2 $0.682/1M $0.682/1M prepaid BYOK
minimax/minimax-m2
MiniMax-M2
204,800 2 $0.33/1M $1.32/1M prepaid BYOK
minimax/minimax-m2.1
Minimax M2.1
IQ 100#51 204,800 2 $0.33/1M $1.32/1M prepaid BYOK
minimax/minimax-m2.5
MiniMax: MiniMax M2.5
IQ 103#43 204,800 2 $0.33/1M $1.32/1M prepaid BYOK
minimax/minimax-m2.5-highspeed
MiniMax M2.5-highspeed
204,800 2 $0.66/1M $2.64/1M prepaid BYOK
minimax/minimax-m2.7
MiniMax M2.7
IQ 105#37 204,800 2 $0.33/1M $1.32/1M prepaid BYOK
minimaxai/minimax-m1-80k
MiniMax M1
1,000,000 2 $0.605/1M $2.42/1M prepaid BYOK
mistralai/mistral-nemo
Mistral: Mistral Nemo
131,072 2 $0.044/1M $0.187/1M prepaid BYOK
moonshotai/kimi-k2-0905
Kimi K2 0905
262,144 2 $0.66/1M $2.75/1M prepaid BYOK
moonshotai/kimi-k2-instruct
Kimi K2 Instruct
IQ 92#70 131,072 2 $0.627/1M $2.53/1M prepaid BYOK
moonshotai/kimi-k2-thinking
Kimi K2 Thinking
IQ 92#70 262,144 2 $0.66/1M $2.75/1M prepaid BYOK
moonshotai/kimi-k2.5
MoonshotAI: Kimi K2.5
IQ 109#29 262,144 2 $0.66/1M $3.3/1M prepaid BYOK
moonshotai/kimi-k2.6
MoonshotAI: Kimi K2.6
IQ 117#11 262,144 2 $0.88/1M $3.74/1M prepaid BYOK
moonshotai/kimi-k2.7-code
MoonshotAI: Kimi K2.7 Code
IQ 116#13 262,144 2 $1.045/1M $4.4/1M prepaid BYOK
nousresearch/hermes-2-pro-llama-3-8b
Hermes 2 Pro Llama 3 8B
8,192 1 $0.154/1M $0.154/1M BYOK
openai/gpt-oss-120b
OpenAI: gpt-oss-120b
IQ 95#59 131,072 2 $0.055/1M $0.275/1M prepaid BYOK
openai/gpt-oss-20b
OpenAI: gpt-oss-20b
IQ 92#69 131,072 2 $0.044/1M $0.165/1M prepaid BYOK
paddlepaddle/paddleocr-vl
PaddleOCR-VL
16,384 1 $0.022/1M $0.022/1M BYOK
qwen/qwen-2.5-72b-instruct
Qwen2.5 72B Instruct
131,072 2 $0.418/1M $0.44/1M prepaid BYOK
qwen/qwen-mt-plus
Qwen MT Plus
16,384 2 $0.275/1M $0.825/1M prepaid BYOK
qwen/qwen2.5-7b-instruct
Qwen2.5 7B Instruct
32,000 1 $0.077/1M $0.077/1M BYOK
qwen/qwen2.5-vl-72b-instruct
Qwen: Qwen2.5 VL 72B Instruct
131,072 1 $0.88/1M $0.88/1M BYOK
qwen/qwen3-235b-a22b-fp8
Qwen3 235B A22B
40,960 2 $0.22/1M $0.88/1M prepaid BYOK
qwen/qwen3-235b-a22b-instruct-2507
Qwen3 235B A22B Instruct 2507
131,072 2 $0.099/1M $0.638/1M prepaid BYOK
qwen/qwen3-235b-a22b-thinking-2507
Qwen: Qwen3 235B A22B Thinking 2507
262,144 2 $0.33/1M $3.3/1M prepaid BYOK
qwen/qwen3-30b-a3b-fp8
Qwen3 30B A3B
40,960 1 $0.099/1M $0.495/1M BYOK
qwen/qwen3-32b-fp8
Qwen3 32B
40,960 1 $0.11/1M $0.495/1M BYOK
qwen/qwen3-4b-fp8
Qwen3 4B
128,000 1 $0.033/1M $0.033/1M BYOK
qwen/qwen3-8b-fp8
Qwen3 8B
128,000 1 $0.0385/1M $0.1518/1M BYOK
qwen/qwen3-coder-30b-a3b-instruct
Qwen3 Coder 30b A3B Instruct
160,000 2 $0.077/1M $0.297/1M prepaid BYOK
qwen/qwen3-coder-480b-a35b-instruct
Qwen3 Coder 480B A35B Instruct
262,144 2 $0.418/1M $1.705/1M prepaid BYOK
qwen/qwen3-coder-next
Qwen: Qwen3 Coder Next
262,144 2 $0.22/1M $1.65/1M prepaid BYOK
qwen/qwen3-max
Qwen3 Max
262,144 2 $2.321/1M $9.295/1M prepaid BYOK
qwen/qwen3-next-80b-a3b-instruct
Qwen: Qwen3 Next 80B A3B Instruct
262,144 2 $0.165/1M $1.65/1M prepaid BYOK
qwen/qwen3-omni-30b-a3b-instruct
Qwen3 Omni 30B A3B Instruct
65,536 2 $0.01/1M $0.01/1M prepaid BYOK
qwen/qwen3-omni-30b-a3b-thinking
Qwen3 Omni 30B A3B Thinking
65,536 2 $0.01/1M $0.01/1M prepaid BYOK
qwen/qwen3-vl-235b-a22b-instruct
Qwen: Qwen3 VL 235B A22B Instruct
262,144 2 $0.33/1M $1.65/1M prepaid BYOK
qwen/qwen3-vl-235b-a22b-thinking
Qwen3 VL 235B A22B Thinking
131,072 2 $1.078/1M $4.345/1M prepaid BYOK
qwen/qwen3-vl-30b-a3b-instruct
Qwen: Qwen3 VL 30B A3B Instruct
262,144 2 $0.22/1M $0.77/1M prepaid BYOK
qwen/qwen3.5-122b-a10b
Qwen3.5-122B-A10B
262,144 2 $0.44/1M $3.52/1M prepaid BYOK
qwen/qwen3.5-27b
Qwen: Qwen3.5-27B
262,144 2 $0.33/1M $2.64/1M prepaid BYOK
qwen/qwen3.5-35b-a3b
Qwen: Qwen3.5-35B-A3B
262,144 2 $0.275/1M $2.2/1M prepaid BYOK
qwen/qwen3.5-397b-a17b
Qwen: Qwen3.5 397B A17B
262,144 2 $0.66/1M $3.96/1M prepaid BYOK
qwen/qwen3.6-27b
Qwen: Qwen3.6 27B
IQ 104#41 262,144 2 $0.66/1M $3.96/1M prepaid BYOK
qwen/qwen3.6-35b-a3b
Qwen: Qwen3.6 35B A3B
IQ 96#56 262,144 2 $0.2728/1M $1.6335/1M prepaid BYOK
sao10k/l3-70b-euryale-v2.1
L3 70B Euryale V2.1
8,192 1 $1.628/1M $1.628/1M BYOK
sao10k/l3-8b-lunaris
Sao10k L3 8B Lunaris
8,192 2 $0.055/1M $0.055/1M prepaid BYOK
sao10k/l31-70b-euryale-v2.2
L31 70B Euryale V2.2
8,192 2 $1.628/1M $1.628/1M prepaid BYOK
xiaomimimo/mimo-v2-flash
XiaomiMiMo/MiMo-V2-Flash
IQ 101#48 262,144 1 $0.11/1M $0.33/1M BYOK
xiaomimimo/mimo-v2.5-pro
XiaomiMiMo/MiMo-V2.5-Pro
IQ 112#22 1,048,576 2 $2.2/1M $6.6/1M prepaid BYOK
z-ai/glm-5.2
GLM 5.2
IQ 117#10 1,048,576 2 $1.54/1M $4.84/1M prepaid BYOK
zai-org/autoglm-phone-9b-multilingual
AutoGLM-Phone-9B-Multilingual
65,536 2 $0.0385/1M $0.1518/1M prepaid BYOK
zai-org/glm-4.5
GLM-4.5
131,072 2 $0.66/1M $2.42/1M prepaid BYOK
zai-org/glm-4.5-air
zai-org/glm-4.5-air
131,072 2 $0.143/1M $0.935/1M prepaid BYOK
zai-org/glm-4.5v
GLM 4.5V
65,536 2 $0.66/1M $1.98/1M prepaid BYOK
zai-org/glm-4.6
GLM 4.6
204,800 2 $0.605/1M $2.42/1M prepaid BYOK
zai-org/glm-4.6v
GLM 4.6V
131,072 2 $0.33/1M $0.99/1M prepaid BYOK
zai-org/glm-4.7
GLM-4.7
IQ 102#46 204,800 2 $0.66/1M $2.42/1M prepaid BYOK
zai-org/glm-4.7-flash
GLM-4.7-Flash
200,000 2 $0.077/1M $0.44/1M prepaid BYOK
zai-org/glm-5
GLM-5
IQ 107#34 202,800 2 $1.1/1M $3.52/1M prepaid BYOK
zai-org/glm-5.1
GLM-5.1
IQ 113#19 204,800 2 $1.518/1M $4.84/1M prepaid BYOK

Sign in

Choose a sign in method.