OpenAI compatible API. Attested gateway. Public status.

DeepSeek: DeepSeek V4 Pro Benchmarks

Name: DeepSeek: DeepSeek V4 Pro
Brand: DeepSeek
Price: 0.478500 USD
Availability: InStock

Benchmark and measurement links for DeepSeek: DeepSeek V4 Pro, with TrustedRouter route data first.

Verify gateway

1 URLbase_url migration

100smodels and routes

0prompt logs by default

`deepseek/deepseek-v4-pro`

open weights Benchmarks

All models

AI IQ IQ 109 #28 public AI IQ rank for deepseek-v4-pro

View AI IQ profile

Published benchmark scores

Benchmark scores for DeepSeek: DeepSeek V4 Pro — every row links to its source, and a score is only ever attached to the exact checkpoint it was measured on. Vendor model-card and open-leaderboard numbers are cited, not run by us. Rows marked TrustedRouter · replays published are our own runs of this model through the gateway, with the full per-item replay published in trustedrouter-benchmarks so anyone can re-grade them.

Benchmark	Category	Score	Source
Aider Polyglot 34 Exercism exercises (Python), pass@1, real unit tests (no judge)	Coding	20.6%	TrustedRouter Benchmarks replay 2026-06-18
SimpleQA Verified 250 closed-book questions, no tools; GPT-4.1 autorater (Google's exact prompt); 32768-token budget	Factuality	55.1%	TrustedRouter Benchmarks replay 2026-06-18
IFEval 100-prompt subset, 0-shot; Google's deterministic verifiers (no judge); score = avg of strict/loose x prompt/instruction	Instruction following	36.2%	TrustedRouter Benchmarks replay 2026-06-18
MMLU-Pro 200-question stride-sampled subset (TIGER-Lab/MMLU-Pro), 10-choice CoT, letter-match; no judge	Knowledge	83.3%	TrustedRouter Benchmarks replay 2026-06-18
GSM8K 30-problem subset, deterministic numeric match (no judge); near-saturated, kept as a sanity check	Math	100.0%	TrustedRouter Benchmarks replay 2026-06-18

TrustedRouter measurements

TrustedRouter publishes route and status measurements without storing prompt or output content. Provider latency and uptime are exposed through the model performance and uptime pages.

External benchmark references

TrustedRouter performance pageTrustedRouter measurement
TrustedRouter uptime pageTrustedRouter measurement
AI IQ profile · IQ 109Independent model IQ score
DeepSeek API docsOfficial model information
LMArena leaderboardIndependent benchmark index
LiveBenchIndependent benchmark index
Artificial Analysis modelsIndependent benchmark index
HELMIndependent benchmark index