AI Model Leaderboard

Top 565 AI models ranked by score. Compare providers, pricing, speed and capabilities.

Updated May 31, 2026

This independent leaderboard tracks 565 large language models and ranks them by capability, price and speed. Design for Online aggregates third-party benchmarks with our own editorial testing, refreshing the data daily so the ranking reflects the models you can actually use today, not last year's headlines.

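| # | Model | Provider | Badges | Score | AI Index | Context | Input / 1M | Output / 1M |
|---|-------|----------|--------|-------|----------|---------|------------|-------------|
| 1 | Anthropic: Claude Opus 4.8 | anthropic | New, Top Pick, In-House Pick | 93.4 | 61.4 | 1M | $5.00 | $25.00 |
| 2 | Google: Gemini 3.1 Pro Preview | google | Best for Agents | 92.1 | 57.2 | 1M | $2.00 | $12.00 |
| 3 | OpenAI: GPT-5.5 | openai | Top Pick | 90 | 60.2 | 1.1M | $5.00 | $30.00 |
| 4 | Anthropic: Claude Opus 4.7 | anthropic | | 88.8 | 57.3 | 1M | $5.00 | $25.00 |
| 5 | Qwen: Qwen3.7 Max | qwen | New | 83.9 | 56.6 | 1M | $1.25 | $3.75 |
| 6 | OpenAI: GPT-5.3-Codex | openai | | 83.5 | 53.6 | 400K | $1.75 | $14.00 |
| 7 | Anthropic: Claude Opus 4.6 | anthropic | | 82.6 | 52.9 | 1M | $5.00 | $25.00 |
| 8 | Anthropic: Claude Opus 4.5 | anthropic | | 82.5 | 49.7 | 200K | $5.00 | $25.00 |
| 9 | Google: Gemini 3.5 Flash | google | New | 82.2 | 54.8 | 1M | $1.50 | $9.00 |
| 10 | Google: Gemini 3 Flash Preview | google | | 82 | 35 | 1M | $0.5000 | $3.00 |
| 11 | Google: Gemini 3 Pro Preview | google | | 82 | 41.3 | 1M | $2.00 | $12.00 |
| 12 | Z.ai: GLM 4.6 | z-ai | | 82 | 30.2 | 203K | $0.4300 | $1.74 |
| 13 | DeepSeek: DeepSeek V3.1 | deepseek | | 82 | 27.7 | 164K | $0.2100 | $0.7900 |
| 14 | MoonshotAI: Kimi K2 0711 | moonshotai | | 82 | 26.3 | 131K | $0.5700 | $2.30 |
| 15 | OpenAI: GPT-4o-mini Search Preview | openai | | 82 | 12.6 | 128K | $0.1500 | $0.6000 |
| 16 | Microsoft: Phi 4 | microsoft | | 82 | 10.4 | 16K | $0.0650 | $0.1400 |
| 17 | Mistral Large | mistralai | | 82 | 9.9 | 128K | $2.00 | $6.00 |
| 18 | OpenAI: GPT-5.4 Nano | openai | | 82 | 44 | 400K | $0.2000 | $1.25 |
| 19 | inclusionAI: Ling-2.6-1T | inclusionai | | 82 | 33.6 | 262K | $0.0750 | $0.6250 |
| 20 | GPT-5.5 (high) | OpenAI | Top Pick | 82 | 58.9 | | $5.00 | $30.00 |
| 21 | Step3 VL 10B | StepFun | | 82 | 15.5 | | Free | Free |
| 22 | Gemini 1.5 Flash (Sep ’24) | Google | | 82 | 13.8 | | Free | Free |
| 23 | OLMo 2 32B | Allen Institute for AI | | 82 | 10.6 | | Free | Free |
| 24 | Z.ai: GLM 4.6V | z-ai | | 82 | 17.1 | 131K | $0.3000 | $0.9000 |
| 25 | OpenAI: GPT-5 Codex | openai | | 82 | 44.6 | 400K | $1.25 | $10.00 |
| 26 | OpenAI: gpt-oss-120b (free) | openai | | 82 | 33.3 | 131K | Free | Free |
| 27 | Baidu: ERNIE 4.5 300B A47B | baidu | | 82 | 15 | 131K | $0.2800 | $1.10 |
| 28 | Qwen: Qwen2.5 Coder 7B Instruct | qwen | | 82 | 10 | 33K | $0.0300 | $0.0900 |
| 29 | Anthropic: Claude 3.7 Sonnet (thinking) | anthropic | | 82 | 34.7 | 200K | $3.00 | $15.00 |
| 30 | NVIDIA: Nemotron 3 Nano Omni (free) | nvidia | | 82 | 21.4 | 256K | Free | Free |
| 31 | Ministral 3 14B | Mistral | | 82 | 16 | | $0.2000 | $0.2000 |
| 32 | ERNIE 5.0 Thinking Preview | Baidu | | 82 | 29.1 | | Free | Free |
| 33 | Claude 2.0 | Anthropic | | 82 | 9.1 | | Free | Free |
| 34 | Qwen3 4B (Reasoning) | Alibaba | | 82 | 14.2 | | $0.1100 | $1.26 |
| 35 | Qwen: Qwen3 Coder Next | qwen | | 82 | 28.3 | 262K | $0.1100 | $0.8000 |
| 36 | Qwen: Qwen3 VL 8B Instruct | qwen | | 82 | 14.3 | 256K | $0.0800 | $0.5000 |
| 37 | Z.ai: GLM 4.5 Air | z-ai | | 82 | 23.2 | 131K | $0.1250 | $0.8500 |
| 38 | Anthropic: Claude Opus 4 | anthropic | | 82 | 39 | 200K | $15.00 | $75.00 |
| 39 | DeepSeek: DeepSeek V3 0324 | deepseek | | 82 | 22.3 | 164K | $0.2000 | $0.7700 |
| 40 | NVIDIA: Llama 3.1 Nemotron 70B Instruct | nvidia | | 82 | 13.4 | 131K | $1.20 | $1.20 |
| 41 | Google: Gemma 4 26B A4B | google | | 82 | 27.1 | 262K | $0.0600 | $0.3300 |
| 42 | IBM: Granite 4.1 8B | ibm-granite | | 82 | 12.4 | 131K | $0.0500 | $0.1000 |
| 43 | Inception: Mercury 2 | inception | | 82 | 32.8 | 128K | $0.2500 | $0.7500 |
| 44 | Nova 2.0 Lite (low) | Amazon | | 82 | 24.6 | | $0.3000 | $2.50 |
| 45 | Qwen3.5 0.8B (Reasoning) | Alibaba | | 82 | 10.5 | | $0.0100 | $0.0500 |
| 46 | DeepSeek-V2.5 | DeepSeek | | 82 | 12.3 | | Free | Free |
| 47 | Mistral: Mistral Small Creative | mistralai | | 82 | 10.2 | 33K | $0.1000 | $0.3000 |
| 48 | OpenAI: GPT-4o Audio | openai | | 82 | 12.8 | 128K | $2.50 | $10.00 |
| 49 | Mistral: Devstral Medium | mistralai | | 82 | 18.7 | 131K | $0.4000 | $2.00 |
| 50 | Qwen: Qwen3 4B (free) | qwen | | 82 | 12.5 | 41K | Free | Free |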
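
The Input / 1M and Output / 1M prices are quoted per million tokens, so the cost of a single request depends on how many tokens go in and come out. The sketch below shows that arithmetic. The function name `request_cost_usd` and the token counts in the example are illustrative assumptions, not part of the leaderboard; the prices are taken from the Gemini 3.1 Pro Preview row.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate the cost of one request from per-million-token prices.

    Prices are in US dollars per 1,000,000 tokens, as listed in the
    Input / 1M and Output / 1M columns. Pass 0.0 for models listed as Free.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# Hypothetical request: 20,000 input tokens and 1,500 output tokens
# at $2.00 / 1M input and $12.00 / 1M output.
example_cost = request_cost_usd(20_000, 1_500, 2.00, 12.00)
print(f"${example_cost:.4f}")
```

Because output tokens are usually priced several times higher than input tokens, a model with a low input price can still be expensive for workloads that generate long responses.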

How we rank AI models

The Design for Online AI Model Leaderboard scores 565 models on a single 0–100 scale built from four weighted dimensions: intelligence (reasoning and knowledge benchmarks), technical capability (coding and tool use), content quality (writing and instruction-following) and value (capability per dollar).

Underlying data is aggregated from the OpenRouter API for pricing and availability, Artificial Analysis for intelligence, coding and agentic indices, and the Hugging Face Open LLM Leaderboard for open-model benchmarks. We refresh these sources daily and layer our own editorial review on top, so a model that benchmarks well but is impractical to deploy will not automatically top the table.

Models are grouped into tiers (Frontier, Professional, Specialist, Efficient, Emerging and Legacy) to make like-for-like comparison easier, and newly released models are flagged so you can see what has just landed.

Leaderboard FAQ

How often is the leaderboard updated?

Pricing, availability and benchmark data are synced daily from our sources, and editorial scores are reviewed whenever a significant new model is released.

How is the overall score calculated?

Each model is graded 0–10 on intelligence, technical capability, content quality and value; those dimensions are weighted and combined into the 0–100 overall score used to rank the table.
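
As a rough illustration of that calculation, here is a minimal sketch of a weighted combination. The weights below are hypothetical placeholders, since the actual weighting is not published here, and the names `WEIGHTS` and `overall_score` are assumptions made for this example.

```python
# Hypothetical weights (they only need to be positive; they are
# normalized by their sum). The leaderboard's real weights are not stated.
WEIGHTS = {
    "intelligence": 35,
    "technical_capability": 25,
    "content_quality": 20,
    "value": 20,
}


def overall_score(grades: dict[str, float]) -> float:
    """Combine four 0-10 dimension grades into a 0-100 overall score."""
    if set(grades) != set(WEIGHTS):
        raise ValueError(f"expected grades for exactly: {sorted(WEIGHTS)}")
    for name, grade in grades.items():
        if not 0 <= grade <= 10:
            raise ValueError(f"{name} grade must be between 0 and 10")
    weighted_sum = sum(WEIGHTS[name] * grades[name] for name in WEIGHTS)
    total_weight = sum(WEIGHTS.values())
    # The weighted mean is on the 0-10 scale; multiply by 10 for 0-100.
    return round(10 * weighted_sum / total_weight, 1)


# Made-up grades for an imaginary model.
score = overall_score(
    {
        "intelligence": 9,
        "technical_capability": 8,
        "content_quality": 9,
        "value": 7,
    }
)
print(score)
```

Under any such weighting, a model can score highly on intelligence and still rank lower overall if its value grade is weak.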

Where does the data come from?

From the OpenRouter API, Artificial Analysis and the Hugging Face Open LLM Leaderboard, combined with hands-on editorial testing by the Design for Online team.