35 labs 105 models ranked

Anthropic

9 models

Top Claude Opus 4.8 at 9.7/10 · lineup avg 9.06/10

Claude Opus 4.8 9.7 Claude Opus 4.7 9.6 Claude Fable 5 9.6 +6 more

OpenAI

11 models

Top GPT-5.5 Pro at 9.5/10 · lineup avg 8.59/10

GPT-5.5 Pro 9.5 GPT-5.5 9.4 GPT-5 9.3 +8 more

Z Ai

4 models

Top Z.ai: GLM 5.1 at 9.1/10 · lineup avg 8.82/10

Z.ai: GLM 5.1 9.1 Z.ai: GLM 5 8.9 Z.ai: GLM 5V Turbo 8.7 +1 more

Alibaba Cloud

6 models

Top Qwen3.7 Max at 9.0/10 · lineup avg 8.08/10

Qwen3.7 Max 9.0 Qwen3.6 Max Preview 8.9 Qwen3.6 35B A3B 8.5 +3 more

DeepSeek

4 models

Top DeepSeek V4 Pro at 9.0/10 · lineup avg 8.35/10

DeepSeek V4 Pro 9.0 DeepSeek R1 8.5 DeepSeek V4 Flash 8.4 +1 more

Google

4 models

Top Gemini 3.1 Pro Preview at 8.8/10 · lineup avg 7.25/10

Gemini 3.1 Pro Preview 8.8 Google: Gemma 4 31B 7.8 Gemini 3.1 Flash Lite Preview 6.4 +1 more

Qwen

10 models

Top Qwen: Qwen3 235B A22B at 8.8/10 · lineup avg 7.36/10

Qwen: Qwen3 235B A22B 8.8 Qwen: Qwen3.7 Plus 8.5 Qwen3.6 Plus 8.2 +7 more

xAI

6 models

Top Grok 4.20 at 8.8/10 · lineup avg 8.18/10

Grok 4.20 8.8 xAI: Grok 4.20 Multi-Agent 8.6 Grok 4.3 8.5 +3 more

Deepcogito

1 model

Top Deep Cogito: Cogito v2.1 671B at 8.7/10 · lineup avg 8.7/10

Deep Cogito: Cogito v2.1 671B 8.7

Meta

3 models

Top Llama 4 405B at 8.7/10 · lineup avg 7.43/10

Llama 4 405B 8.7 Llama 4 Scout 7.8 Llama 3.3 70B 5.8

Tencent

2 models

Top Tencent: Hy3 preview at 8.6/10 · lineup avg 8.4/10

Tencent: Hy3 preview 8.6 Tencent: Hunyuan A13B Instruct 8.2

Google DeepMind

3 models

Top Gemini Pro 2 at 8.5/10 · lineup avg 8.03/10

Gemini Pro 2 8.5 Gemini 3.5 Flash 8.4 Gemini Flash 2 7.2

MiniMax

3 models

Top MiniMax M3 at 8.5/10 · lineup avg 7.9/10

MiniMax M3 8.5 MiniMax M2.7 8.0 MiniMax M2.5 7.2

Mistral AI

8 models

Top Mistral: Mistral Large 3 2512 at 8.5/10 · lineup avg 7.97/10

Mistral: Mistral Large 3 2512 8.5 Mistral: Codestral 2508 8.4 Mistral Large 3 8.3 +5 more

Moonshotai

2 models

Top Kimi K2.7 Code at 8.5/10 · lineup avg 8.45/10

Kimi K2.7 Code 8.5 MoonshotAI: Kimi K2 0905 8.4

Nvidia

4 models

Top NVIDIA: Nemotron 3 Ultra at 8.5/10 · lineup avg 8.08/10

NVIDIA: Nemotron 3 Ultra 8.5 NVIDIA: Nemotron 3 Super 8.1 NVIDIA: Llama 3.3 Nemotron Super 49B V1.5 8.1 +1 more

Xiaomi

2 models

Top Xiaomi: MiMo-V2.5-Pro at 8.5/10 · lineup avg 8.35/10

Xiaomi: MiMo-V2.5-Pro 8.5 Xiaomi: MiMo-V2.5 8.2

Amazon

2 models

Top Amazon: Nova Premier 1.0 at 8.4/10 · lineup avg 8.3/10

Amazon: Nova Premier 1.0 8.4 Amazon: Nova 2 Lite 8.2

Bytedance Seed

2 models

Top ByteDance Seed: Seed-2.0-Lite at 8.4/10 · lineup avg 8.15/10

ByteDance Seed: Seed-2.0-Lite 8.4 ByteDance Seed: Seed-2.0-Mini 7.9

Inclusionai

3 models

Top inclusionAI: Ring-2.6-1T at 8.4/10 · lineup avg 8.13/10

inclusionAI: Ring-2.6-1T 8.4 inclusionAI: Ling-2.6-1T 8.2 inclusionAI: Ling-2.6-flash 7.8

Moonshot

1 model

Top Kimi K2.6 at 8.4/10 · lineup avg 8.4/10

Kimi K2.6 8.4

Nousresearch

1 model

Top Nous: Hermes 4 405B at 8.4/10 · lineup avg 8.4/10

Nous: Hermes 4 405B 8.4

Perplexity

1 model

Top Perplexity: Sonar Pro Search at 8.4/10 · lineup avg 8.4/10

Perplexity: Sonar Pro Search 8.4

Prime Intellect

1 model

Top Prime Intellect: INTELLECT-3 at 8.3/10 · lineup avg 8.3/10

Prime Intellect: INTELLECT-3 8.3

Writer

1 model

Top Writer: Palmyra X5 at 8.3/10 · lineup avg 8.3/10

Writer: Palmyra X5 8.3

Ai21

1 model

Top AI21: Jamba Large 1.7 at 8.2/10 · lineup avg 8.2/10

AI21: Jamba Large 1.7 8.2

Upstage

1 model

Top Upstage: Solar Pro 3 at 8.2/10 · lineup avg 8.2/10

Upstage: Solar Pro 3 8.2

Inception

1 model

Top Inception: Mercury 2 at 8.1/10 · lineup avg 8.1/10

Inception: Mercury 2 8.1

Kwaipilot

1 model

Top Kwaipilot: KAT-Coder-Pro V2 at 8.1/10 · lineup avg 8.1/10

Kwaipilot: KAT-Coder-Pro V2 8.1

Cohere

2 models

Top Cohere: Command A at 8.0/10 · lineup avg 7.4/10

Cohere: Command A 8.0 Command R+ 6.8

Nex Agi

1 model

Top Nex AGI: Nex-N2-Pro at 7.8/10 · lineup avg 7.8/10

Nex AGI: Nex-N2-Pro 7.8

Liquid

1 model

Top LiquidAI: LFM2-24B-A2B at 7.5/10 · lineup avg 7.5/10

LiquidAI: LFM2-24B-A2B 7.5

Liquid AI

1 model

Top LFM2.5-8B-A1B at 7.5/10 · lineup avg 7.5/10

LFM2.5-8B-A1B 7.5

StepFun

1 model

Top Step 3.7 Flash at 7.5/10 · lineup avg 7.5/10

Step 3.7 Flash 7.5

Microsoft

1 model

Top Phi-4 at 6.5/10 · lineup avg 6.5/10

Phi-4 6.5