xAI: Grok 4.20

● Excellent

by xAI Intelligent Reasoning Agentic Rank #17 of 75

8.8
/ 10

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering...

Choose a model to compare against xAI: Grok 4.20

Specifications

Specifications for xAI: Grok 4.20
AttributeValue
Lab xAI
Tags Intelligent Reasoning Agentic
Overall Score 8.8/10
Release Date 2026-03
Context Window 2,000,000 tokens
Input Price / 1M $1.25
Output Price / 1M $2.50
Input Modalities Text, Image, File
Output Modalities Text

Strengths

  • Industry-leading speed with reasoning capabilities
  • Massive 2M-token context window — largest on the market
  • Lowest hallucination rate claimed across all frontier models
  • Strong agentic tool calling and strict prompt adherence

Weaknesses

  • Reasoning mode adds latency — slower than non-reasoning at same speed tier
  • xAI ecosystem still smaller than OpenAI/Anthropic

Best For

  • Long-context analysis and research
  • Agentic workflows requiring reliable tool use
  • Applications where hallucination reduction is critical

Sources & Further Reading

Related Models

Scores are aggregated from public benchmarks (MMLU, HumanEval, MATH, GSM8K, LMSYS) and normalized to a 1–10 scale. Methodology →