xAI: Grok 4.20
● Excellent8.8
/ 10
Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherance, delivering...
Specifications
| Attribute | Value |
|---|---|
| Lab | xAI |
| Tags | Intelligent Reasoning Agentic |
| Overall Score | 8.8/10 |
| Release Date | 2026-03 |
| Context Window | 2,000,000 tokens |
| Input Price / 1M | $1.25 |
| Output Price / 1M | $2.50 |
| Input Modalities | Text, Image, File |
| Output Modalities | Text |
Strengths
- Industry-leading speed with reasoning capabilities
- Massive 2M-token context window — largest on the market
- Lowest hallucination rate claimed across all frontier models
- Strong agentic tool calling and strict prompt adherence
Weaknesses
- Reasoning mode adds latency — slower than non-reasoning at same speed tier
- xAI ecosystem still smaller than OpenAI/Anthropic
Best For
- Long-context analysis and research
- Agentic workflows requiring reliable tool use
- Applications where hallucination reduction is critical
Sources & Further Reading
Related Models
Scores are aggregated from public benchmarks (MMLU, HumanEval, MATH, GSM8K, LMSYS) and normalized to a 1–10 scale. Methodology →