LMRank Models

Discover the models our community benchmarks

Track latency, context, pricing, and community sentiment across the most talked about LLMs.

OP
openai • text->text
131K tokens

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) archit...

Input Price $0.04/1M
Context 131K tokens
AN
anthropic • text+image->text
200K tokens

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks....

Input Price $15.00/1M
Context 200K tokens
Vision Moderated
Details
QW
262K tokens

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for...

Input Price $0.07/1M
Context 262K tokens
QW
qwen • text->text
262K tokens

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It...

Input Price $0.07/1M
Context 262K tokens
Z-
z-ai • text->text
131K tokens

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) archite...

Input Price $0.41/1M
Context 131K tokens
Z-
z-ai • text->text
131K tokens

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5,...

Input Price $0.00/1M
Context 131K tokens
Z-
z-ai • text->text
131K tokens

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5,...

Input Price $0.14/1M
Context 131K tokens
QW
262K tokens

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tas...

Input Price $0.10/1M
Context 262K tokens
Z-
z-ai • text->text
128K tokens

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabiliti...

Input Price $0.10/1M
Context 128K tokens
QW
262K tokens

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic c...

Input Price $0.00/1M
Context 262K tokens
QW
qwen • text->text
262K tokens

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic c...

Input Price $0.25/1M
Context 262K tokens
GO
google • text+image->text
1.0M tokens

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It off...

Input Price $0.10/1M
Context 1.0M tokens
Vision
Details