Z Ai: Z.ai: GLM 5V Turbo

Name: Z.ai: GLM 5V Turbo
Author: Z Ai

by Z Ai Multimodal Coding Agentic Image Input Video Input

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

Choose a model to compare against Z.ai: GLM 5V Turbo

Specifications

Specifications for Z.ai: GLM 5V Turbo
Attribute	Value
Lab	Z Ai
Tags	Multimodal Coding Agentic Image Input Video Input
Release Date	2026-04
Context Window	202,752 tokens
Input Price / 1M	$1.20
Output Price / 1M	$4.00
Input Modalities	Image, Text, Video
Output Modalities	Text

Strengths

Native multimodal vision-language understanding
Strong vision-based coding from screenshots and diagrams
Long-horizon planning with visual context
Agentic task execution across modalities

Weaknesses

Higher price than text-only GLM models
Vision features add latency

Best For

Vision-based coding and UI development
Multimodal agent workflows
Screenshot-to-code tasks

In Depth: Z.ai: GLM 5V Turbo

Summary

Z.ai: GLM 5V Turbo is an AI model from Z Ai.

Released 2026-04. It currently appears in the Overall category on LMRank and 1 other category. It supports Image, Text, Video input and produces Text output, with a context window of 202,752 tokens. Input pricing is $1.20 per 1M tokens and output is $4.00 per 1M tokens on OpenRouter.

Sources & Further Reading

OpenRouter z-ai/glm-5v-turbo

Z Ai: Z.ai: GLM 5V Turbo

Specifications

✓ Strengths

! Weaknesses

★ Best For