GLM 4.5V

by Z.ai

GLM 4.5V is Zhipu's multimodal model built on the GLM-4.5-Air architecture, pairing 106 billion total parameters (12 billion active) with native vision understanding for broad visual reasoning. Released August 2025, it reports state-of-the-art results across roughly 41 to 42 public multimodal benchmarks, covering image question answering, video understanding, document parsing, and GUI automation. The model's 65K multimodal context window and 3D-RoPE vision encoder let it process high-resolution images, extreme aspect ratios, video, and text-heavy documents together, using 3D convolutions for efficient video handling. Native multimodal function calling allows images and screenshots to be passed directly as tool parameters, simplifying agentic integration, and a hybrid thinking mode trades reasoning depth against latency across single-image and multi-image tasks. With English and Chinese support and an MIT license, GLM 4.5V suits front-end web coding, document intelligence, accessibility systems, and multimodal agents, keeping the MoE efficiency of GLM-4.5-Air for vision plus tool-driven reasoning at modest cost.

Key info

Input
Output
Features
Context window
66K
Max output
16K
Input price
$0.60 /1M
Output price
$1.80 /1M
  • US residency available
  • Zero data retention via Enterprise
  • No training by default
  • GDPR DPA available

Available routes

GLM 4.5V runs on 1 route through the Opper gateway. Compare residency, ZDR, and training posture at a glance β€” full data-handling detail per route below.

ProviderRegionZero data retentionTrainingInputOutput
USEnterpriseNo$0.60$1.80

Training posture across routes: No training on prompts by default.

Data handling per route

Each route hosting GLM 4.5V has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

Novita β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Abuse monitoring
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
SCCs

Benchmarks

Independent benchmark scores β€” composite indices for reasoning, coding, and math, plus individual eval scores where available.

Global rank#399 of 534 LLMs
TierEfficient
Output speed47 tok/s
First token10.90s
Intelligence Index7.0
Math Index15.3
Reasoning & knowledge
MMLU-Pro
75%
GPQA Diamond
57%
Humanity's Last Exam
4%
Long-context reasoning
0%
Coding
LiveCodeBench
35%
SciCode
19%
Agentic & tool use
Terminal-Bench Hard
7%
τ²-Bench Telecom
20%
Math & instruction following
AIME 2025
15%
IFBench
29%

Get started

Call GLM 4.5V through the Opper gateway with one API key. Let your coding agent set it up, or call it directly β€” Opper is drop-in compatible with the OpenAI, Anthropic, and Google AI SDKs.

Set it up with your agent

Copy this and paste it into your coding agent β€” Claude Code, Cursor, Codex, and more β€” and it'll wire up Opper for you.

Or call it directly

import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.OPPER_API_KEY,
baseURL: "https://api.opper.click/v3/compat",
});
const completion = await client.chat.completions.create({
model: "novita/zai-org/glm-4.5v",
messages: [{ role: "user", content: "Hello" }],
});
console.log(completion.choices[0].message.content);

Compare GLM 4.5V with…

Side-by-side on privacy, EU hosting, pricing, and benchmarks.

Other models from Z.ai

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.

Get startedView Documentation
GLM 4.5V by Z.ai β€” pricing, benchmarks | Opper AI