GLM-OCR
GLM-OCR is a compact, high-performance multimodal model released in February 2026 by Zhipu AI (Z.ai). It is specifically designed to bridge the gap between traditional OCR (character recognition) and full "Document Understanding" (layout, tables, and reasoning).
Metadata
Provider
zai-org
Modality
multimodal
API type
image2text
Source
huggingface /
zai-org/GLM-OCR
Created
2026-04-06 23:48:30 UTC
Updated
2026-04-20 11:52:08 UTC
Catalog version
2
Visibility
Published
Specifications
Parameters
1.50B
MoE
No
Max model length
2000
Image
inferx/vllm-openai:v0.19.1
Default Deploy Config
GPU count
1
vRAM
24000 MB
Summary
1xGPU 24000 MB
Recommended Use Cases
—