InferX Beta Serverless GPU Inference Platform, Built for Agent-Native Workloads

GLM-OCR

GLM-OCR is a compact, high-performance multimodal model released in February 2026 by Zhipu AI (Z.ai). It is specifically designed to bridge the gap between traditional OCR (character recognition) and full "Document Understanding" (layout, tables, and reasoning).
zai-org multimodal image2text
Log in to deploy: this public page shows the catalog model details, but deployment and customization stay behind login.
Log in to deploy

Metadata

Provider
zai-org
Modality
multimodal
API type
image2text
Source
huggingface / zai-org/GLM-OCR
Created
2026-04-06 23:48:30 UTC
Updated
2026-04-20 11:52:08 UTC
Catalog version
2
Visibility
Published

Specifications

Parameters
1.50B
MoE
No
Max model length
2000
Image
inferx/vllm-openai:v0.19.1

Default Deploy Config

GPU count
1
vRAM
24000 MB
Summary
1xGPU 24000 MB

Recommended Use Cases

Model Spec