InferX Beta Serverless GPU Inference Platform, Built for Agent-Native Workloads

Z-Image-Turbo

Z-Image-Turbo is a 6-billion parameter text-to-image model released by Alibaba's Tongyi Lab (the team behind Qwen) in late 2025. It was specifically engineered to challenge the dominance of larger models like FLUX.1 by prioritizing extreme inference speed and bilingual text rendering without sacrificing photorealism.
Tongyi-MAI image text2img
Log in to deploy: this public page shows the catalog model details, but deployment and customization stay behind login.
Log in to deploy

Metadata

Provider
Tongyi-MAI
Modality
image
API type
text2img
Source
huggingface / Tongyi-MAI/Z-Image-Turbo
Created
2026-04-07 00:43:50 UTC
Updated
2026-04-07 00:43:50 UTC
Catalog version
1
Visibility
Published

Specifications

Parameters
6.00B
MoE
No
Max model length
Image
vllm/vllm-omni:v0.14.0

Default Deploy Config

GPU count
1
vRAM
45000 MB
Summary
1xGPU 45000 MB

Recommended Use Cases

Model Spec