gemma-4-E2B-it
An efficient Gemma 4 model optimized for strong performance with lower resource usage
google/gemma-4-E2B-it is an instruction-tuned model from Google's Gemma 4 family using an efficient E2B architecture to deliver strong reasoning, coding, and conversational performance while reducing compute and memory requirements, making it ideal for cost-efficient production deployments and scalable agents
Metadata
Provider
google
Modality
multimodal
API type
image2text
Source
huggingface /
google/gemma-4-E2B-it
Created
2026-04-04 00:01:00 UTC
Updated
2026-04-13 16:49:51 UTC
Catalog version
4
Visibility
Published
Specifications
Parameters
—
MoE
No
Max model length
20000
Image
vllm/vllm-openai:gemma4
Default Deploy Config
GPU count
1
vRAM
26000 MB
Summary
1xGPU 26000 MB
Recommended Use Cases
—