InferX Beta Serverless GPU Inference Platform, Built for Agent-Native Workloads

DeepSeek-Coder-V2-Lite-Instruct

A lightweight coding model designed for efficient code generation and reasoning.
Qwen text text2text reasoning coding

deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct is an instruction-tuned coding model from the DeepSeek-Coder V2 family, optimized for code generation, debugging, and reasoning across multiple programming languages, while maintaining efficient performance for cost-effective deployments and developer tools.

Log in to deploy: this public page shows the catalog model details, but deployment and customization stay behind login.
Log in to deploy

Metadata

Provider
Qwen
Modality
text
API type
text2text
Source
Created
2026-04-06 05:34:45 UTC
Updated
2026-04-13 02:57:13 UTC
Catalog version
2
Visibility
Published

Specifications

Parameters
16.00B
MoE
No
Max model length
32768
Image
vllm/vllm-openai:v0.16.0

Default Deploy Config

GPU count
1
vRAM
40000 MB
Summary
1xGPU 40000 MB

Recommended Use Cases

Model Spec