Endpoint gemma-4-31B-it-fp8
Gemma-4-31B-it-FP8 is a state-of-the-art, instruction-tuned dense model from Google, optimized for high-performance inference
Metadata
Log In To Use This Endpoint
This public page shows the published endpoint metadata and integration shape. Log in to get a tenant-scoped endpoint URL, inference API key, and the interactive playground. Log in
Integration
Use these values in Dify, OpenWebUI, Continue, OpenCode, or any OpenAI-compatible client that asks for a base URL, API key, and model name.
https://model.inferx.net/funccall/<tenant>/endpoints/gemma-4-31B-it-fp8/v1
google/gemma-4-31B-it
<INFERENCE_API_KEY>
An inference API key is required for this endpoint. Until one is available, the sample request below keeps the correct request shape and uses a placeholder token.