Create a chat completion for given messages with streaming support using Google’s Gemma3 model
JWT token for authentication - Use your API token as the Bearer token
Model UUID for the request (Gemma3 model identifier)
Array of messages in the conversation
Model identifier for Gemma3
gemma3 Whether to stream the response
Sampling temperature (0.0 to 2.0)
0 <= x <= 2Maximum number of tokens to generate
1 <= x <= 8192Nucleus sampling parameter
0 <= x <= 1Top-k sampling parameter
1 <= x <= 100Sequences where the API will stop generating
Frequency penalty to reduce repetition
-2 <= x <= 2Presence penalty to encourage topic diversity
-2 <= x <= 2Repetition penalty specific to Gemma models
0.1 <= x <= 2Successful chat completion
Response for non-streaming chat completion
Unique identifier for the completion
"chatcmpl-abc123"
Object type
chat.completion "chat.completion"
Unix timestamp of when the completion was created
1699014493
The model used for completion
"gemma3"