2 files changed: +2 -1 lines changed
@@ -184,6 +184,7 @@ def api_call():
             return self.client.chat.completions.create(
                 messages=converted_messages,
                 model=self.model_id,
+                temperature=self.client_kwargs.get("temperature", 0.5),
                 max_tokens=self.client_kwargs.get("max_tokens", 1024),
             )
 
@@ -30,7 +30,7 @@ client:
   base_url: http://localhost:8080/v1  # Base URL for the API (if using a local server)
   generate_kwargs:
     temperature: 0.0  # Sampling temperature; 0.0 makes the output deterministic
-    max_tokens: 1024  # Max tokens to generate in the response
+    max_tokens: 4096  # Max tokens to generate in the response
   timeout: 60  # Timeout for API requests in seconds
   max_retries: 5  # Max number of retries for failed API calls
   delay: 2  # Exponential backoff factor between retries in seconds
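
For reviewers, a minimal sketch of how the two hunks might interact, assuming the `generate_kwargs` mapping from the config ends up as `self.client_kwargs` (the diff does not show that wiring). `resolve_generation_kwargs` is a hypothetical helper, not part of the change; it only mirrors the `.get(...)` fallbacks from the first hunk. Note that `dict.get` returns a configured `0.0` as-is, so the `0.5` default applies only when the key is absent.

```python
def resolve_generation_kwargs(client_kwargs):
    """Hypothetical helper mirroring the .get(...) fallbacks in api_call()."""
    return {
        # dict.get falls back only when the key is missing,
        # so an explicit temperature of 0.0 is preserved.
        "temperature": client_kwargs.get("temperature", 0.5),
        "max_tokens": client_kwargs.get("max_tokens", 1024),
    }


# Values taken from the config hunk (assumed to be passed as client_kwargs).
configured = resolve_generation_kwargs({"temperature": 0.0, "max_tokens": 4096})

# No generate_kwargs supplied: the in-code defaults apply.
defaults = resolve_generation_kwargs({})

print(configured)
print(defaults)
```

Under that assumption, the in-code defaults (`0.5`, `1024`) differ from the values in the sample config (`0.0`, `4096`), so behavior depends on whether the config keys are present.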