Commit 1f31a28

fix: temperature + increase output len (#28)
1 parent f552a1b commit 1f31a28

2 files changed: +2 -1 lines changed

balrog/client.py

Lines changed: 1 addition & 0 deletions
@@ -184,6 +184,7 @@ def api_call():
                 return self.client.chat.completions.create(
                     messages=converted_messages,
                     model=self.model_id,
+                    temperature=self.client_kwargs.get("temperature", 0.5),
                     max_tokens=self.client_kwargs.get("max_tokens", 1024),
                 )
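
The hunk above resolves each sampling parameter with `dict.get` and an in-code fallback. A minimal sketch of that lookup follows, assuming `client_kwargs` is a plain dict populated from the `generate_kwargs` section of `config.yaml` (that wiring is not shown in this commit). The helper name `build_request_kwargs` is hypothetical and exists only for illustration.

```python
def build_request_kwargs(client_kwargs):
    """Hypothetical helper mirroring the lookups in the changed call site."""
    return {
        # Falls back to 0.5 only when the key is absent.
        "temperature": client_kwargs.get("temperature", 0.5),
        # Falls back to 1024 only when the key is absent.
        "max_tokens": client_kwargs.get("max_tokens", 1024),
    }


# Assumed config values: these take precedence over the in-code defaults.
from_config = build_request_kwargs({"temperature": 0.0, "max_tokens": 4096})

# With nothing configured, the in-code fallbacks apply.
fallback = build_request_kwargs({})
```

Using `.get(key, default)` rather than `value or default` matters here: an explicit `temperature` of `0.0` is falsy in Python, and `or` would silently replace it with the fallback.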

balrog/config/config.yaml

Lines changed: 1 addition & 1 deletion
@@ -30,7 +30,7 @@ client:
   base_url: http://localhost:8080/v1 # Base URL for the API (if using a local server)
   generate_kwargs:
     temperature: 0.0 # Sampling temperature; 0.0 makes the output deterministic
-    max_tokens: 1024 # Max tokens to generate in the response
+    max_tokens: 4096 # Max tokens to generate in the response
   timeout: 60 # Timeout for API requests in seconds
   max_retries: 5 # Max number of retries for failed API calls
   delay: 2 # Exponential backoff factor between retries in seconds
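
The `max_retries` and `delay` settings above describe retries with exponential backoff, but the retry loop itself is not part of this commit. The sketch below shows one possible reading, assuming the wait before retry `n` is `delay ** n` seconds. Both the helper `call_with_retries` and that formula are assumptions, not the project's confirmed behavior.

```python
import time


def call_with_retries(fn, max_retries=5, delay=2, sleep=time.sleep):
    """Hypothetical helper: call fn, retrying on failure with growing waits."""
    for attempt in range(1, max_retries + 1):
        try:
            return fn()
        except Exception:
            # Out of attempts: surface the last error to the caller.
            if attempt == max_retries:
                raise
            # Assumed schedule: 2, 4, 8, ... seconds when delay is 2.
            sleep(delay ** attempt)
```

The `sleep` parameter is injectable so the schedule can be checked without actually waiting, for example by passing `waits.append` and inspecting the recorded values.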
