Skip to content

Commit 0164e69

Browse files
committed
Increase the max_tokens limit to address incomplete generated responses.
1 parent ce6f41a commit 0164e69

1 file changed

Lines changed: 1 addition & 1 deletion

File tree

search_engine/chatbot/llm_utilities.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -31,7 +31,7 @@ def __init__(self, model_name="meta-llama-3.1-70b-instruct", use_gpu=True):
3131
f"KISSKI LLM configured with model '{self.model_name}'. GPU usage = {self.use_gpu}."
3232
)
3333

34-
def generate_response(self, prompt, max_new_tokens=150, num_return_sequences=1):
34+
def generate_response(self, prompt, max_new_tokens=500, num_return_sequences=1):
3535
"""
3636
Generate a response from the KISSKI LLM service using the new openai>=1.0.0 Chat interface.
3737
Args:

0 commit comments

Comments
 (0)