
Fix for issue #19933: langchain_huggingface #25136

Open · wants to merge 6 commits into base: master
Conversation

Soumil32 commented Aug 7, 2024

This partially fixes issue #19933, in which LangChain was not taking the steps needed to apply the correct formatting for different chat models.

The change involves passing the prompt to the pipeline in the ChatML format. This way the pipeline can automatically apply the correct chat formatting for the model.
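As a rough sketch of the idea (the tuple-based messages and helper below are stand-ins for illustration, not the actual langchain_core classes or the PR's code):

```python
# Hypothetical sketch: mapping LangChain-style messages to the
# role/content dicts that a transformers text-generation pipeline
# accepts as chat input, so the pipeline can apply the model's
# chat template itself.
ROLE_MAP = {"system": "system", "human": "user", "ai": "assistant"}

def to_chat_format(messages):
    """Map (message_type, content) pairs to pipeline chat dicts."""
    return [{"role": ROLE_MAP[kind], "content": text} for kind, text in messages]

chat = to_chat_format([
    ("system", "You are a translator from English to French."),
    ("human", "Hi, how are you?"),
])
print(chat[1])  # {'role': 'user', 'content': 'Hi, how are you?'}
```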

In addition, the argument return_full_text=False is now passed to the pipeline in huggingface_pipeline.py. This causes the pipeline to return only the newly generated text, whereas the default behaviour is to return the entire chat history, which we do not need. The output can then be parsed back into an AIMessage.
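A toy stand-in (not the transformers internals) to illustrate what return_full_text controls:

```python
# Toy illustration of the return_full_text flag on a text-generation
# pipeline: with True the output echoes the formatted prompt plus the
# generation; with False only the newly generated text comes back.
def pipeline_output(prompt, completion, return_full_text=True):
    full = prompt + completion
    return full if return_full_text else full[len(prompt):]

prompt = "<|user|>Hi, how are you?<|assistant|>"
out = pipeline_output(prompt, "Bonjour, comment allez-vous ?", return_full_text=False)
print(out)  # Bonjour, comment allez-vous ?
```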

This still only works when ChatHuggingFace is used. If either model_id or tokenizer is not set when initialising ChatHuggingFace, it defaults to the gpt2 tokenizer, which is what caused the issue in the first place. The changes I have made correctly handle parsing the model response back into an AIMessage. The initial issue may still need investigating, but this should work, as the tokenizer should be picked up from the pipeline passed into HuggingFacePipeline.

from langchain_huggingface import ChatHuggingFace, HuggingFacePipeline
from langchain_core.messages import HumanMessage, SystemMessage
from transformers import pipeline, AutoTokenizer, AutoModelForCausalLM
import torch  # needed for torch.bfloat16 below

model_id = "hugging-quants/Meta-Llama-3.1-8B-Instruct-BNB-NF4"  # replace with your model
tokenizer = AutoTokenizer.from_pretrained(model_id)
terminators = [
    tokenizer.eos_token_id,
    tokenizer.convert_tokens_to_ids("<|eot_id|>")
]
model = AutoModelForCausalLM.from_pretrained(
  model_id,
  torch_dtype=torch.bfloat16,
  low_cpu_mem_usage=True,
  device_map="auto",
)

pipe = pipeline(
    "text-generation", model=model, tokenizer=tokenizer, max_new_tokens=40, eos_token_id=terminators, pad_token_id=tokenizer.eos_token_id
)

messages = [
    SystemMessage(content="You are a translator which converts the users input from English to French. Never refer to yourself. Just provide the translated text."),
    HumanMessage(content="Hi, how are you?")
]


hf = HuggingFacePipeline(pipeline=pipe)
chat_model = ChatHuggingFace(llm=hf, model_id=model_id)

response = chat_model.invoke(messages)

print(response)

Sorry if my language isn't clear. This is my first time contributing to an OSS project. If you have any questions or clarification is needed, please say so! 😊

Edit: Did some more testing, and my changes fix the issue altogether rather than partially! model_id and tokenizer no longer need to be set.

@efriis efriis added the partner label Aug 7, 2024
@efriis efriis self-assigned this Aug 7, 2024
@dosubot dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Aug 7, 2024

@Soumil32 changed the title from "Partial fix for issue [#19933](https://github.com/langchain-ai/langchain/issues/19933): langchain_huggingface" to "Partial fix for issue #19933: langchain_huggingface" Aug 7, 2024
@dosubot dosubot bot added langchain Related to the langchain package 🔌: huggingface Primarily related to HuggingFace integrations 🤖:improvement Medium size change to existing code to handle new use-cases labels Aug 7, 2024
@ccurme ccurme removed the langchain Related to the langchain package label Aug 7, 2024
@Soumil32 changed the title from "Partial fix for issue #19933: langchain_huggingface" to "Fix for issue #19933: langchain_huggingface" Aug 7, 2024
Soumil32 commented Aug 8, 2024

@efriis what's the status on this?

Soumil32 commented Aug 8, 2024

This also seems to fix issues #24437 and #22487.

Soumil32 commented

@ccurme Is there anything i still need to do?

ccurme (Collaborator) left a comment:
Could we add some tests to demonstrate the updated behavior?

@@ -269,6 +270,7 @@ def _generate(
        # Process batch of prompts
        responses = self.pipeline(
            batch_prompts,
+           return_full_text=False,
Collaborator commented on this line:

Can we do this in a separate issue / PR?

@ccurme ccurme added the needs test PR needs to be updated with tests label Aug 23, 2024
@efriis efriis assigned ccurme and unassigned efriis Aug 24, 2024
@efriis efriis self-assigned this Sep 23, 2024
Status: In review · 3 participants