Fixed issues with mistral-common update #112

steffi4321 · 2025-07-23T17:19:47Z

I think this fixes issue #110.

A commit in mistral-common moved InstructTokenizerBase to another file (see commit mistral-ai/mistral-common@2c9f1762f8824e5d821840414ecb1b9e5267ffb3).
This requires fixing the import of InstructTokenizerBase in two files:

1. In File "mistral-finetune/finetune/data/dataset.py", line 15, change
from mistral_common.tokens.tokenizers.sentencepiece import InstructTokenizerBase to
from mistral_common.tokens.tokenizers.instruct import InstructTokenizerBase
2. In File "mistral-finetune/finetune/data/tokenize.py", line 25, change
from mistral_common.tokens.tokenizers.sentencepiece import InstructTokenizerBase to
from mistral_common.tokens.tokenizers.instruct import InstructTokenizerBase
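
If the code should keep working with both older and newer mistral-common releases, a fallback import could be used instead of a hard switch. This is only a sketch and not part of this PR: `import_attr` is a hypothetical helper, and the only names taken from mistral-common are the two module paths and InstructTokenizerBase quoted above.

```python
import importlib

# Module paths quoted in this PR: new location first, old location as fallback.
NEW_MODULE = "mistral_common.tokens.tokenizers.instruct"
OLD_MODULE = "mistral_common.tokens.tokenizers.sentencepiece"


def import_attr(name, module_paths):
    """Return attribute `name` from the first importable module that defines it.

    Hypothetical helper for illustration; raises ImportError if no candidate
    module can be imported or none of them defines `name`.
    """
    for path in module_paths:
        try:
            module = importlib.import_module(path)
        except ImportError:
            continue  # module missing in this version, try the next candidate
        if hasattr(module, name):
            return getattr(module, name)
    raise ImportError(f"{name!r} not found in any of: {', '.join(module_paths)}")


try:
    InstructTokenizerBase = import_attr(
        "InstructTokenizerBase", [NEW_MODULE, OLD_MODULE]
    )
except ImportError:
    # Assumption: reached only when mistral-common is missing or has neither path.
    InstructTokenizerBase = None
```

Pinning the mistral-common version in the requirements would be the simpler alternative if supporting only one release is acceptable.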

After that, I got another error caused by the updated mistral-common, which I fixed with two more edits:
3. In File "mistral-finetune/finetune/data/tokenize.py", line 180, change validator.validate_messages(messages) to validator.validate_messages(messages, False)
4. In File "mistral-finetune/finetune/data/tokenize.py", line 330, change curr_tokens = instruct_tokenizer.encode_assistant_message(message, is_before_last_user_message=False) to curr_tokens = instruct_tokenizer.encode_assistant_message(message, is_before_last_user_message=False, continue_message=False)

For fixes 3 and 4, I compared the old mistral-common code with the new code, and I am confident that this is the intended behavior.

mohr2@gpu added 2 commits July 23, 2025 19:16

fixed issues with mistral-common update

44d0bd1

some more issues with updated mistral-common

bf0fec5

kmk142789 approved these changes Oct 12, 2025
