ValueError: Checkpoint model type 'deepseek_v32' not recognized by Transformers. #1266

@chensuyue

Description

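Quantizing the models listed below with AutoRound fails at model load time: the installed Transformers version does not recognize the `deepseek_v32` model type.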

Impacted model list

deepseek-ai/DeepSeek-Math-V2
deepseek-ai/DeepSeek-V3.2
deepseek-ai/DeepSeek-V3.2-Speciale

Issue details

Traceback (most recent call last):
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/transformers/models/auto/configuration_auto.py", line 1325, in from_pretrained
    config_class = CONFIG_MAPPING[config_dict["model_type"]]
                   ~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/transformers/models/auto/configuration_auto.py", line 1028, in __getitem__
    raise KeyError(key)
KeyError: 'deepseek_v32'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/home/uttest/miniforge3/envs/autoround_test/bin/auto-round", line 10, in <module>
    sys.exit(run())
             ^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/auto_round/__main__.py", line 869, in run
    tune(args)
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/auto_round/__main__.py", line 600, in tune
    autoround: BaseCompressor = AutoRound(
                                ^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/auto_round/autoround.py", line 177, in __new__
    ar = dynamic_compressor(
         ^^^^^^^^^^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/auto_round/compressors/base.py", line 230, in __init__
    model, tokenizer = llm_load_model(
                       ^^^^^^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/auto_round/utils/model.py", line 291, in llm_load_model
    model = model_cls.from_pretrained(
            ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/transformers/models/auto/auto_factory.py", line 318, in from_pretrained
    config, kwargs = AutoConfig.from_pretrained(
                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/uttest/miniforge3/envs/autoround_test/lib/python3.12/site-packages/transformers/models/auto/configuration_auto.py", line 1327, in from_pretrained
    raise ValueError(
ValueError: The checkpoint you are trying to load has model type `deepseek_v32` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

You can update Transformers with the command `pip install --upgrade transformers`. If this does not work, and the checkpoint is very new, then there may not be a release version that supports this model yet. In this case, you can get the most up-to-date code by installing Transformers from source with the command `pip install git+https://github.com/huggingface/transformers.git`
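
The traceback shows the failure starting as a `KeyError` in the `CONFIG_MAPPING` lookup inside `configuration_auto.py`. A minimal pre-flight sketch of that same check follows, so the mismatch can be detected before a quantization run starts. The helper names (`read_model_type`, `is_recognized`, `installed_model_types`) are hypothetical and not part of AutoRound or Transformers, and the sketch assumes the checkpoint's `config.json` is already available locally.

```python
import json
from pathlib import Path


def read_model_type(config_path):
    """Return the `model_type` declared in a checkpoint's config.json (None if absent)."""
    with Path(config_path).open(encoding="utf-8") as f:
        return json.load(f).get("model_type")


def is_recognized(model_type, known_types):
    """True if `model_type` is among the keys that can be mapped to a config class."""
    return model_type in known_types


def installed_model_types():
    """Return Transformers' CONFIG_MAPPING, or None if Transformers is not installed.

    CONFIG_MAPPING is the mapping that raises the KeyError in the traceback;
    it supports `in` checks on model type strings.
    """
    try:
        from transformers.models.auto.configuration_auto import CONFIG_MAPPING
    except ImportError:
        return None
    return CONFIG_MAPPING
```

With these helpers, `is_recognized(read_model_type("config.json"), installed_model_types())` returning `False` would suggest that the installed Transformers release predates the architecture, which is the situation the error message describes. Guard against `installed_model_types()` returning `None` before making that call.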
