Loading the chatglm-6b model with strategy.model_init_context() reports a mismatch #3761
Unanswered
zhangyuanscall
asked this question in
Community | Q&A
Replies: 0 comments
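The loading code is shown below. world_size is 4, and running on a single machine with 4 GPUs reports a mismatch. transformer.word_embeddings.weight has shape [130528, 4096]; after chunk it becomes [130528, 1024] (the variable x in the code), but p has a dimension of 256 (4096 has been split down to 256). chatglm_model_path points to the Hugging Face weights for chatglm.
The error log is as follows: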
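To make the numbers in the description easier to check, here is a small shape-arithmetic sketch. It is not the original loading code and it does not call any library: `chunk_shape` is a hypothetical helper, and splitting along the last dimension is an assumption that simply matches the reported 4096 to 1024 change. It only shows that the observed 256 corresponds to a 16-way split of 4096, which equals world_size squared. That would fit the weight being sharded twice, but the post does not confirm this.

```python
def chunk_shape(shape, num_chunks, dim):
    """Return the shape of one shard when `shape` is split evenly along `dim`.

    Hypothetical helper for illustration only; it mirrors an even chunk split.
    """
    if shape[dim] % num_chunks != 0:
        raise ValueError("dimension is not evenly divisible by num_chunks")
    out = list(shape)
    out[dim] //= num_chunks
    return out


world_size = 4                    # from the post: single machine, 4 GPUs
full_shape = [130528, 4096]       # transformer.word_embeddings.weight

# Assumption: the chunk is taken along the last dimension.
x_shape = chunk_shape(full_shape, world_size, dim=1)

observed_p_dim = 256              # size reported for p in the post
implied_split = full_shape[1] // observed_p_dim

print("x shape after one chunk:", x_shape)
print("split factor implied by p:", implied_split)
print("equals world_size ** 2:", implied_split == world_size ** 2)
```

If the same check is done against the real tensors, comparing `x.shape` and `p.shape` right before the copy should show which side has been divided one time too many.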
