…, caused by double initialization of the optimizer
Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>
Crash logs.
The crash is likely caused by the optimizer being initialized twice when TP is enabled.
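For illustration only, here is a minimal sketch of one general way to guard against wrapping the same optimizer twice. It is not the change made in this PR and does not use any Accelerate API; `WrappedOptimizer` and `Preparer` are hypothetical names invented for the example.

```python
class WrappedOptimizer:
    """Hypothetical stand-in for a framework's optimizer wrapper."""

    def __init__(self, optimizer):
        self.optimizer = optimizer


class Preparer:
    """Hypothetical helper that wraps each optimizer at most once."""

    def __init__(self):
        # Maps id(raw optimizer) -> wrapper. The wrapper keeps a reference
        # to the raw optimizer, so the id cannot be reused while cached.
        self._wrapped = {}

    def prepare(self, optimizer):
        # An object that is already wrapped is returned unchanged.
        if isinstance(optimizer, WrappedOptimizer):
            return optimizer
        key = id(optimizer)
        # Wrap only on the first call; later calls reuse the cached wrapper.
        if key not in self._wrapped:
            self._wrapped[key] = WrappedOptimizer(optimizer)
        return self._wrapped[key]
```

With a guard like this, a second `prepare` call on the same object (for example, once from a TP code path and once from the general path) returns the existing wrapper instead of initializing it again.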
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
The test failure in CI is unrelated to this PR.
Fixes the issue that occurs when running:
accelerate launch --num-processes 4 nd_parallel.py --dp-shard-size 2 --tp-size 2