The training in the paper is roughly divided into four parts, as shown in the following figure:
What I want to know is which training stages the weights provided on the Hugging Face have gone through. For example, is it after action injection training or after SFT.
The training in the paper is roughly divided into four parts, as shown in the following figure:
What I want to know is which training stages the weights provided on the Hugging Face have gone through. For example, is it after action injection training or after SFT.