Hello,
I had some questions while reading ZeroBubbleVPipeScheduler, especially schedule W. (As the title)
Can you plz explain exactly how this part is implemented ?

In detailed:
- Where it strips the weight grad ?
- Where is the weight grad calculated? (in schedule B, backward_step ?)