You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi authors, thank you for your excellent work of ReasonFlux-PRM!
Would you please provide some details on how to segment trajectories into several steps and how the alignment between the trajectory and the model response is established?