Tracking advantages can be helpful for debugging unstable runs, we could consider tracking min, max and mean values <img width="825" height="273" alt="Image" src="https://github.com/user-attachments/assets/64f995b2-b4ce-40bb-bd04-204239d37d83" />