When training with the TD3 and PVPTD3 algorithms from the Stable-Baselines3 library, the vehicle starts spinning from the very beginning of training