`PendulumEnv` is calculating clamped torque, but using unclamped torque in subsequent calculations. i.e. we calculate: ```julia v = clamp.(u, -env.max_torque, env.max_torque) ``` but we don't use `v` in any of the following lines, and we use `u` directly.