may i ask if this robot uses some reinforcement learning algorithm and if there are some simulation works before the code applying to the real robot ?