@imgeorgiev I have two question 1. How do you evaluate the trained policy? 2. How do you visualize trained policy?