Why I get the different results between eval mode and test mode? 