While using it, I found that ResNet did not perform better than the network used in the original paper. I eventually traced the problem to the network's downsample ratio: ResNet uses 32, whereas the original paper uses 16. I have therefore made minor modifications to resnet.py and will submit a pull request later.
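To make the 32 vs. 16 difference concrete, here is a rough sketch that multiplies the per-stage strides of a standard ResNet (conv1, maxpool, layer1 to layer4). The names `RESNET_STRIDES`, `downsample_ratio` and `feature_map_size` are made up for illustration and are not taken from resnet.py. Setting the stride of the last stage to 1 is only one common way to reach a ratio of 16; it is an assumption here, not necessarily the change I made.

```python
from math import prod

# Strides of the stages in a standard ResNet (illustrative table, not read from resnet.py).
RESNET_STRIDES = {
    "conv1": 2,
    "maxpool": 2,
    "layer1": 1,
    "layer2": 2,
    "layer3": 2,
    "layer4": 2,
}


def downsample_ratio(strides):
    """Total downsample ratio = product of all stage strides."""
    return prod(strides.values())


def feature_map_size(input_size, strides):
    """Spatial size of the final feature map for a square input.

    Each strided stage in ResNet uses 'same'-style padding, so its
    output size is ceil(n / stride).
    """
    size = input_size
    for stride in strides.values():
        size = -(-size // stride)  # ceiling division
    return size


# Assumed modification: keep the last stage at stride 1 so the ratio drops to 16.
modified_strides = dict(RESNET_STRIDES, layer4=1)

print(downsample_ratio(RESNET_STRIDES))         # 32
print(downsample_ratio(modified_strides))       # 16
print(feature_map_size(512, RESNET_STRIDES))    # 16
print(feature_map_size(512, modified_strides))  # 32
```

With a 512x512 input, the stock ResNet ends with a 16x16 feature map, while a ratio of 16 keeps it at 32x32, so the features are spatially finer.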