> 我也有这个问题,作者的实现的差不多5个G,测试准确率有78.9, pytorch官方实现差不多1.4个G,测试集准确率也只有60%,,请问有看到问题在哪吗?
# my implementation
>>> from models.resnet import resnet50
>>> net = resnet50()
>>> sum(p.numel() for p in net.parameters())
23705252
# torchvision implementation
>>> from torchvision.models import resnet50
>>> net = resnet50()
>>> sum(p.numel() for p in net.parameters())
25557032