Skip to content
This repository was archived by the owner on Apr 17, 2023. It is now read-only.
This repository was archived by the owner on Apr 17, 2023. It is now read-only.

Error while training StyleGAN with CIFAR-10 #124

Open
@hexiangdong2017

Description

@hexiangdong2017

(fb_gan_zoo) root@f56c103c5607:~/pytorch_GAN_zoo# python train.py StyleGAN -c config_cifar10.json --restart -n cifar10
Setting up a new session...
Running StyleGAN
size 10
50000 images found
AC-GAN classes :
{'Main': {'order': 0, 'values': ['horse', 'deer', 'automobile', 'cat', 'frog', 'ship', 'airplane', 'truck', 'dog', 'bird']}}

size 10
50000 images found
50000 images detected
size (8, 8)
50000 images found
Changing alpha to 0.000
/root/pytorch_GAN_zoo/models/base_GAN.py:278: UserWarning: This overload of add_ is deprecated:
add_(Number alpha, Tensor other)
Consider using one of the following signatures instead:
add_(Tensor other, *, Number alpha) (Triggered internally at /pytorch/torch/csrc/utils/python_arg_parser.cpp:882.)
avg_p.mul_(0.999).add_(0.001, p.data)
Traceback (most recent call last):
File "train.py", line 137, in
GANTrainer.train()
File "/root/pytorch_GAN_zoo/models/trainer/progressive_gan_trainer.py", line 235, in train
status = self.trainOnEpoch(dbLoader, scale,
File "/root/pytorch_GAN_zoo/models/trainer/gan_trainer.py", line 486, in trainOnEpoch
allLosses = self.model.optimizeParameters(inputs_real,
File "/root/pytorch_GAN_zoo/models/base_GAN.py", line 249, in optimizeParameters
self.classificationPenalty(predFakeD,
File "/root/pytorch_GAN_zoo/models/base_GAN.py", line 563, in classificationPenalty
loss.backward(retain_graph=True)
File "/root/anaconda3/envs/fb_gan_zoo/lib/python3.8/site-packages/torch/tensor.py", line 221, in backward
torch.autograd.backward(self, gradient, retain_graph, create_graph)
File "/root/anaconda3/envs/fb_gan_zoo/lib/python3.8/site-packages/torch/autograd/init.py", line 130, in backward
Variable._execution_engine.run_backward(
RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [512, 512]], which is output 0 of TBackward, is at version 2; expected version 1 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradient, with torch.autograd.set_detect_anomaly(True).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions