I'm running classify on the MRPC dataset. In trainer.train, the call trainer.train(get_loss, model_file, True) is rejected: it allows only three arguments, not four, so I can't use the pretrain file.
It also runs out of memory:
return x * 0.5 * (1.0 + torch.erf(x / math.sqrt(2.0)))
RuntimeError: CUDA out of memory. Tried to allocate 48.00 MiB (GPU 0; 4.00 GiB total capacity; 3.02 GiB already allocated; 43.35 MiB free; 223.00 KiB cached)
Iter (loss=X.XXX): 0%| | 0/115 [00:00<?, ?it/s]
Please help.
I'm using cfg.hidden instead of cfg.dim, and a dropout probability of 0.5.
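
One possible reading of the argument-count problem, sketched with a hypothetical Trainer class (a stand-in, not the repository's actual code): if train is defined with two parameters besides self, Python counts self as well, so passing three explicit arguments is reported as four. The signature, the get_loss placeholder, and the "model_file" string below are all assumptions for illustration.

```python
class Trainer:
    """Hypothetical stand-in; not the repository's actual Trainer."""

    def train(self, get_loss, model_file):
        # Counting `self`, this method takes three arguments in total.
        return "training with " + str(model_file)


def get_loss(model, batch):
    # Placeholder loss callback, for illustration only.
    return 0.0


trainer = Trainer()

try:
    # Three explicit arguments plus the implicit `self` makes four.
    trainer.train(get_loss, "model_file", True)
    error_message = None
except TypeError as exc:
    error_message = str(exc)

# The message states that 3 positional arguments are accepted but 4 were given.
print(error_message)

# Dropping the extra argument matches the hypothetical signature.
result = trainer.train(get_loss, "model_file")
```

If this is what is happening, checking the actual signature of trainer.train in the checked-out version of the code would show which of the three arguments it no longer expects.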