torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 394.00 MiB

Hello, 
I am trying to finetune GPT-j-6b.
I followed the instructions provided in the documentation. But, I get this error.

I tried by changing batch size =1, gradient_accumulation_steps=4. 

Any idea how can i solve this.