Hello, I am trying to finetune GPT-j-6b. I followed the instructions provided in the documentation. But, I get this error. I tried by changing batch size =1, gradient_accumulation_steps=4. Any idea how can i solve this.
Hello,
I am trying to finetune GPT-j-6b.
I followed the instructions provided in the documentation. But, I get this error.
I tried by changing batch size =1, gradient_accumulation_steps=4.
Any idea how can i solve this.