-
Notifications
You must be signed in to change notification settings - Fork 179
Open
Description
Hello, I'm interested in Cogview implemenation.
Actually I have problem in executing scripts/pretrain_single_node.sh
After 20,000 training iteration and executing ./scripts/text2image.sh, but there is no exact of shape for bird or animal in the result image. The text is 飞鹰(Flying eagle)

For training, I executed the shell script only changing the variable, NUM_GPUS_PER_WORKER from 8 to 1 since I have only one GPU instance.
I'm using Google Colab(A100 is supported).
- NUM_GPUS_PER_WORKER=1
- Image tokenizer number of tokens : 8192
- The Number of layers : 12
- The size of hidden layer : 1024
- The number of attention heads : 16
mp_rank_00_model_state.pt is the result file of training and the size of pt(pytorch) file is only 2.76Giga bytes.
Of course, if I use the pretrained model, cogview-base.tar, then the result is all right.

Please check my question and I hope you can give any advise or comment.
Metadata
Metadata
Assignees
Labels
No labels