Pretrain release
The model had trained on only about 5B tokens at this point, which is not very close to done, but I thought it would be fun to release it.