Open
Description
- add bf16 support
- check if training with bf16 weights works fine
- add resuming from ckpt
- add wandb tracking
- complete adafactor option
- figure out how to best utilize profiler for training loop optimization
- add gradient accumulation
- support iterable datasets and max_steps argument
- prefetch generator for dataloader
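
For the last item, here is a rough sketch of what a prefetch generator could look like. It is not taken from this repo: the name `prefetch` and the `buffer_size` parameter are hypothetical, and it uses only the standard library, so it says nothing about how batches would be moved to an accelerator. It wraps any iterable (e.g. a dataloader) and fills a bounded queue from a background thread so the next batches are prepared while the current step runs.

```python
import queue
import threading

# Unique marker used to signal "producer finished" through the queue.
_SENTINEL = object()


def prefetch(iterable, buffer_size=2):
    """Yield items from `iterable`, preparing up to `buffer_size` items ahead.

    Hypothetical helper for illustration only. Items are produced in a
    daemon thread, so an abandoned generator will not block interpreter exit.
    """
    q = queue.Queue(maxsize=buffer_size)

    def producer():
        try:
            for item in iterable:
                q.put(item)  # blocks while the buffer is full
        except BaseException as exc:
            # Forward the error to the consumer instead of losing it.
            q.put((_SENTINEL, exc))
        else:
            q.put((_SENTINEL, None))

    threading.Thread(target=producer, daemon=True).start()

    while True:
        item = q.get()
        if isinstance(item, tuple) and len(item) == 2 and item[0] is _SENTINEL:
            if item[1] is not None:
                raise item[1]  # re-raise the producer's exception here
            return
        yield item
```

Usage would be along the lines of `for batch in prefetch(train_loader, buffer_size=4): ...`, where `train_loader` stands for whatever iterable the training loop already consumes. Because it only needs an iterable, the same wrapper would also work with iterable datasets.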