**Training script** #23

@arampacha

Description

  • add bf16 support
  • check that training with bf16 weights works correctly
  • add resuming from a checkpoint
  • add wandb tracking
  • complete the Adafactor option
  • figure out how best to use the profiler to optimize the training loop
  • add gradient accumulation
  • support iterable datasets and a max_steps argument
  • add a prefetch generator for the dataloader
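
For the prefetch item, one possible shape is a generator that fills a small buffer from a background thread while the training step runs. The issue does not say which framework or dataloader the script uses, so this is a framework-agnostic sketch using only the Python standard library; `prefetch` and `buffer_size` are hypothetical names, not part of the existing script.

```python
import queue
import threading

# Unique marker used to signal the end of the stream (or an error).
_DONE = object()


def prefetch(iterable, buffer_size=2):
    """Yield items from `iterable`, loading up to `buffer_size` items ahead.

    Assumption: items are produced by a single background thread. If the
    consumer stops early, the daemon thread is left blocked on the queue
    and is only cleaned up at interpreter exit.
    """
    buf = queue.Queue(maxsize=buffer_size)

    def producer():
        try:
            for item in iterable:
                buf.put(item)  # blocks while the buffer is full
            buf.put((_DONE, None))
        except BaseException as exc:  # forward loader errors to the consumer
            buf.put((_DONE, exc))

    threading.Thread(target=producer, daemon=True).start()

    while True:
        item = buf.get()
        if isinstance(item, tuple) and len(item) == 2 and item[0] is _DONE:
            if item[1] is not None:
                raise item[1]
            return
        yield item
```

Because it wraps any iterable, the same generator would also work with an iterable dataset, and a step limit could be applied on top with `itertools.islice(prefetch(batches), max_steps)`.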
