make training loop compatible with new data sampler

69f5617
Open

(feat) add attention logits to model output, add attention soft_cap to vanilla attention; (fix) DP sharding of batch, update dtype of memory tracking interval #209

Workflow runs completed with no jobs