Open
Description
https://github.com/google/maxtext/blob/main/MaxText/scratch_code/golden_llama2-70b_export.py
thats what maxtext does to guarantee that the implementation is identical to official llama 2 70B
It would be good to do the same to ensure that AXLearn models are proper implementations too.
Metadata
Metadata
Assignees
Labels
No labels