Skip to content

Popular repositories Loading

  1. pretrain-data pretrain-data Public

    Pretraining data reconstruction scripts for Apertus

    Python 143 14

  2. apertus-tech-report apertus-tech-report Public

    Tech Report of the Apertus LLM

    138 5

  3. Megatron-LM Megatron-LM Public

    Forked from NVIDIA/Megatron-LM

    Ongoing research training transformer models at scale

    Python 49 34

  4. apertus-finetuning-recipes apertus-finetuning-recipes Public

    Python 38 17

  5. ESFM ESFM Public

    Python 32 8

  6. MoE MoE Public

    some mixture of experts architecture implementations

    Python 27 3

Repositories

Showing 10 of 107 repositories

Top languages

Loading…

Most used topics

Loading…