Track Qwen3.5-related issues #2281
Opened by zpqiu on Apr 17, 2026
- MCore Path
  - Add / track CP support: [main] feat(moe): Support packed sequence for gated delta net (GDN) NVIDIA/Megatron-LM#2645
- AutoModel Path
  - Move the FLA dependency from the dev group (`[dependency-groups]`) to optional extras (`[project.optional-dependencies]`) so that NeMo-RL can install it downstream via `pkg[extra]`. If FLA is not installed:
    - No CP support
    - Worse performance
    - build: move flash-linear-attention back to optional-dependencies Automodel#1894
  - Fix the default config path where Torch Adam is used without FP32 master weights, as this can slow down convergence.
    - TE FusedAdam can be used as a workaround.
    - AutoModel should correctly support / apply the FP32 master-weight setting: fix: fp32 master weights for custom MoE models under FSDP2 Automodel#1896
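The packaging change in the FLA bullet can be sketched roughly as below. The extra name (`fla`) and the unpinned package spec are illustrative assumptions, not the actual Automodel `pyproject.toml`:

```toml
# Before (assumed layout): FLA sits in a PEP 735 dependency group,
# which downstream projects cannot request when installing the package.
[dependency-groups]
dev = [
    "flash-linear-attention",
]

# After: expose it as a PEP 621 optional extra, so a downstream project
# such as NeMo-RL can declare a dependency like `automodel[fla]`.
# The extra name "fla" is a hypothetical choice.
[project.optional-dependencies]
fla = [
    "flash-linear-attention",
]
```

Since the extra is optional, code paths that need FLA would still have to guard the import at runtime and fall back to the slower, no-CP path the bullet describes.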