-
Notifications
You must be signed in to change notification settings - Fork 3.2k
Description
Describe the bug
I'm getting a megatron error when trying to import from nemo.collections.llm:
File "/workspace/nemo2_lora_ft.py", line 12, in <module> from nemo.collections import llm File "/opt/NeMo/nemo/collections/llm/__init__.py", line 52, in <module> from nemo.collections.llm.gpt.model import ( # noqa: F401 File "/opt/NeMo/nemo/collections/llm/gpt/model/__init__.py", line 65, in <module> from nemo.collections.llm.gpt.model.hyena import ( File "/opt/NeMo/nemo/collections/llm/gpt/model/hyena.py", line 34, in <module> from nemo.collections.llm.gpt.model.megatron.hyena.hyena_model import HyenaModel as MCoreHyenaModel File "/opt/NeMo/nemo/collections/llm/gpt/model/megatron/hyena/hyena_model.py", line 30, in <module> from megatron.core.process_groups_config import ProcessGroupCollection ImportError: cannot import name 'ProcessGroupCollection' from 'megatron.core.process_groups_config' (/opt/megatron-lm/megatron/core/process_groups_config.py)
Steps/Code to reproduce bug
Run:
from nemo.collections import llm
Environment overview (please complete the following information)
docker run --rm -it --gpus all --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 nvcr.io/nvidia/nemo:25.09.nemotron_nano_v2_v
Additional context
Ran on A10G