The NIXL_LIBFABRIC_NUM_RAILS environment variable is not enforced during EFA device discovery, causing all available devices to be used regardless of the setting.
Environment
NIXL 0.8.0
AWS P5.48xlarge (32 EFA devices)
libfabric backend
File
src/utils/libfabric/libfabric_rail_manager.cpp
Symptoms
export NIXL_LIBFABRIC_NUM_RAILS=8
# Still initializes all 32 rails
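For illustration, here is a minimal sketch of what enforcing the limit after discovery could look like. This is not the code in libfabric_rail_manager.cpp: the helper names (parseRailLimit, limitRails, selectRails) and the representation of devices as a vector of names are assumptions made for this example, and keeping the first N discovered devices is only one possible policy (a real fix may want to pick devices by topology instead).

```cpp
#include <charconv>
#include <cstddef>
#include <cstdlib>
#include <cstring>
#include <optional>
#include <string>
#include <system_error>
#include <utility>
#include <vector>

// Hypothetical helper: parse a rail limit from an environment-variable value.
// Returns std::nullopt when the value is unset, not a plain positive integer,
// or zero, so callers fall back to "use every discovered device".
std::optional<std::size_t> parseRailLimit(const char *value) {
    if (value == nullptr) return std::nullopt;
    const char *last = value + std::strlen(value);
    std::size_t n = 0;
    auto [ptr, ec] = std::from_chars(value, last, n);
    if (ec != std::errc() || ptr != last || n == 0) return std::nullopt;
    return n;
}

// Hypothetical helper: keep at most `limit` devices, preserving discovery order.
std::vector<std::string> limitRails(std::vector<std::string> devices,
                                    std::optional<std::size_t> limit) {
    if (limit && *limit < devices.size()) devices.resize(*limit);
    return devices;
}

// Hypothetical entry point: apply NIXL_LIBFABRIC_NUM_RAILS to a discovered list.
std::vector<std::string> selectRails(std::vector<std::string> discovered) {
    return limitRails(std::move(discovered),
                      parseRailLimit(std::getenv("NIXL_LIBFABRIC_NUM_RAILS")));
}
```

With NIXL_LIBFABRIC_NUM_RAILS=8 and 32 discovered devices, selectRails in this sketch would keep 8 entries; an unset or invalid value leaves the list unchanged.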