Skip to content

Commit 5441b80

Browse files
authored
chore: increase nvl monitor validate freq (#527)
## Description <!-- Describe what this PR does --> Increase the frequency of the validate_and_sync_nmxm_info function, which cleans up stale partitions and fixes other issues stemming from NMX-M problems. ## Type of Change <!-- Check one that best describes this PR --> - [ ] **Add** - New feature or capability - [ ] **Change** - Changes in existing functionality - [ ] **Fix** - Bug fixes - [ ] **Remove** - Removed features or deprecated functionality - [x] **Internal** - Internal changes (refactoring, tests, docs, etc.) ## Related Issues (Optional) <!-- If applicable, provide GitHub Issue. --> ## Breaking Changes - [ ] This PR contains breaking changes <!-- If checked above, describe the breaking changes and migration steps --> ## Testing <!-- How was this tested? Check all that apply --> - [ ] Unit tests added/updated - [ ] Integration tests added/updated - [ ] Manual testing performed - [ ] No testing required (docs, internal refactor, etc.) ## Additional Notes <!-- Any additional context, deployment notes, or reviewer guidance --> Signed-off-by: Thomas McRoberts <tmcroberts@nvidia.com>
1 parent 0affc7c commit 5441b80

File tree

1 file changed

+2
-2
lines changed
  • crates/api/src/nvl_partition_monitor

1 file changed

+2
-2
lines changed

crates/api/src/nvl_partition_monitor/mod.rs

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -860,11 +860,11 @@ impl NvlPartitionMonitor {
860860
HashMap<MachineId, Option<MachineNvLinkInfo>>,
861861
Vec<db::nvl_partition::NvlPartition>,
862862
)> {
863-
// Only run validation once per hour.
863+
// Only run once every 15 minutes.
864864
{
865865
let last_validation = self.last_nvlink_info_validation.lock().unwrap();
866866
if let Some(last_time) = *last_validation
867-
&& last_time.elapsed() < std::time::Duration::from_secs(3600)
867+
&& last_time.elapsed() < std::time::Duration::from_secs(900)
868868
{
869869
return Ok((machine_nvlink_info, db_nvl_partitions));
870870
}

0 commit comments

Comments
 (0)