Skip to content

NAS-140206 / 27.0 / Clarify code comments and logging messages related to disk retaste#18446

Open
themylogin wants to merge 1 commit intomasterfrom
wait-for-retastes
Open

NAS-140206 / 27.0 / Clarify code comments and logging messages related to disk retaste#18446
themylogin wants to merge 1 commit intomasterfrom
wait-for-retastes

Conversation

@themylogin
Copy link
Contributor

@themylogin themylogin commented Mar 12, 2026

The root cause of the hanging job issue was removing multiprocessing.set_start_method('spawn') when process pool was removed. This is confirmed by the absence of the bug in 26.0.0-BETA.1 where process pool is still present.

@themylogin
Copy link
Contributor Author

@themylogin
Copy link
Contributor Author

@themylogin themylogin changed the title Wait for disk.retaste job to complete NAS-140206 / 27.0 / Wait for disk.retaste job to complete Mar 12, 2026
@themylogin
Copy link
Contributor Author

time 2:00

Copy link
Contributor

@yocalebo yocalebo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The behavioral change of making the caller wait on retaste is profound and extremely intrusive. Why was that logic changed?

@themylogin
Copy link
Contributor Author

@yocalebo I thought that's what was expected. The logging message says Done retasting disks on standby node. It should say Done scheduling retasting disks on standby node then? I can roll this back.

@yocalebo
Copy link
Contributor

@yocalebo I thought that's what was expected. The logging message says Done retasting disks on standby node. It should say Done scheduling retasting disks on standby node then? I can roll this back.

Definitely not what is expected, let's just change the logging message.

@themylogin
Copy link
Contributor Author

@yocalebo what about other retaste callers, should they wait or should they just schedule the retaste?

@yocalebo
Copy link
Contributor

@yocalebo what about other retaste callers, should they wait or should they just schedule the retaste?

Just schedule it, they don't need to wait.

@themylogin themylogin changed the title NAS-140206 / 27.0 / Wait for disk.retaste job to complete NAS-140206 / 27.0 / Clarify code comments and logging messages related to disk retaste Mar 12, 2026
@themylogin themylogin requested a review from yocalebo March 12, 2026 15:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants