-
Notifications
You must be signed in to change notification settings - Fork 37
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix deadlock related to safepoint sync. #279
Conversation
`gc_lock` now never does safepoint check. We always explicitly enter safepoint via `ThreadBlockInVM`. The VM companion thread now lets `VM_MMTkSTWOperation` (executed by the VM thread) transition the state to `_threads_resumed` instead of doing it after `VMThread::execute` returns. This works around a problem where `VMThread::execute` continues to block the companion thread when the VM thread starts executing other VM operations. Some minor style changes: We consistently use the constant `Mutex::_no_safepoint_check_flag` instead of `true` when acquiring or waiting for the `Monitor`. Slightly updated comments and logs. Logging statements no longer include thread IDs because thread IDs can be turned on by a command line option.
lusearch used to cause deadlock, and this PR should fix it.
Why was this never a problem when we had the coordinator thread? Also it'd be good to get @tianleq to take a look at this PR since he's spent a lot of time understanding the internal safepoint thread mechanism in HotSpot. |
When we had the coordinator thread, After removing the coordinator thread, we removed the mutex and condvar in However, the synchronization problem with OpenJDK's VMThread blocked the "companion thread", which in turn blocked the worker executing So the reason why this problem only manifests after we removed the coordinator thread is that our synchronization mechanism in mmtk-core changed. This problem is very likely to reproduce when executing One reason why
In comparison, in DaCapo 2006, it doesn't execute any I think the |
I added @wenyuzhao as the other reviewer, and removed myself from the list. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
gc_lock
now never does safepoint check. We always explicitly enter safepoint viaThreadBlockInVM
.The VM companion thread now lets
VM_MMTkSTWOperation
(executed by the VM thread) transition the state to_threads_resumed
instead of doing it afterVMThread::execute
returns. This works around a problem where the VM thread fails to unblock the companion thread fromVMThread::execute
after finishing evaluatingVM_MMTkSTWOperation
.Also added the
lusearch
benchmark from DaCapo 2006 to normal tests. This test case is known to cause deadlock before, and this PR should fix it.Some minor style changes:
We consistently use the constant
Mutex::_no_safepoint_check_flag
instead oftrue
when acquiring or waiting for theMonitor
.Slightly updated comments and logs. Logging statements no longer include thread IDs because thread IDs can be turned on by a command line option.
Fixes: #278