-
Notifications
You must be signed in to change notification settings - Fork 80
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Convergence testing with distance travelled #467
base: master
Are you sure you want to change the base?
Conversation
Pull Request Test Coverage Report for Build 13121271741Details
💛 - Coveralls |
Hi @ColmTalbot , Thanks for the updated patches.
|
Hi, @ColmTalbot Thanks for the patches. I looked at them and I am certainly a fan of the logl insertion index statistic in particular. It would be great to have that convergence test. I have however a suggestion of somewhat different implementation to the one you are proposing.The current implementation is in my opinion somewhat invasive, as it adds a bunch of keywords/variables specific to your convergence statistics that needs to be stored during sampling. In my understanding, if we, for example, store the iteration at which each point is added, i.e. Basically my thinking is that I'd prefer these convergence statistics to be computed after the run (outside the sampler) and we just need to ensure the relevant information is saved, so we can properly retrace the steps of the sampler. Thanks |
I had initially hoped that this could be done in post-processing, but the distance check depends on the point used to start the MCMC chain, which would require modifying The likelihood test can (and is) done is post using currently available information. I'm not sure how they deal with dynamic mode. One advantage to having these statistics be computed during the run is that for long-running tasks it can save significant time to know after a day that the convergence is bad, rather than waiting a week. |
I agree that it can be useful to know some convergence statistics during the run. I would put that lower in priority for the moment. |
7276a60
to
5cc6af9
Compare
5cc6af9
to
cd2e6cb
Compare
cd2e6cb
to
058f7a3
Compare
058f7a3
to
d9eed57
Compare
d9eed57
to
8214eb5
Compare
Reference issue
Motivated by discussion #365 and #366
What does you PR implement/fix ?
Hi @segasai, first of all I'm sorry for my extended absence on #365 and #366 I went down a deep rabbit hole related to the discussion in #365 and that combined with other work/life commitments didn't leave much time for implementing this. I think I've got to the bottom of this and if you're still willing I'd like to get back into this.
I've attached a draft describing a new test that uses the distance travelled in the MCMC as a diagnostic that is implemented in this branch.
I think this test and toy model problems we've discussed before are an excellent pair to put together a set of tests for potential changes to the sampling/MCMC methods.
nested_sampling_distance_test.pdf
I've included a new plot that shows the likelihood/distance insertion index
I've also been experimenting with plots that will show evolution of the KL divergence estimate, e.g.,
