SMC: estimate marginal likelihood #2563

aloctavodia · 2017-09-13T17:19:16Z

The implementation follows Ching, J. and Chen, Y. (2007). Transitional Markov Chain Monte Carlo Method for Bayesian Model Updating, Model Class Selection, and Model Averaging

I am not sure where to store the value of the marginal likelihood. For now I am just adding the attribute marginal_likelihood to the model (see here). Suggestions are more than welcome!

I tested the code with two types of examples, and both show good agreement:

The two-Gaussian example in the above reference (I tested against all scenarios)
A beta-binomial model, where the Bayes Factor is computed analytically. I tried combinations of different priors.
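For reference, the analytic quantity used in the beta-binomial check can be sketched as follows. This is a minimal illustration, not the code from this PR: the function names and the example priors are hypothetical, and it assumes y successes in n Bernoulli trials with a Beta(a, b) prior, for which the marginal likelihood is C(n, y) B(a + y, b + n - y) / B(a, b).

```python
from math import comb, exp, lgamma, log


def log_beta(a, b):
    """Log of the Beta function B(a, b), computed via log-gamma."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def log_marginal_likelihood(y, n, a, b):
    """Analytic log marginal likelihood of y successes in n trials
    under a Beta(a, b) prior on the success probability."""
    return log(comb(n, y)) + log_beta(a + y, b + n - y) - log_beta(a, b)


def bayes_factor(y, n, prior_0, prior_1):
    """Ratio of the marginal likelihoods under two Beta priors,
    each given as an (a, b) tuple."""
    return exp(
        log_marginal_likelihood(y, n, *prior_0)
        - log_marginal_likelihood(y, n, *prior_1)
    )


# Hypothetical usage: compare a flat prior with a prior concentrated near 0.5.
print(bayes_factor(y=3, n=10, prior_0=(1, 1), prior_1=(30, 30)))
```

The value estimated by the sampler for each prior can then be compared against `log_marginal_likelihood`, and the ratio of two estimates against `bayes_factor`.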

The test for Travis is based on the current SMC test; I just added one line. There are two reasons behind this decision: to avoid increasing the testing time and to make minimal changes to the existing code. The test corresponds to scenario IV in the Ching & Chen paper. Nevertheless, if necessary I could add other scenarios to the tests, or maybe a comparison with the beta-binomial model.

For the test I also increased n_chains to 1000 and decreased n_steps to 10. I did this because the accuracy of the marginal likelihood depends more on the number of chains than on the value of n_steps, and while n_steps = 10 seems too small, I think it is enough for the tests.

hvasbath · 2017-09-13T20:40:54Z

This is basically only a refactoring of things that were already there, unless I am missing something.
Out of curiosity, and of course interest: what do you need that for, and what can be done with it?

aloctavodia · 2017-09-13T22:04:08Z

Yes, this is mostly a refactoring. A by-product of the SMC sampler is an estimate of the marginal likelihood from the unnormalized weights; this PR just computes that quantity (see sj and step.sjs) and makes it available to the user.
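To make the idea concrete, here is a rough sketch of how the estimate falls out of the stage weights. It is not the PyMC3 implementation, and all names are hypothetical. It assumes a toy conjugate model (theta ~ N(0, 1), one observation y ~ N(theta, sigma)), so each tempered distribution is Gaussian and can be sampled exactly; a real SMC run obtains those samples by resampling and MCMC moves instead. At every stage the mean of the incremental weights, likelihood ** (beta_next - beta_prev), is computed, and the product of those means over all stages estimates the marginal likelihood.

```python
import math
import random


def log_likelihood(theta, y, sigma):
    """Log density of y under N(theta, sigma**2)."""
    return -0.5 * math.log(2 * math.pi * sigma**2) - 0.5 * ((y - theta) / sigma) ** 2


def sample_tempered(beta, y, sigma, n, rng):
    """Exact draws from prior * likelihood**beta, which is Gaussian here.
    (Stand-in for the resampling + MCMC step of a real SMC sampler.)"""
    precision = 1.0 + beta / sigma**2
    mean = (beta * y / sigma**2) / precision
    return [rng.gauss(mean, precision**-0.5) for _ in range(n)]


def log_mean_exp(values):
    """Numerically stable log of the mean of exp(values)."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values) / len(values))


def log_marginal_likelihood(y, sigma, betas, n_chains, seed=0):
    """Sum over stages of the log mean incremental weight."""
    rng = random.Random(seed)
    log_z = 0.0
    for beta_prev, beta_next in zip(betas[:-1], betas[1:]):
        thetas = sample_tempered(beta_prev, y, sigma, n_chains, rng)
        log_w = [(beta_next - beta_prev) * log_likelihood(t, y, sigma) for t in thetas]
        log_z += log_mean_exp(log_w)
    return log_z


def exact_log_marginal_likelihood(y, sigma):
    """Analytic answer for the toy model: y ~ N(0, 1 + sigma**2)."""
    var = 1.0 + sigma**2
    return -0.5 * math.log(2 * math.pi * var) - 0.5 * y**2 / var


betas = [i / 10 for i in range(11)]  # fixed tempering schedule from 0 to 1
print(log_marginal_likelihood(1.0, 1.0, betas, n_chains=5000))
print(exact_log_marginal_likelihood(1.0, 1.0))
```

The schedule here is fixed for simplicity; choosing the beta increments adaptively would not change how the stage means are combined.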

The marginal likelihood is used in model comparison, hypothesis testing and model averaging. Maybe you have heard about Bayes factors (the ratio of the marginal likelihoods of two models). I am not a very big fan of Bayes factors, but I think people may find them useful. In fact, I am using them together with WAIC in a biomolecular project.

I realize now that this PR should also add a notebook that explains Bayes factors and shows an example of how to use SMC to compute them. Currently Bayes factors are just barely mentioned in the PyMC3 documentation (I will add such a notebook to this PR tomorrow).

If you check the papers suggested by @seanlaw in #2519 you will see that one of the reasons to use methods such as WHAM is to compute the "partition function" and "free energies". These quantities are very important in Statistical Mechanics/Thermodynamics; interestingly, the Bayesian equivalent of the partition function is the marginal likelihood. Luckily for us, and thanks to you, we can use SMC to estimate the marginal likelihood, and hence I think we don't need to implement methods such as WHAM.

hvasbath · 2017-09-13T22:13:25Z

Thanks a lot for the explanation @aloctavodia! I have so much to learn ... ;) Such a notebook would be great! Can't wait to see it.

aloctavodia · 2017-09-15T21:18:42Z

Besides the new notebook, I made a couple of changes in the GLM-model-selection notebook:

Remove Bayes Factor section
Add a comment from Watanabe to balance the comment from Avehtari :-)

hvasbath · 2017-09-16T09:39:04Z

That's a nice notebook @aloctavodia! Thanks a lot for writing such a detailed description. There are a few grammar issues here and there, but maybe one of our native speakers should correct them, to be sure it is correct ;)

junpenglao · 2017-09-16T15:16:37Z

LGTM as well, very informative notebook on Bayes factors @aloctavodia ;-)

* SMC: estimate marginal likelihood * add example marginal likelihood computation * fix typos

SMC: estimate marginal likelihood

5492f08

add example marginal likelihood computation

69788a8

fix typos

611c150

aloctavodia merged commit 02a4da0 into pymc-devs:master Sep 19, 2017

aloctavodia deleted the SMCml branch September 19, 2017 16:44

junpenglao mentioned this pull request Oct 30, 2017

SMC improve its efficiency by using samples from all (high temperature) stages #2519

Closed

ColCarroll pushed a commit that referenced this pull request Nov 9, 2017

SMC: estimate marginal likelihood (#2563)

fdef760

* SMC: estimate marginal likelihood * add example marginal likelihood computation * fix typos

reshamas mentioned this pull request Jul 13, 2022

update notebook rendering: Bayes Factor pymc-devs/pymc-examples#395

Merged
