
CategoricalGibbsMetropolis is slower than the older ElemwiseCategoricalStep #1563


Closed
Anjum48 opened this issue Nov 29, 2016 · 9 comments

@Anjum48
Contributor

Anjum48 commented Nov 29, 2016

Looking at Austin Rochford's Dirichlet process mixture model, which used the older ElemwiseCategoricalStep, it looks like 20,000 iterations took just over 2 minutes.

Compared with the same example in the docs, which uses the newer CategoricalGibbsMetropolis, the same number of iterations takes over 14 minutes.

Does anyone know why there is such an increase in run time, and if there's a way to make it faster? I've just updated my code (with the new step method), which uses the same model but on much larger data sets, and what used to take a few days to run now looks like it might span weeks :(

@twiecki
Member

twiecki commented Nov 30, 2016

That's unfortunate. It's probably related to the Python loop where each element is proposed and accepted individually, rather than all at once like the previous Elemwise did (did we remove that sampler? Why not still use that?). Maybe there's a way to theanoize that loop or we bring back the older one as an alternative.

@Anjum48
Contributor Author

Anjum48 commented Nov 30, 2016

It looks like ElemwiseCategorical does the trick; however, there is a warning saying it's deprecated.

@ozankabak
Contributor

I agree with Thomas, it probably has to do with the Python loop. BTW, ElemwiseCategorical does not support various operations (e.g. indexing), hence the message. But this is a good indication that it should be kept around until CategoricalGibbsMetropolis's performance catches up.

@twiecki
Member

twiecki commented Dec 2, 2016

Actually it's probably not just the Python loop, but the fact that logp gets evaluated N times for a vector of length N rather than just once. There's no way around that for Gibbs, but it does suggest that we should keep ElemwiseCategorical, maybe renamed to CategoricalMetropolis for consistency.
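The scaling can be illustrated with a toy counter. This is only a sketch, not PyMC3's actual implementation — `logp`, `gibbs_sweep`, and `elemwise_sweep` are hypothetical stand-ins — but it shows why a Gibbs scan over an N-vector with K categories touches the joint logp N·K times per sweep, while an elemwise Metropolis proposal needs only a couple of evaluations:

```python
import math
import random

calls = {"n": 0}

def logp(z, weights):
    """Toy joint log-probability of a categorical vector; counts evaluations."""
    calls["n"] += 1
    return sum(math.log(weights[k]) for k in z)

def gibbs_sweep(z, weights):
    """Per-element Gibbs scan: each element's full conditional needs one
    joint-logp evaluation per candidate category -> N * K calls per sweep."""
    K = len(weights)
    for i in range(len(z)):
        logps = [logp(z[:i] + [k] + z[i + 1:], weights) for k in range(K)]
        m = max(logps)
        probs = [math.exp(lp - m) for lp in logps]
        z[i] = random.choices(range(K), weights=probs)[0]
    return z

def elemwise_sweep(z, weights):
    """Elemwise Metropolis: propose every element at once -> 2 joint-logp
    calls per sweep (1 if the current logp is cached)."""
    proposal = [random.randrange(len(weights)) for _ in z]
    accept = math.log(random.random()) < logp(proposal, weights) - logp(z, weights)
    return proposal if accept else z

z0 = [0] * 100
w = [0.2, 0.3, 0.5]

calls["n"] = 0
gibbs_sweep(list(z0), w)
gibbs_calls = calls["n"]      # 100 elements * 3 categories = 300 calls

calls["n"] = 0
elemwise_sweep(list(z0), w)
elemwise_calls = calls["n"]   # 2 calls
```

For a DP mixture with thousands of data points, that factor of N·K per sweep is where the 2-minute-to-14-minute gap plausibly comes from.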

@rahuldave

It is indeed much slower; it makes clustering models nigh impossible. See this screenshot for the time differences on the Old Faithful waiting-times two-component Gaussian mixture model.
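The clustering case is the worst one for this step method, because every data point carries its own categorical assignment and the loop runs over all of them. A minimal stdlib sketch of one such sweep for a two-component Gaussian mixture (a hypothetical `gibbs_assignments`, not the PyMC3 code):

```python
import math
import random

def gibbs_assignments(x, z, means, sd, weights):
    """One Gibbs sweep over per-point cluster assignments of a Gaussian
    mixture: each point is resampled from its full conditional, one point
    at a time (the slow Python loop under discussion)."""
    for i, xi in enumerate(x):
        # Unnormalized log full conditional for each component.
        logps = [
            math.log(w) - (xi - mu) ** 2 / (2 * sd ** 2)
            for w, mu in zip(weights, means)
        ]
        m = max(logps)
        probs = [math.exp(lp - m) for lp in logps]
        z[i] = random.choices(range(len(means)), weights=probs)[0]
    return z

# Well-separated components: each point lands in its own cluster.
x = [-10.0] * 5 + [10.0] * 5
z = gibbs_assignments(x, [0] * 10, means=[-10.0, 10.0], sd=1.0, weights=[0.5, 0.5])
```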

@twiecki
Member

twiecki commented Apr 13, 2017

I wonder if a theano.scan could speed the loop up because it could allow theano to cache the rest of the graph.

@rahuldave Can you just use ElemwiseCategorical? The best solution would be to marginalize out the cluster assignments: http://pymc-devs.github.io/pymc3/notebooks/marginalized_gaussian_mixture_model.html
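The idea behind marginalizing is to sum the discrete assignments out of the likelihood with a log-sum-exp over components, so no categorical sampler is needed at all and the remaining continuous parameters can go to NUTS. A stdlib sketch of the marginal mixture log-likelihood (a hypothetical `marginal_mixture_logp`, not the notebook's code):

```python
import math

def marginal_mixture_logp(x, weights, means, sd):
    """Log-likelihood of data x under a Gaussian mixture with the discrete
    component assignments summed out: sum_i log sum_k w_k N(x_i | mu_k, sd)."""
    def norm_logpdf(v, mu, s):
        return -0.5 * math.log(2 * math.pi * s ** 2) - (v - mu) ** 2 / (2 * s ** 2)

    total = 0.0
    for xi in x:
        terms = [math.log(w) + norm_logpdf(xi, mu, sd)
                 for w, mu in zip(weights, means)]
        m = max(terms)
        total += m + math.log(sum(math.exp(t - m) for t in terms))  # log-sum-exp
    return total

# With a single component this reduces to a plain normal log-density.
lp = marginal_mixture_logp([0.0], [1.0], [0.0], 1.0)
```

This is the same shape of computation that `pm.Mixture`/`pm.NormalMixture` perform in the linked notebook: one logp evaluation over the whole data set per step, instead of one per assignment.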

@rahuldave

Yes, that's what I did. But perhaps the competences ought to be changed for now? Marginalizing, both explicitly and implicitly using the PyMC API, is part of tomorrow's lab :-)

@twiecki
Member

twiecki commented Apr 14, 2017 via email

@twiecki
Member

twiecki commented Apr 15, 2017

See #2037.


6 participants