bambi Bambi slow one model but not the other even though performance should be identical

Bambi slow one model but not the other even though performance should be identical

Open canyon289 opened this issue 2 years ago • 2 comments

For some reason Bambi is slow on the second model but not the first. Expectation is run times are similar. The large discrepancy is quite confusing

Full notebook attached

model_age_videogames = bmb.Model("converted['Yes'] ~ age + likes_videogames", df, link="logit", family="bernoulli")
idata_age_videogames = model_age_videogames.fit(draws=2000, target_accept=0.85, random_seed=SEED)

model_age_videogames = bmb.Model("converted['Yes'] ~ age + likes_videogames+0", df, link="logit", family="bernoulli")
idata_age_videogames = model_age_videogames.fit(draws=2000, target_accept=0.85, random_seed=SEED)

https://gist.github.com/canyon289/5e6dcc218788adcb4712a34f10be377e

Sep 14 '22 02:09 canyon289

Thanks for the detailed gist :)

Sep 15 '22 22:09 tomicapretto

I am also facing this issue. Running the exact same model run 'back to back' gives the same outcome as what you have observed -- the first model converges, the second does not and takes forever to sample only to give different results for all of the values in arviz.summary(trace) once it has finished sampling. You don't show the result of arviz.summary(trace) in your example, but I bet the R_hat's on the second model are >>1, indicating serious sampling issues. I feel like pymc must be doing some sort of unintended warm start when Babmi is used. I do not see any issues when I use pymc alone is used.

Mar 14 '23 21:03 kelleyjbrady

bambi bambi copied to clipboard

Bambi slow one model but not the other even though performance should be identical

bambi
bambi copied to clipboard