
WIP ZeroSumNormal example notebook

Open drbenvincent opened this issue 2 years ago • 50 comments

Here is the first draft of a notebook which showcases a new PyMC distribution, ZeroSumNormal, created by @aseyboldt.

A few things to note:

  • ~this runs under v4 and it imports ZeroSumNormal from a module ZeroSumNormal.py. The idea is that once this pull request https://github.com/pymc-devs/pymc3/pull/4776 is merged we can get rid of this import and just call pm.ZeroSumNormal.~
  • this runs under v3 and it imports ZeroSumNormal from a module ZeroSumNormal.py. The idea is that once this pull request https://github.com/pymc-devs/pymc3/pull/4776 is merged we can get rid of this import and just call pm.ZeroSumNormal.
  • the end of the notebook is purposefully vague as I believe @aseyboldt has some suggestions to make.
  • Adrian: I've tried to make sure credit and attribution is very clear but do let me know if you'd like anything changed.

To be clear, I don't anticipate this pull request being merged until https://github.com/pymc-devs/pymc3/pull/4776 is done and the notebook can call pm.ZeroSumNormal natively.
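
For context, once that PR lands the import workaround should go away and the notebook would just do something along these lines (a rough sketch only: the group labels are made up and the final argument names of pm.ZeroSumNormal may differ from whatever #4776 settles on):

    import pymc3 as pm  # `import pymc as pm` once the notebook moves to v4

    with pm.Model(coords={"groups": ["a", "b", "c"]}) as model:  # hypothetical group labels
        # group deflections constrained to sum to zero, with no manual centering trick
        β = pm.ZeroSumNormal("β", sigma=1, dims="groups")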

Open to comments and feedback.

TODO

  • [x] Revise the introduction, with greater emphasis on categorical coding and the differences between the frequentist and Bayesian approaches
  • [ ] What is the intuition behind ZeroSumNormal doing better than the manual sum-to-zero constraint?
  • [ ] Add 2x3 ANOVA style example
  • [ ] (maybe) Add multinomial/ordinal regression example
  • [ ] "How does ZeroSumNormal differ from the manually implementation in a way that it solves the problems above? Does this differ mathematically from the previous model?" from @tomicapretto
  • [ ] "I'm curious what this does different than the manual approach above that leads to better convergence?" from @twiecki

drbenvincent avatar Aug 16 '21 17:08 drbenvincent


View / edit / reply to this conversation on ReviewNB

mjhajharia commented on 2021-08-18T00:14:14Z ----------------------------------------------------------------

this is great!! I also really like the lucid writing style in the notebook :))


drbenvincent commented on 2021-08-18T13:51:51Z ----------------------------------------------------------------

Thanks

View / edit / reply to this conversation on ReviewNB

mjhajharia commented on 2021-08-18T00:14:15Z ----------------------------------------------------------------

"theano: not installed" - we wouldn't really want it in v4, right? (might be wrong, just curious)


drbenvincent commented on 2021-08-19T08:32:26Z ----------------------------------------------------------------

Have changed to the latest ZeroSumNormal code now (which is currently only compatible with v3), so I've left Theano in for now.

View / edit / reply to this conversation on ReviewNB

twiecki commented on 2021-08-18T10:38:05Z ----------------------------------------------------------------

deflectsions typo


drbenvincent commented on 2021-08-19T08:28:55Z ----------------------------------------------------------------

fixed

View / edit / reply to this conversation on ReviewNB

twiecki commented on 2021-08-18T10:38:06Z ----------------------------------------------------------------

god -> good


View / edit / reply to this conversation on ReviewNB

twiecki commented on 2021-08-18T10:38:06Z ----------------------------------------------------------------

Why does this look so terrible?


drbenvincent commented on 2021-08-18T10:48:04Z ----------------------------------------------------------------

We're at the limits of floating point precision? I'm not sure this plot is strictly required. I could switch it out for an is_close assertion. Speaking to Adrian later today, so will discuss.

drbenvincent commented on 2021-08-19T08:30:55Z ----------------------------------------------------------------

Have removed this and replaced with np.allclose(trace_centered.posterior.β.stack(sample=("chain", "draw")).sum("groups").values, 0.0)

OriolAbril commented on 2021-08-23T12:24:57Z ----------------------------------------------------------------

I think np.allclose can take DataArrays directly and that it doesn't care whether the input is 1d or multidimensional, so I believe np.allclose(trace_centered.posterior.β.sum("groups"), 0.0) will work

drbenvincent commented on 2021-09-05T10:55:42Z ----------------------------------------------------------------

Yes - have switched to that. It will appear in an upcoming commit.
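
For reference, the check being discussed boils down to something like the following; trace_centered and the "groups" dimension are the names used in the notebook, so treat this as a sketch:

    import numpy as np

    # every posterior draw of β should sum to (numerically) zero across the groups;
    # np.allclose accepts the xarray DataArray directly
    assert np.allclose(trace_centered.posterior["β"].sum("groups"), 0.0)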

View / edit / reply to this conversation on ReviewNB

twiecki commented on 2021-08-18T10:38:07Z ----------------------------------------------------------------

I'm curious what this does different than the manual approach above that leads to better convergence?


drbenvincent commented on 2021-08-19T08:31:40Z ----------------------------------------------------------------

Might have to refer to Adrian for an answer. Have added this question to a list of points to address in updates


Thanks for the feedback so far 🙏🏻 Have addressed typos, switched to using the latest ZeroSumNormal code (which only works under v3 at the moment), and introduced a random seed for the sampling for reproducibility.

No more small-scale comments needed for the moment - am working on some major updates after a discussion with @aseyboldt. Will shout when done 🙂

drbenvincent avatar Aug 18 '21 18:08 drbenvincent

View / edit / reply to this conversation on ReviewNB

chiral-carbon commented on 2021-08-18T19:43:09Z ----------------------------------------------------------------

here would it be possible to use coords and dims instead of shape? Something like

with pm.Model(coords={'obs_id': np.arange(3)}) as naive_model:

and

   β = pm.Normal("β", 0, 10, dims='obs_id')

it's not a major change, but best practice would be to use coords and dims wherever possible.


drbenvincent commented on 2021-08-19T08:29:33Z ----------------------------------------------------------------

Good point. Did it in the first model, but then lapsed from best practice. Now fixed.
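
Putting the two fragments from the suggestion together, the coords/dims version of that model would look roughly like this (the obs_id labels are just the placeholder values used in the comment above):

    import numpy as np
    import pymc3 as pm

    coords = {"obs_id": np.arange(3)}
    with pm.Model(coords=coords) as naive_model:
        # dims ties the shape of β to the obs_id coordinate instead of a bare shape=3
        β = pm.Normal("β", 0, 10, dims="obs_id")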

View / edit / reply to this conversation on ReviewNB

chiral-carbon commented on 2021-08-18T19:43:10Z ----------------------------------------------------------------

Same comment as above about using coords and dims instead of shape.


drbenvincent commented on 2021-08-19T08:30:03Z ----------------------------------------------------------------

done


View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-08-23T12:25:05Z ----------------------------------------------------------------

I find the notation a bit confusing. We use beta_0 as the intercept in the math, but then the code uses intercept, and the code's beta_0 corresponds to the math's beta_1. We should pick one convention and use the same in both math and code. I personally prefer using intercept even in the formula, instead of having the 0 index be "different" and excluded from the ZeroSumNormal. Between 0- or 1-based indexing for the betas I don't have any preference though; I think "all betas sum to 0" is clear enough on its own, and coordinate values can be anything, so there is no issue with math/code compatibility either.


drbenvincent commented on 2021-09-12T15:14:11Z ----------------------------------------------------------------

Things are a bit different now with the new introduction. For that, I've kept the beta notation because it seems to work. For the intercept + deflections model, I've switched to intercept and delta parameters. Hopefully this works.
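
To make the convention concrete, the intercept-plus-deflections parameterisation described above would look something like the sketch below; δ stands for the deflections, the group labels are invented, and the exact ZeroSumNormal signature is an assumption:

    with pm.Model(coords={"groups": ["a", "b", "c"]}) as model:  # hypothetical labels
        intercept = pm.Normal("intercept", 0, 10)
        # per-group deflections that sum to zero, so the intercept stays identifiable
        δ = pm.ZeroSumNormal("δ", sigma=10, dims="groups")
        # group means are the common intercept plus each group's deflection
        μ = intercept + δ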

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-08-23T12:25:06Z ----------------------------------------------------------------

All variables are named beta here. I remember this bug being fixed already; can you try the latest ArviZ and see if that's enough? Otherwise I'd use the development version.


drbenvincent commented on 2021-09-12T15:14:34Z ----------------------------------------------------------------

This is labelling properly now.

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-08-23T12:25:07Z ----------------------------------------------------------------

I love this plot! A very minor nit: I would label the arrow corresponding to the intercept.


drbenvincent commented on 2021-09-12T15:14:48Z ----------------------------------------------------------------

Done

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-08-23T12:25:07Z ----------------------------------------------------------------

I have mixed feelings about the bullet points, which extend a bit to the pair and posterior plots. First and foremost, the model is a complete failure because it doesn't converge (and never will converge). I don't see the correlations as a failure in themselves, though, but rather as the main reason for this lack of convergence. And given that the model is not converging, we can't trust its results, so it is expected for parameter recovery to fail, but that might not always be the case. We could have a model with no mixing and therefore no convergence that, by chance or because the initial points happened to be close to the true parameters, gets stuck there or very close and does "recover" the true parameters. Both models are equally a failure, though, even if this second hypothetical one "recovers" the parameters.

I think the main factor in deciding how to address this is defining the target audience for this notebook. Do we already expect readers to know about MCMC sampling and convergence?


drbenvincent commented on 2021-09-12T15:15:58Z ----------------------------------------------------------------

I've changed the wording around this model now. I think it makes more sense in the context of the new introduction.

View / edit / reply to this conversation on ReviewNB

OriolAbril commented on 2021-08-23T12:25:08Z ----------------------------------------------------------------

Line #9.        β = pm.Deterministic("β", _β - m)

I would be explicit and set dims="groups" here too, even though it's not needed to set the shape; I think it's valuable info. This will have no effect on any of the later plots as of now, but it would if we use 1-based indexing for the betas when matching math/code notation.

I also don't have a concrete suggestion, so this feels a bit empty, but it would be great to try to find more descriptive names for the _b and _intercept variables, something that gives some insight into the differences between these variables and the constrained ones.


drbenvincent commented on 2021-09-12T15:24:41Z ----------------------------------------------------------------

added the dims="groups"

I've used the _unconstrained suffix. It's more verbose, but maybe clearer.
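
For readers following along, here is a minimal sketch of the manual parameterisation being discussed in this conversation, using the dims and _unconstrained naming from the reply above. The priors and group labels are placeholders, and m is assumed to be the mean of the unconstrained parameters, which is the usual way to impose the constraint by hand:

    with pm.Model(coords={"groups": ["a", "b", "c"]}) as manual_model:  # hypothetical labels
        β_unconstrained = pm.Normal("β_unconstrained", 0, 10, dims="groups")
        # subtracting the mean forces the constrained β to sum to zero across groups
        m = β_unconstrained.mean()
        β = pm.Deterministic("β", β_unconstrained - m, dims="groups")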
