docs icon indicating copy to clipboard operation
docs copied to clipboard

SUG 9.5 suggests a degenerate model

Open jsocolar opened this issue 4 years ago • 1 comments

In section 9.5, the full Bayes for the correlated topic model uses a mvnormal variate transformed by a softmax as probabilities. The problem is that the inverse softmax doesn't work as a link function because it isn't unique. My impression from a brief perusal of Aitchison & Shen 1980 is that the solution is to fix one element of the link-scale parameter vector to zero, which identifies which unique inverse-softmax we're dealing with as our link function.

The model of Blei & Lafferty (also cited in the SUG) appears to circumvent the problem by using the maximum a posteriori estimate for the link-scale mvnormal. Although the posterior is degenerate, it still has a maximum thanks to the prior, and once the estimate is fixed to that maximum the degeneracy disappears.

jsocolar avatar Dec 20 '21 16:12 jsocolar

Just to note that in the new structure of the SUG, this issue is in the "correlated topic model" subsection of the "clustering models" section of the example models.

The exact same nonidentifiability issue in the softmax is adequately addressed under the "identifiability" paragraph heading of the "multi-logit regression" subsection of the "regression models" section of the example models; perhaps we could borrow some text there or just link back to that text in the correlated topic model section.

jsocolar avatar Feb 23 '25 15:02 jsocolar