example-models icon indicating copy to clipboard operation
example-models copied to clipboard

Simplify LDA input parameterization

Open gokceneraslan opened this issue 6 years ago • 4 comments

I tried to simplify LDA input representation by using a simple M x V matrix of word frequencies where M and V represent number of documents and words. In the model, now instead of iterating over all words of all documents, iterations are over each element of the M x V matrix.

gokceneraslan avatar Nov 11 '18 04:11 gokceneraslan

Thanks for submitting. I've been out for a while, so haven't been able to review this, but I'll get to it ASAP.

bob-carpenter avatar Nov 17 '18 21:11 bob-carpenter

Oh, and I'd suggest adding suffixes to existing model names like _counts to indicate you're taking sufficient stats rather than the raw data.

bob-carpenter avatar Nov 17 '18 21:11 bob-carpenter

Oh, and I'd suggest adding suffixes to existing model names like _counts to indicate you're taking sufficient stats rather than the raw data.

You mean adding _counts to the new model? Because it's the one uses counts.

gokceneraslan avatar Nov 17 '18 23:11 gokceneraslan

Anything to distinguish the way in which data is coded in the two approaches. So yes, I meant keeping .stan as is and adding _counts.stan or something similar for the sufficient stats version.

On Nov 17, 2018, at 6:04 PM, Gökçen Eraslan [email protected] wrote:

Oh, and I'd suggest adding suffixes to existing model names like _counts to indicate you're taking sufficient stats rather than the raw data.

You mean adding _counts to the new model? Because it's the one uses counts.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.

bob-carpenter avatar Nov 18 '18 22:11 bob-carpenter