tomotopy

Interest in adding BiTerm Model (or other models for short texts)

Open: BrendanKennedy opened this issue 3 years ago • 1 comment

LDA and its variants are known to perform poorly on short texts, where word co-occurrence within a single document is sparse. Is a model like the Biterm Topic Model (Yan, Guo, Lan & Cheng, 2013) planned, or would there be interest in adding it?

Thanks!

Yan, X., Guo, J., Lan, Y., & Cheng, X. (2013, May). A biterm topic model for short texts. In Proceedings of the 22nd international conference on World Wide Web (pp. 1445-1456).  
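
For context, a biterm in the paper above is an unordered pair of words that co-occur in the same short text, and the model works on the corpus-wide collection of such pairs rather than on per-document word counts. A minimal sketch of just that extraction step, using only the standard library (the helper name `extract_biterms` and the toy documents are made up for illustration; this is not the model or its inference):

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair (biterm) from one short text."""
    # Sort each pair so ("store", "apple") and ("apple", "store")
    # are counted as the same biterm.
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


# Toy, already-tokenized short texts (illustrative only).
docs = [
    ["visit", "apple", "store"],
    ["apple", "iphone", "store"],
]

# Pool biterms over the whole corpus, ignoring document boundaries.
biterm_counts = Counter(b for doc in docs for b in extract_biterms(doc))
print(biterm_counts[("apple", "store")])
```

A text with n tokens yields n(n-1)/2 biterms, so this is practical mainly for short texts, which is the setting the question is about.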

BrendanKennedy, Dec 21 '20 05:12

In version 0.11.0, the Pseudo-document based Topic Model (PTM), which is designed for short texts, was added.

Zuo, Y., Wu, J., Zhang, H., Lin, H., Wang, F., Xu, K., & Xiong, H. (2016, August). Topic modeling of short texts: A pseudo-document view. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 2105-2114).
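
A rough sketch of how this model might be tried on short texts, assuming tomotopy 0.11.0 or later is installed and the documents are already tokenized. The helper name `train_ptm` and the default values for `k` (topics), `p` (pseudo-documents), and `iterations` are arbitrary placeholders chosen for illustration, not recommended settings:

```python
def train_ptm(tokenized_docs, k=2, p=4, iterations=100, seed=42):
    """Train tomotopy's PTModel and return the top words of each topic."""
    # Imported inside the function so the helper can be defined
    # even on a machine where tomotopy is not installed.
    import tomotopy as tp

    # k = number of topics, p = number of pseudo-documents.
    model = tp.PTModel(k=k, p=p, seed=seed)
    for tokens in tokenized_docs:
        model.add_doc(tokens)
    model.train(iterations)

    # One list of (word, probability) pairs per topic.
    return [model.get_topic_words(topic_id, top_n=5) for topic_id in range(model.k)]
```

Calling `train_ptm(docs)` with a list of token lists should return one list of top words per topic; on a real corpus, `k` and `p` would need tuning.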

jonaschn, Apr 07 '21 22:04