tomotopy
Interest in adding BiTerm Model (or other models for short texts)
LDA and its variants are known to perform poorly on short texts. Is a model like the Biterm Topic Model (Yan, Guo, Lan & Cheng, 2013) in the plans, or would there be interest in adding one?
Thanks!
Yan, X., Guo, J., Lan, Y., & Cheng, X. (2013, May). A biterm topic model for short texts. In Proceedings of the 22nd international conference on World Wide Web (pp. 1445-1456).
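For readers unfamiliar with the idea: the Biterm Topic Model works on unordered word pairs ("biterms") that co-occur in the same short text, pooled over the whole corpus, instead of per-document word counts. Below is a minimal sketch of the biterm extraction step only, in plain Python. It is not tomotopy code and not a topic model; the function name `extract_biterms` and the toy documents are made up for illustration.

```python
from collections import Counter
from itertools import combinations


def extract_biterms(tokens):
    """Return every unordered word pair (biterm) from one short text.

    Hypothetical helper for illustration; the whole text is treated
    as a single context window.
    """
    # Sort each pair so ("a", "b") and ("b", "a") count as the same biterm.
    return [tuple(sorted(pair)) for pair in combinations(tokens, 2)]


# Toy corpus of already-tokenized short texts (made-up data).
docs = [
    ["apple", "iphone", "release"],
    ["apple", "fruit", "juice"],
]

# Pool biterms over the whole corpus; a biterm model is fit on these
# corpus-level co-occurrences rather than on sparse per-document counts.
biterm_counts = Counter(b for doc in docs for b in extract_biterms(doc))

for biterm, count in sorted(biterm_counts.items()):
    print(biterm, count)
```

A text with n tokens yields n(n-1)/2 biterms, so even very short texts contribute several co-occurrence observations.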
Version 0.11.0 added the Pseudo-document based Topic Model (`tomotopy.PTModel`), which is designed for short texts.
Zuo, Y., Wu, J., Zhang, H., Lin, H., Wang, F., Xu, K., & Xiong, H. (2016, August). Topic modeling of short texts: A pseudo-document view. In Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining (pp. 2105-2114).