Kohei Watanabe
Kohei Watanabe
`topicmodels::LDA` is implemented using this library, which I can call directly via Rcpp: https://sourceforge.net/projects/gibbslda/files/ We can call the library in this way https://github.com/cran/topicmodels/blob/ade6dc5698f385ad222fd28aa8e90c1a4bd33cf5/R/lda.R#L134-L155 There are a lot of things going...
I noted that ISO-8859-1(Latin 1) is set as character encoding for all the European languages: https://tutorials.quanteda.io/import-data/multiple-files/ It is inappropriate because many of the European languages are not in Latin 1...
We have pages on [how to process texts in different languages](https://tutorials.quanteda.io/language-specific/). We want to cover all the languages but the following are the languages with priority. - [x] Japanese -...
It should return a `list()` of topics.
It can be called sigma. Total sigma could be used for measuring the model fit. https://github.com/koheiw/seededlda/blob/338509e7bebae69690aee24fc21dac7fc9bf2711/src/lda.h#L321
I wonder if we can help people computing inner-products by adding "product". For example https://stackoverflow.com/questions/40228592/fastest-way-to-compute-row-wise-dot-products-between-two-skinny-tall-matrices-in On the [add-product](https://github.com/koheiw/proxyC/tree/add-product) branch, ``` r r Matrix 3.938076 4.255705 4.371230 4.417503 4.530834 4.891848 10...
Add TR dictionary
I often analyze different languages in the same project. Is there a way to specify which language model to use (Japanese or Korean)? I could do this by changing `sys_dic`,...