Dmitry Selivanov

180 comments by Dmitry Selivanov

Thanks Matt, I've switched to an external script for now. Do you think it is worth handling cross-device file copy/move within `nsq_to_file`?
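For context on the cross-device question: a plain rename fails with `EXDEV` when source and destination live on different filesystems, so the usual workaround is copy, then delete. The sketch below is only an illustration in Python, not `nsq_to_file` code (that tool is written in Go); the helper name `move_file` and the `.partial` suffix are assumptions made for the example.

```python
import errno
import os
import shutil


def move_file(src, dst):
    """Move src to dst; fall back to copy + delete across devices.

    Hypothetical helper for illustration only.
    """
    try:
        # Fast path: works when src and dst are on the same filesystem.
        os.rename(src, dst)
    except OSError as exc:
        if exc.errno != errno.EXDEV:
            raise  # some other failure, do not hide it
        # Cross-device: copy to a temporary name next to dst first,
        # so readers never see a half-written destination file.
        tmp = dst + ".partial"
        shutil.copy2(src, tmp)
        os.replace(tmp, dst)  # same filesystem as dst, so this rename works
        os.remove(src)
```

Python's standard `shutil.move` already applies a similar fallback; the explicit version just makes the `EXDEV` branch visible.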

One issue with openNLP is that it is written in Java, so R integration is usually painful: a lot of data transfer, rJava installation problems, etc...

NB transformation would be a nice feature (as well as #123).

Thanks! I have been subscribed to the offconvex blog for quite some time :-) Another thing I want to try: http://www.offconvex.org/2016/07/10/embeddingspolysemy/. I even created the [rksvd](https://github.com/dselivanov/rksvd) repo to port the k-SVD algorithm, but...

@good-marketing, that's easy with a little bit of linear algebra :-) (however, I will probably create a model for this). Below I will assume you already have `dtm`, a document-term matrix...

@good-marketing splitting documents into sentences is the way to go. So we just change the level of granularity of our analysis. I think this approach is 100% correct, I would go...
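A minimal sketch of what changing the granularity can look like, assuming a naive punctuation-based splitter (real text usually needs a proper sentence tokenizer). The names `split_sentences`, `docs`, and `units` are hypothetical and written in Python purely for illustration; each sentence keeps the id of the document it came from, so results can be mapped back later.

```python
import re


def split_sentences(doc):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", doc.strip())
    return [p for p in parts if p]


# Toy corpus, made up for the example.
docs = ["GloVe is fast. SGNS is similar!", "One sentence only."]

# Each unit of analysis is now (document id, sentence) instead of a document.
units = [
    (doc_id, sentence)
    for doc_id, doc in enumerate(docs)
    for sentence in split_sentences(doc)
]
```

Any downstream step that took one row per document would then take one row per sentence instead.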

Yes, sure you can use it. Tokenizers just wraps the `stringi` package and provides a slightly more convenient interface for tokenization.

Thanks! The article looks very interesting. In my experience, SGNS and GloVe usually perform very similarly, but it would be interesting to compare them in more detail.

Yes, I remember this. But the clear advantage of GloVe is that its complexity is O(nnz) instead of O(D^2). As I understand it, the proposed SGNS and SGNS-LS also suffer from having complexity...