Kenneth Benoit

Results 308 comments of Kenneth Benoit

Add a new section on language models, that include word2vec or GloVe vectors, including ways to get them from spaCy models. Some packages that include transformers/huggingface library access now: -...

Can you show your output please?

You probably mean ```r > library("quanteda") > dfmat tmod summary(tmod) Length Class Mode sv 7 -none- numeric nd 1 -none- numeric rownames 14 -none- character rowmass 14 -none- numeric rowdist...

See https://github.com/r-lib/pkgdown/issues/1251

Could add a link to this page, a German-language tutorial set up by Cornelius Puschmann: http://inhaltsanalyse-mit-r.de/0_einleitung.html

Since the udpipe output is almost the same structure as that from `spacyr::spacy_parse()`, it can use the method for `as.tokens.spacyr_parsed()`. If you want the POS tag appended, you have to...

I thought of that too, then we would just need the equivalents of `docid_field = "doc_id", tokenid_field = "token", pos_field = "pos"` etc. The **udpipe** output almost matches the `spacyr_parsed`...

Thanks. More generally (and basically): ```r str(corpus("this is my single document")) ## Error in `[[.corpus`(object, 1L) : ## cannot index docvars this way because none exist ```

But keep in mind this, from `?corpus`: > #### A warning on accessing corpus elements > > A corpus currently consists of an S3 specially classed list of elements, but...

@conjugateprior Refresh with the latest GitHub version and try it now.