Kenneth Benoit

Results 308 comments of Kenneth Benoit

I propose the same look (colour, font) as [quanteda](https://github.com/kbenoit/quanteda)

See the stuff I started in https://github.com/kbenoit/quanteda/tree/dev_unicodeNorm for addressing this issue.

Here's the only function from that branch, which you can work into **readtext**: (and now I can delete the branch from **quanteda**: ``` r ## internal function to perform unicode...

Putting this issue aside for now; see branch [`feature/unicode_normalisation`](https://github.com/kbenoit/readtext/tree/feature/unicode_normalisation).

Not a bad idea. What format would this take? Would it write one .txt file per document, to a specified output folder? For instance: ```r writetext(x, path) ``` where the...

Great idea. There is a package called https://github.com/michaelgavin/tei2r/tree/master/R, but it looks pretty inactive.

Looks great except the encoding issue: ``` Here are some special characters: �t �rv�ztqrQ �tvef�r�g�p, which means five flood resistant hammer drills (J) in Hungarian. ```

yes just opened an issue - moved to https://github.com/ropensci/unrtf/issues/1.

Good suggestions, thanks. You're talking about familiarity with the command line and how operating systems provide file locations, but we could provide examples for Windows and other operating systems.

It wasn't spaces, it was "invalid" files caused by macOS and the "Compress" option from Finder. ```bash KBMacBook:readtext kbenoit$ unzip -l ~/Dropbox/QUANTESS/corpora/Presidential\ Debates/presidential_debates_2015-2016.zip Archive: /Users/kbenoit/Dropbox/QUANTESS/corpora/Presidential Debates/presidential_debates_2015-2016.zip Length Date Time Name...