jamesdbaker
jamesdbaker
The Dockerfile works fine, and whilst I haven't tested the docker-compose or configs they look like they should work too. However, I'd suggest changing how the Docker aspects are documented,...
There is a Gitter chat room which used to be used, but hasn't been used much recently (I notice the link to it has vanished from the README for some...
Alternatively, could add an entry to the manifest to at least declare the module name
I believe this is the underlying issue in cld2: https://github.com/mikemccand/chromium-compact-language-detector/issues/22
I've used the following command to replace control characters from my dataset, using the list of characters provided by @alexgarel above. `sed 's/[\00\01\02\03\04\05\06\07\08\0b\0e\0f\10\11\12\13\14\15\16\17\18\19\1a\1b\1c\1d\1e\1f\7f\80\81\82\83\84\85\86\87\88\89\8a\8b\8c\8d\8e\8f\90\91\92\93\94\95\96\97\98\99\9a\9b\9c\9d\9e\9f]//' input.txt > output.txt` Posting it here in...