punkt
Unsupervised multilingual sentence segmentation.
Update dependencies and make it build with Stack
It seems that line breaks confuse Punkt. ``` segment_test :: [FilePath] -> IO () segment_test file = do content segment_test "/Users/ar/work/cpdoc/dhbb-nlp/raw/1.raw" "\171Jos\233 Machado Coelho de Castro\187 nasceu em Lorena (SP).\nEstudou...
In the tests, I found references to the Brown corpus. Can you specify the original URL for downloading the corpus?
It wasn't installing properly, so I fixed some ambiguous type errors that were caused by OverloadedStrings.
While installing with Stack I get the following error: ``` Configuring punkt-0.1.1... Building punkt-0.1.1... Preprocessing library punkt-0.1.1... [1 of 2] Compiling NLP.Punkt.Match ( lib/NLP/Punkt/Match.hs, .stack-work/dist/x86_64-linux/Cabal-1.22.4.0/build/NLP/Punkt/Match.o ) lib/NLP/Punkt/Match.hs:24:5: Non type-variable argument...