Thomas Proisl
Thomas Proisl
I did a few timing experiments and this is what I found: - Start-up overhead for the 'spawn' method scales linearly with the number of processes; i.e. creating 8 worker...
Of course, ten sentences containing three instances (with two of them belonging together) is not enough to get a robust estimate for the tagging accuracy. Nevertheless, let's take a closer...
Der Konstruktor hat jetzt die Option `use_nfkc` und die Kommandozeilenschnittstelle die Option `--use-nfkc`. Damit wird der Input für die interne Repräsentation nach NFKC normalisiert. Für die Ausgabe werden die originalen...
Could you share the first few couple of lines from your input file?
I don’t know if GitHub messed with the formatting, but it seems like the third token is a newline character? The `txtcomplexity` tool assumes that token information is on a...
Hi, [this comment](https://github.com/tsproisl/textcomplexity/issues/2#issuecomment-919163369) tries to give a starting point for using the package from Python. Is that enough to get you started?
Thank you. Did you use the `txtcomplexity` script? Since most of the measures are dependent on the length of the input text, the script divides each text into parts (“windows”)...
The input you are using is actually not an example for the CoNLL-U format but for the custom tsv format (I've tried to make this clearer in the README). This...
This problem seems to occur if you haven't used stanza before and there is no resources.json file, yet. I've updated the script to check for the file and to download...
Unfortunately, there is no documentation yet on how to use the module from Python. The code in [cli.py](https://github.com/tsproisl/textcomplexity/blob/master/textcomplexity/cli.py) implements the command line interface and could serve as inspiration. I put...