John Bauer
John Bauer
This should be fixed on 1.11.0
Can you say more about how it is failing? My first guess is that it's because of this change: https://github.com/stanfordnlp/CoreNLP/commit/461db9114d2b1d851cf1572c6139f21d0042d9aa Used to be, you could go to any page that...
Following up on some old issues. Did downgrading help, or was it easy enough to update the URLs internally, or is there some other work needed here?
Thanks, that's a useful observation. We can add it to the training data for MWT, and I'll have a new model ready probably Monday or so. If you find others...
I'm trying to figure out - why would the tokenized `dar` not have an accent, but `decír` does? Generally speaking, the GSD treebank we base the Spanish models from doesn't...
Actually it seems the standard in GSD is to keep the text intact but remove the accents in the lemma. Some part of me wonders if that means we can...
Hey, ran into an issue or two with the Spanish GSD dataset. Once we get that cleaned up I'll retry the models tomorrow. Guess it's time to put on my...
Gah, I apologize for how long this is taking. So in the one treebank I was looking at, GSD, the tokens keep the accents after splitting. The same is true...
Got a bunch of data improvements from the UD team. I added the examples above and recreated all of the Spanish models as combined models with both AnCora and GSD....
Now available on 1.9.0