PyStanfordDependencies
Handle PTB trees with Unicode words in them
This could be done either by fixing the encoding issues or by temporarily replacing the Unicode words with dummy ASCII words.
Thanks to Karin M. Sim Smith for the report.
Temporary workaround: If possible, don't pass trees with Unicode words in them. This should be safe, since Stanford Dependencies generally doesn't care about the words in the trees, and the few words it does care about are ASCII.
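A rough sketch of the dummy-word workaround, in case it helps anyone hitting this. It assumes Python 3 `str` input and one-line PTB bracketing where every leaf looks like `(TAG word)`. The names `mask_non_ascii_words`, `unmask_word`, and the `UNICODEWORD` prefix are made up for this example and are not part of PyStanfordDependencies. It also assumes no real word in the tree already looks like `UNICODEWORD0`.

```python
import re

# Matches a preterminal such as "(NNP word)": a tag followed by a single word.
_LEAF = re.compile(r"\(([^\s()]+)\s+([^\s()]+)\)")


def mask_non_ascii_words(tree):
    """Replace non-ASCII leaf words with ASCII placeholders (hypothetical helper).

    Returns the masked tree string and a dict mapping each placeholder
    back to the original word.
    """
    mapping = {}

    def _replace(match):
        tag, word = match.group(1), match.group(2)
        if all(ord(ch) < 128 for ch in word):
            return match.group(0)  # pure ASCII leaf: leave untouched
        placeholder = "UNICODEWORD%d" % len(mapping)
        mapping[placeholder] = word
        return "(%s %s)" % (tag, placeholder)

    return _LEAF.sub(_replace, tree), mapping


def unmask_word(word, mapping):
    """Map a placeholder back to its original word; other words pass through."""
    return mapping.get(word, word)
```

The masked string would be what gets converted, and `unmask_word` would then be applied to each word form in the resulting dependencies to restore the originals.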