PyStanfordDependencies icon indicating copy to clipboard operation
PyStanfordDependencies copied to clipboard

Handle PTB trees with Unicode words in them

Open dmcc opened this issue 10 years ago • 0 comments

Either by fixing the encoding issues or temporarily replacing them with dummy ASCII words.

Thanks to Karin M. Sim Smith for the report.

Temporary workaround: If possible, don't pass trees with Unicode words in them. This should be safe since Stanford Dependencies generally don't care about the words in the trees and the few words that it does care about are in ASCII.

dmcc avatar Jan 20 '15 15:01 dmcc