When trying to extract text from non English PDF, characters are not extracted proper. Sample PDF attached 3.pdf