paper-qa
paper-qa copied to clipboard
maybe_is_text() failed in Chinese or other language string
string.printable only check ASCII for other language, maybe decode with Unicode, but before that, should detect the encoding reference: https://chardet.readthedocs.io/en/latest/usage.html#basic-usage
by the way, Is it necessary to check maybe_is_text()?
You can add disable_check=True to docs.add