ChanghaoLau

Results 2 issues of ChanghaoLau

I get garbled characters when parsing pdf file. The file I use is [this](http://www.aas.net.cn/fileZDHXB/journal/article/zdhxb/2012/8/PDF/20120812.pdf). There may be encoding issues? ## Environment ```bash $ python -m platform Linux-4.18.0-147.5.1.6.h841.eulerosv2r9.x86_64-x86_64-with-glibc2.17 $ python -c...

workflow-text-extraction
Has MCVE

I want to extract the table in .docx file into markdown format, while maintaining the position of the table in the document. So I can't use `python-docx` `document.paragraghs` and `document.tables`...