
`TypeError: expected string or buffer` when .doc is converted to .docx with MS Office in Windows

Open rejuashes opened this issue 8 years ago • 2 comments

pydocx_html_windows_error.txt

Hi Guys,

I am facing a situation where pydocx.to_html behaves differently on the same .doc file depending on how it was converted to .docx.

Scenario 1: the .doc file is converted to .docx using LibreOffice on Linux (saved as Microsoft Word 2007/2010/2013 XML). This works fine.

Scenario 2: the .doc file is converted to .docx using MS Office on Windows. This throws an error:

        return re.match('^\s*([^\s]+)\s*(.*)$', self.instr)
      File "/usr/lib/python2.7/re.py", line 137, in match
        return _compile(pattern, flags).match(string)
    TypeError: expected string or buffer

Any pointers would be helpful.

regards,

Rajith

rejuashes, Jun 21 '16 06:06

Hi,

Thanks for the issue report! Could you attach the .docx that was produced by MS Office on Windows and is throwing the error?

Thanks,

-Kyle

kylegibson, Jun 21 '16 16:06

Hi Kyle,

Attaching the original source .doc file which was converted to .docx.

regards,

rajith

ABC.zip

rejuashes, Jun 22 '16 06:06
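
For readers hitting the same traceback: `re.match` raises `TypeError` when its second argument is not a string, so one plausible reading is that `self.instr` was `None` for some field in the Word-generated file. The thread does not confirm this. The sketch below only illustrates that failure mode and one possible guard. It is written for Python 3 (where the message reads "expected string or bytes-like object"), and `parse_field_instruction` and `raises_type_error` are hypothetical helpers, not part of pydocx.

```python
import re

# The pattern from the traceback, written as a raw string and precompiled.
FIELD_PATTERN = re.compile(r'^\s*([^\s]+)\s*(.*)$')


def parse_field_instruction(instr):
    """Hypothetical helper: split a field instruction into (type, args).

    Returns None instead of raising when instr is missing or not text.
    On Python 2.7 the type check would be against basestring instead of str.
    """
    if not isinstance(instr, str):
        return None
    match = FIELD_PATTERN.match(instr)
    if match is None:
        return None
    return match.group(1), match.group(2)


def raises_type_error(value):
    """Hypothetical helper: report whether an unguarded re.match fails on value."""
    try:
        re.match(r'^\s*([^\s]+)\s*(.*)$', value)
    except TypeError:
        return True
    return False


if __name__ == "__main__":
    print(raises_type_error(None))        # unguarded call fails on None
    print(parse_field_instruction(None))  # guarded call returns None
    print(parse_field_instruction('HYPERLINK "http://example.com"'))
```

Whether skipping such a field is the right behaviour for the converter is a separate design question. The guard only shows how to avoid the crash while the offending document is investigated.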