Add .doc .ppt (Word and PowerPoint 97/2003 formats)

Open Fabio3rs opened this issue 2 years ago • 2 comments

To avoid manually converting documents saved in the old formats, make the loader accept these as well.

Fabio3rs avatar May 17 '23 15:05 Fabio3rs

Thanks for adding those - I was not sure that these filters worked for doc/ppt as well.

I tried to keep the lists in alphabetical order - doc comes before docx, and ppt before pptx. Keeping that order avoids spurious diffs later if someone decides to sort them.

mdeweerd avatar May 18 '23 01:05 mdeweerd

Thanks, I changed the order.

Seems to work fine here with the version pinned in requirements.txt. Apparently the library converts the files to docx/pptx in the background before loading them.

Fabio3rs avatar May 18 '23 01:05 Fabio3rs
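
For readers following along, here is a minimal sketch of the idea discussed above: an extension-to-loader table kept in alphabetical order, with the legacy formats routed to the same handler as their newer counterparts. The names `LOADER_MAPPING`, `loader_for`, `is_sorted`, and the placeholder loader strings are assumptions for illustration only; they are not taken from the project's code.

```python
from pathlib import Path

# Hypothetical extension-to-loader table. The string values are placeholders
# standing in for real loader classes; they are not the project's actual names.
# Keys are kept in alphabetical order: .doc before .docx, .ppt before .pptx.
LOADER_MAPPING = {
    ".doc": "word_loader",
    ".docx": "word_loader",
    ".ppt": "powerpoint_loader",
    ".pptx": "powerpoint_loader",
}


def loader_for(path: str) -> str:
    """Return the placeholder loader name for a file, based on its extension."""
    ext = Path(path).suffix.lower()
    if ext not in LOADER_MAPPING:
        raise ValueError(f"Unsupported file extension: {ext!r}")
    return LOADER_MAPPING[ext]


def is_sorted(mapping: dict) -> bool:
    """Check that the table's keys are in alphabetical order."""
    keys = list(mapping)
    return keys == sorted(keys)
```

A check like `is_sorted(LOADER_MAPPING)` could run in a test, so that a later automatic sort of the list produces no diff.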