langchain
feat: add loader for open office odt files
ODF File Loader
Adds a data loader for handling Open Office ODT files. Requires unstructured>=0.6.3.
Testing
The following should work using the fake.odt example doc from the unstructured repo.
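from langchain.document_loaders import UnstructuredODTLoader

# "elements" mode: one document per parsed element
loader = UnstructuredODTLoader(file_path="fake.odt", mode="elements")
docs = loader.load()

# "single" mode: the whole file as one document
loader = UnstructuredODTLoader(file_path="fake.odt", mode="single")
docs = loader.load()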
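
For reviewers unfamiliar with the two modes, here is a rough, standard-library-only sketch of how they differ. It does not call langchain or unstructured; `Doc` and `to_documents` are hypothetical names made up for this illustration, the element strings are placeholders rather than the contents of fake.odt, and the "\n\n" separator is my understanding of how the Unstructured-based loaders join elements in "single" mode.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Doc:
    """Hypothetical stand-in for a LangChain Document (illustration only)."""
    page_content: str
    metadata: Dict[str, str] = field(default_factory=dict)


def to_documents(elements: List[str], source: str, mode: str = "single") -> List[Doc]:
    """Turn already-parsed text elements into documents according to `mode`."""
    if mode == "elements":
        # One document per parsed element (title, paragraph, list item, ...).
        return [Doc(text, {"source": source}) for text in elements]
    if mode == "single":
        # All elements joined into one document (separator is an assumption).
        return [Doc("\n\n".join(elements), {"source": source})]
    raise ValueError(f"mode {mode!r} is not supported")


# Placeholder elements, not the real contents of fake.odt.
elements = ["First paragraph.", "Second paragraph."]
per_element = to_documents(elements, "fake.odt", mode="elements")
combined = to_documents(elements, "fake.odt", mode="single")
```

With the placeholder input, `per_element` holds two documents and `combined` holds one, which is the shape difference to expect between the two `loader.load()` calls above.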
Is it possible to add tests in tests/integration_tests/document_loaders/test_odt.py
and an example in the Jupyter notebook?
@leo-gan - Sure thing! Just added.