agentai

Add multiple docs format support

Open NirantK opened this issue 2 years ago • 0 comments

Use: https://github.com/Unstructured-IO/unstructured

For reference, look at how SQLite Utils is organised. These are optional, and someone not using them never needs to care about them.

Desired Workflow: Someone points to a PDF file → we read it → chunk it with context → convert it to a JSON/Python object
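A rough sketch of that workflow, to make the discussion concrete. It assumes `unstructured`'s `partition` function for reading the file. The names `Chunk`, `read_elements`, `chunk_with_context` and `to_json`, along with the size limits, are hypothetical and not part of any existing API. "Context" here is interpreted as carrying over the tail of the previous chunk, which is only one possible reading of the requirement.

```python
import json
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class Chunk:
    index: int
    text: str
    context: str  # tail of the previous chunk (assumed meaning of "context")


def read_elements(path: str) -> List[str]:
    """Read a document into text blocks using unstructured."""
    # Imported lazily so this module still loads without the optional dependency.
    from unstructured.partition.auto import partition

    return [el.text for el in partition(filename=path) if el.text]


def chunk_with_context(
    blocks: List[str], max_chars: int = 500, context_chars: int = 100
) -> List[Chunk]:
    """Greedily pack text blocks into chunks of at most roughly max_chars."""
    texts: List[str] = []
    current = ""
    for block in blocks:
        # Start a new chunk when adding this block would exceed the limit.
        if current and len(current) + 1 + len(block) > max_chars:
            texts.append(current)
            current = block
        else:
            current = f"{current} {block}" if current else block
    if current:
        texts.append(current)

    chunks = []
    for i, text in enumerate(texts):
        # Attach the end of the previous chunk as context, if any.
        has_context = i > 0 and context_chars > 0
        context = texts[i - 1][-context_chars:] if has_context else ""
        chunks.append(Chunk(index=i, text=text, context=context))
    return chunks


def to_json(chunks: List[Chunk]) -> str:
    """Serialise chunks to a JSON array of objects."""
    return json.dumps([asdict(c) for c in chunks])
```

Usage would look something like `to_json(chunk_with_context(read_elements("paper.pdf")))`. Note that a single block longer than `max_chars` is kept whole rather than split, so the limit is soft.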

Problem Motivation: Albus

Remember to mark this as an optional dependency in pyproject.toml
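On the code side, one way to keep the dependency optional is a small import guard, sketched below. `require_optional` is a hypothetical helper, and both the extra name `docs` and the install name `agentai` in the error message are assumptions; whatever is declared in pyproject.toml should be used instead.

```python
import importlib
from types import ModuleType


def require_optional(module_name: str, extra: str) -> ModuleType:
    """Import an optional module, or fail with an install hint."""
    try:
        return importlib.import_module(module_name)
    except ImportError as exc:
        # Package and extra names below are assumptions for illustration.
        raise ImportError(
            f"'{module_name}' is an optional dependency. "
            f"Install it with: pip install 'agentai[{extra}]'"
        ) from exc
```

Calling `require_optional("unstructured", "docs")` only inside the document-loading code path means users who never load documents never hit the import.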

NirantK, Jul 20 '23 22:07