It doesn't work very well with this PDF.
Here's an example of a PDF it doesn't handle well at all: https://nysirestakes.com/backend/News/news_upload/2023_Breeders_Award_12123_1706.pdf
Sample prompt: What are the headers of the table?
I appreciate you reporting this! Yeah, I don't think it does too well with tables, to be honest, since I pass everything in as plain text. Perhaps a feature to implement would be detecting tables and embedding them in a specific format.
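For what it's worth, here is a rough sketch of what "embedding them in a specific format" could look like. It assumes the table has already been detected and extracted as a list of rows (for example, pdfplumber's `page.extract_tables()` returns data in that shape), and it picks Markdown as the format, which is just one option. The function name `table_to_markdown` and the sample rows are made up for illustration; none of this is in the project yet.

```python
def table_to_markdown(rows):
    """Render extracted table rows as a Markdown table.

    `rows` is a list of lists; the first row is treated as the header.
    Cells may be None (extractors often return None for empty cells).
    """
    if not rows:
        return ""

    width = max(len(row) for row in rows)

    def clean(cell):
        # Flatten line breaks and escape pipes so each cell stays on one line.
        if cell is None:
            return ""
        return str(cell).replace("\n", " ").replace("|", "\\|").strip()

    # Pad short rows so every row has the same number of columns.
    padded = [[clean(c) for c in row] + [""] * (width - len(row)) for row in rows]

    header, body = padded[0], padded[1:]
    lines = [
        "| " + " | ".join(header) + " |",
        "| " + " | ".join(["---"] * width) + " |",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines)


# Hypothetical rows, not taken from the PDF above.
sample = [["col_a", "col_b"], ["1", None], ["2", "x|y"]]
markdown_table = table_to_markdown(sample)
```

The idea is that the header row stays attached to its columns when the chunk is embedded, so a prompt like "What are the headers of the table?" has something explicit to match against instead of a run of loose text.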
Yeah, it's a feature to implement; detecting tables would be nice to have.
I also tried this with a research paper and it didn't work well. The PDF had tables, charts, and text. The model seemed to be hallucinating.
How about using a custom document loader like Unstructured? Unstructured is also available in LangChain.
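A minimal sketch of how that might be wired in, assuming the `langchain_community` package and its `UnstructuredPDFLoader` with `mode="elements"`, which returns one document per detected element. I'm also assuming that table elements carry an HTML rendering under `metadata["text_as_html"]` when the `hi_res` strategy is used; that depends on the installed Unstructured version, so treat it as something to verify. The helper names and the `[TABLE]` tag are hypothetical.

```python
def load_pdf_elements(path, strategy="hi_res"):
    """Load a PDF as one document per element (title, paragraph, table, ...)."""
    # Imported lazily so the helpers below work without the dependency installed.
    from langchain_community.document_loaders import UnstructuredPDFLoader

    loader = UnstructuredPDFLoader(path, mode="elements", strategy=strategy)
    return loader.load()


def element_to_text(category, text, html=None):
    """Choose the text to embed for a single element.

    Tables keep their HTML structure when it is available (assumption:
    the loader supplied it); everything else falls back to plain text.
    """
    if category == "Table" and html:
        return "[TABLE]\n" + html
    return text


def docs_to_chunks(docs):
    """Convert loader output into a list of strings ready for embedding."""
    return [
        element_to_text(
            doc.metadata.get("category"),
            doc.page_content,
            doc.metadata.get("text_as_html"),
        )
        for doc in docs
    ]


# Usage (needs the unstructured PDF extras installed; the file name is a placeholder):
# chunks = docs_to_chunks(load_pdf_elements("report.pdf"))
```

This only changes what gets embedded for tables. Charts would still come through as whatever text the loader can recover from them, so it probably won't fix the hallucinations on chart-heavy papers by itself.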