datapusher-plus
Add PDF to supported formats; summarize content and extract tags using LLM
The legacy Datapusher supported PDFs because messytables could extract tables from them using pdftables.
That functionality has since been removed, along with Excel support.
We re-enabled Excel support in DP+ using qsv.
We should re-enable PDF support as well. For now the goal is not to extract tables (though tabula-rs exists for that), but to summarize the content for the Description field and suggest tags.
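
As a rough illustration of the summarize-and-tag step, here is a minimal Python sketch. It assumes the PDF text has already been extracted by some other step, and that the LLM is reachable through a plain callable taking a prompt string and returning a string. The names `build_prompt`, `summarize_pdf_text`, `llm`, and `MAX_CHARS`, as well as the JSON reply format, are hypothetical and not part of DP+ or any existing API.

```python
import json
from typing import Callable

# Hypothetical cap so the prompt stays within a model's context window.
MAX_CHARS = 8000

# Hypothetical prompt; the JSON reply shape is an assumption of this sketch.
PROMPT_TEMPLATE = (
    "Summarize the following document in at most three sentences and "
    "suggest up to {max_tags} short tags.\n"
    'Reply with JSON only: {{"description": "...", "tags": ["...", "..."]}}\n\n'
    "Document:\n{text}"
)


def build_prompt(text: str, max_tags: int = 5) -> str:
    """Build the LLM prompt from already-extracted PDF text (truncated)."""
    return PROMPT_TEMPLATE.format(max_tags=max_tags, text=text[:MAX_CHARS])


def summarize_pdf_text(
    text: str, llm: Callable[[str], str], max_tags: int = 5
) -> dict:
    """Return {"description": str, "tags": list[str]} for the given text.

    `llm` is any callable mapping a prompt to the model's raw reply.
    Malformed replies yield an empty description and no tags.
    """
    empty = {"description": "", "tags": []}
    raw = llm(build_prompt(text, max_tags))
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return empty
    if not isinstance(data, dict):
        return empty

    description = str(data.get("description", "")).strip()

    raw_tags = data.get("tags")
    if not isinstance(raw_tags, list):
        raw_tags = []

    # Normalize tags: trim, lowercase, drop blanks and duplicates.
    tags = []
    for tag in raw_tags:
        tag = str(tag).strip().lower()
        if tag and tag not in tags:
            tags.append(tag)

    return {"description": description, "tags": tags[:max_tags]}
```

The returned description and tags would still be suggestions; whether they are written to the Description field automatically or shown to the publisher for review is a separate design decision.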