obsidian-text-extractor icon indicating copy to clipboard operation
obsidian-text-extractor copied to clipboard

[BUG] PDFs are not re-extracted after they're updated

Open scambier opened this issue 2 years ago • 2 comments

Problem description:

https://github.com/scambier/obsidian-omnisearch/discussions/291#discussioncomment-7032086

Your environment:

  • Plugin version:
  • Obsidian version:
  • Operating system:
  • Number of images/PDFs in your vault (approx.):
  • Other plugins that may be related to the issue:

scambier avatar Sep 18 '23 11:09 scambier

  • A PDF is only extracted on demand (e.g. when Omnisearch wants to index it, or when using the contextual menu)
  • Once Omnisearch has built its cache, it won't re-extract (or hit the cache of) a PDF unless it appears in a result

A solution would be to add a "automatically extract" opt-in setting in Text Extractor, that would keep the cache up-to-date

scambier avatar Sep 30 '23 13:09 scambier

@scambier, what do you think about adding the modification time to the cache file name?

While it may result in a larger number of cache files, it seems like a simpler solution.

demig00d avatar Jan 17 '24 14:01 demig00d