HyperTag
HyperTag copied to clipboard
HyperTag - Intuitive Knowledge Management WebApp & CLI for Humans using Deep Learning & Tags
Depends on #41
First basic version: Partition video into e.g. 16 uniformly spaced (by time) sections and take a screenshot. Embed each screenshot and use average as video embedding. Advanced: Partition video with...
Right now text documents are represented as a single average embedding of all their sentences. Increase granularity / signal by vectorizing individual pages. Related to #25
This will make HyperTag accessible for a broader audience
Match semantically very similar words. For example if files are tagged with science and research is queried it should match. Definitely add a toggle to turn this feature off as...
Tesseract: - https://github.com/tesseract-ocr/tesseract/issues/263#issuecomment-536197289 Even better: Find a solid GPU accelerated OCR implementation: - https://github.com/jaidedai/easyocr (looks promising but rly aweful CPU performance and too big model sizes for my lil GPU...
When a new file is added, automatically infer tags from semantically similar existing files tags. Depends on #24
Powered by CLIP
https://textract.readthedocs.io/en/stable/#currently-supporting