clip-retrieval
clip-retrieval copied to clipboard
expand clip filter
- [ ] also copy texts
- [ ] add options to check image/text matching
--matching_threshold 0.2
also allow filtering by an image ? a set of image/text ?
outputting filtered parquet files would make a lot of sense
(and possibly webdatasets)