tom white comments

Results: 70 comments of tom white

minimizing difference across image and text features

We have a version of BLIP loss that we plan on adding to an upcoming release. So far in our testing the BLIP guided loss works but doesn't "outperform" CLIP...

minimizing difference across image and text features

Thanks for the feedback! Glad to work on this a bit more with you if you are interested. In response to your questions: 1) This current version uses ITM only...

Is there a simple way to get inference results on video files that are not part of the considered data sets?

I made a standalone utility to do this - it can run inference on any of their three pre-trained models given an input sequence of image files. You can find...

Add "owner" parameter to html/svg

I like the idea of supporting this feature exactly as proposed (owner=tzXXX) in the context of 1/1 OBJKTs where there is always a single owner to any particular token.

debugging custom models

Thanks heaps @mrapplexz - this is indeed working well for me. So far I'm surprised how powerful even 100 iterations of fine tuning is (I'll probably tweak the learning rate...

Seeds option

(this has been updated to be compatible with the now merged 0.2 branch)

Seeds option

I am willing to make this compatible with 0.3, but am going to wait until there is some feedback indicating this is a desired feature in the main release.

Seeds option

OK, without pulling my punches, here are my thoughts: - the hidden power of this library is that it can potentially learn to map between any two image datasets, thought...

regression

I ended up not taking this route but instead adapted the dataset to be classes (positive / negative). I did find a more detailed read of issues trying to use...

added decode_predictions2() to utils.py

Here's the `vggface2_class_index.json` to support this new function. This file could eventually replace the need for `rcmalli_vggface_labels_v2.npy` (it contains the same label information but also includes class-ids) and is in...
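As a rough illustration of how a class-index file of this kind could be used to decode predictions, here is a minimal sketch. The JSON layout (`{"index": [class_id, name]}`, borrowed from the Keras ImageNet convention), the helper name `decode_top_k`, and the sample entries are all assumptions for illustration; the actual contents of `vggface2_class_index.json` and the signature of `decode_predictions2()` are not shown in the comment above.

```python
import json


def decode_top_k(preds, class_index, top=5):
    """Return the top-k (class_id, name, score) tuples for each prediction row.

    `class_index` is assumed to map a stringified output index to a
    [class_id, name] pair; this layout is a guess, not the confirmed format.
    """
    results = []
    for row in preds:
        # Indices of the highest-scoring classes, best first.
        order = sorted(range(len(row)), key=lambda i: row[i], reverse=True)[:top]
        results.append(
            [(class_index[str(i)][0], class_index[str(i)][1], row[i]) for i in order]
        )
    return results


# Hypothetical stand-in for the contents of the real JSON file.
class_index = json.loads(
    '{"0": ["n000001", "Person_A"],'
    ' "1": ["n000002", "Person_B"],'
    ' "2": ["n000003", "Person_C"]}'
)

# One row of made-up softmax scores over three classes.
preds = [[0.1, 0.7, 0.2]]
decoded = decode_top_k(preds, class_index, top=2)
```

Keeping the class-id next to the name in a single file is what would let one lookup return both, rather than needing a separate labels array.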