Ross Wightman comments

Results: 523 comments of Ross Wightman

1 Channel images not working

it's been a long time, but nothing hard coded is needed anymore; it's passed through to the folder, huggingface, webdataset, and tfds dataset readers. Only torchvision datasets aren't easily supported.

[FEATURE] Add ViT GSAM

@banda-larga I actually looked at this around xmas time and was surprised, the checkpoints were awful (very poor validation). I tried all sensible input normalizations, etc but always really bad...

[BUG] Inquiry into feature_only setting of ghostnet backbone

actually not really an issue / bug: due to head order, fewer layers than described in the issue are chopped off, and I don't feel it makes sense to change

[FEATURE] Feature extraction for SWIN Transformer

supported on main branch now w/ NHWC output (see #1438 for more)

[FEATURE] Run benchmark `--model-list` in subprocess

`bulk_runner.py` does this, been using it for mass benchmark and validation for a while
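For readers unfamiliar with the idea in the feature request, here is a minimal sketch of running one job per model in its own subprocess, so a crash or out-of-memory error in one model does not take down the whole run. It is not how `bulk_runner.py` is implemented; the function name `run_isolated`, the model names, and the stand-in snippet are all hypothetical.

```python
import subprocess
import sys


def run_isolated(model_names, snippet):
    """Run `snippet` once per model name, each in a fresh Python process."""
    results = {}
    for name in model_names:
        # A new interpreter per model: a failure only affects this one entry.
        proc = subprocess.run(
            [sys.executable, "-c", snippet, name],
            capture_output=True,
            text=True,
        )
        results[name] = {
            "ok": proc.returncode == 0,
            "stdout": proc.stdout.strip(),
            "stderr": proc.stderr.strip(),
        }
    return results


# Hypothetical stand-in for a real benchmark; it fails on one model name.
SNIPPET = (
    "import sys\n"
    "name = sys.argv[1]\n"
    "if name == 'bad_model':\n"
    "    raise SystemExit(1)\n"
    "print('benchmarked ' + name)\n"
)

if __name__ == "__main__":
    outcome = run_isolated(["model_a", "bad_model", "model_b"], SNIPPET)
    for name, res in outcome.items():
        print(name, "ok" if res["ok"] else "failed", res["stdout"])
```

The loop keeps going after `bad_model` fails, which is the main reason to pay the per-process startup cost.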

[FEATURE] Save dataset_train.reader.class_to_idx inside the model and load it by default during inference and validation

@JustinMBrown it's a reasonable idea, only issue is that it ends up being a big change, ALL pretrained checkpoints right now are bare state_dict with no extra layer in the...
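To illustrate why this is a big change, here is a rough sketch of a loader that accepts both a bare state_dict and a checkpoint wrapped in an extra layer that also carries `class_to_idx`. The wrapper keys (`state_dict`, `class_to_idx`) and the helper name `split_checkpoint` are assumptions for illustration, not the library's format, and plain lists stand in for tensors.

```python
def split_checkpoint(checkpoint):
    """Return (state_dict, class_to_idx) for either checkpoint layout."""
    # Wrapped layout (hypothetical keys): weights sit under 'state_dict',
    # with the class mapping stored alongside them.
    if isinstance(checkpoint, dict) and "state_dict" in checkpoint:
        return checkpoint["state_dict"], checkpoint.get("class_to_idx")
    # Bare layout: the checkpoint itself is the state_dict, no mapping stored.
    return checkpoint, None


# Plain lists stand in for weight tensors in this sketch.
bare = {"head.weight": [0.1, 0.2], "head.bias": [0.0]}
wrapped = {"state_dict": bare, "class_to_idx": {"cat": 0, "dog": 1}}

if __name__ == "__main__":
    for ckpt in (bare, wrapped):
        weights, mapping = split_checkpoint(ckpt)
        print(sorted(weights), mapping)
```

Every code path that loads pretrained weights would need a branch like this one before a wrapped format could become the default.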

[WIP] Testing the lion optimizer

@lucidrains inconclusive so far, managed to almost match some recent adamw results for large fine-tune, but took a fair bit of search. I feel unless very resource constrained adamw still...

Add TextTextCLIP

@rom1504 k, will try and look at it soon

Add TextTextCLIP

overall things look pretty good, I'm trying to get over a mental block re the loss naming, I realize why the feature_a/b changes were made to the loss but I...

CoCa: fix MultimodalTransformer init + Mask CLS token at end of seq

@gpucce so discussing here so I might possibly combine this with #660 checks, this was days before my second child was born so yeah, it got lost in the stack...