Akshay issues

Results: 10 issues by Akshay

Regarding the scikit-plot.metrics.plot_roc function

In your code I noticed that if we pass the classes by their actual labels instead of (0, 1, 2, ...), for example as (c, b, a), then np.unique(y_true)...
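As a small illustration of the ordering concern raised here: np.unique returns its unique values in sorted order, so labels supplied as (c, b, a) come back as (a, b, c). The sketch below uses plain Python `sorted(set(...))` as a stand-in for np.unique. The `user_order` list, the sample labels, and the remapping step are hypothetical and not part of scikit-plot.

```python
# Stand-in for np.unique(y_true): unique labels in sorted order.
def sorted_unique(labels):
    return sorted(set(labels))

# Hypothetical data: the caller thinks of the classes in the order (c, b, a).
user_order = ["c", "b", "a"]
y_true = ["c", "a", "b", "c", "a"]

# The order a sorted-unique call would report.
classes = sorted_unique(y_true)

# If a probability matrix has its columns laid out in user_order, this gives,
# for each sorted class, the column that actually holds its probabilities.
column_for_class = [user_order.index(label) for label in classes]

print(classes)
print(column_for_class)
```

If the probability columns follow `user_order` while the class list is sorted, the columns would need to be reordered with something like `column_for_class` before the two can be matched up.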

How does logit_scale vary during training? I noticed that in my case it starts at 14.28 (1/0.07), then steadily decreases and reaches 1 towards the end of training

I am running CLIP on my own dataset and noticed that logit_scale converges to 1. Is this expected behavior? I noticed that the loss...
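To make the numbers in this question concrete, here is a minimal sketch assuming the usual CLIP-style setup in which the logits are cosine similarities multiplied by logit_scale. The similarity values are made up for illustration. The sketch only shows how a scale of 1/0.07 versus 1 changes the sharpness of the softmax; it does not say which behavior is appropriate for a given dataset.

```python
import math

def softmax(values):
    # Subtract the maximum for numerical stability.
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def scaled_probs(similarities, logit_scale):
    # CLIP-style logits (assumed): logit_scale * cosine similarity.
    return softmax([logit_scale * s for s in similarities])

initial_scale = 1 / 0.07   # the starting value mentioned in the question
sims = [0.9, 0.1, -0.2]    # hypothetical cosine similarities; first is the matching pair

sharp = scaled_probs(sims, initial_scale)
flat = scaled_probs(sims, 1.0)

print(round(initial_scale, 2))
print([round(p, 3) for p in sharp])
print([round(p, 3) for p in flat])
```

Because cosine similarities lie in [-1, 1], a scale of 1 caps the ratio between any two softmax probabilities at e^2, so the distribution stays comparatively flat however well the embeddings separate.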

Is there any particular reason why the bias term is set to False in the projection layers?

In the code in this file, https://github.com/mlfoundations/open_clip/blob/main/src/open_clip/hf_model.py, the bias of the projection head is set to False. I feel it shouldn't matter, and keeping it as True...

Why is masking performed again during the inference decoder stage?

Have we added VeRA (Vector-based Random Matrix Adaptation)? It was recently published at ICLR 2024

A recent paper by Qualcomm AI Research proposed a new parameter efficient finetuning method called [VeRA](https://arxiv.org/pdf/2310.11454.pdf). This uses two projection matrices (as in LoRA) that are randomly initialised, frozen, and...
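For a rough sense of why this counts as parameter efficient, the sketch below compares trainable parameter counts for a single adapted weight matrix. It rests on my understanding of the VeRA paper, which should be treated as an assumption here: LoRA trains both low-rank matrices of rank r, whereas VeRA keeps its two matrices frozen and trains only two scaling vectors, one of length r and one of length equal to the output dimension. The function names and the 768 by 768, rank 8 figures are illustrative only.

```python
def lora_trainable_params(in_features, out_features, rank):
    # LoRA trains A (rank x in_features) and B (out_features x rank).
    return rank * in_features + out_features * rank

def vera_trainable_params(out_features, rank):
    # Assumed VeRA layout: frozen random A and B, plus a trainable scaling
    # vector of length `rank` and one of length `out_features`.
    return rank + out_features

# Hypothetical layer size, chosen only for illustration.
in_features, out_features, rank = 768, 768, 8

lora = lora_trainable_params(in_features, out_features, rank)
vera = vera_trainable_params(out_features, rank)

print(lora, vera)
```

Under these assumptions the VeRA count grows with r + out_features rather than r * (in_features + out_features), which is where the savings would come from.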

pending

The projection head order needs to be revisited

Going through these papers:

1) https://arxiv.org/pdf/1603.05027.pdf
2) https://arxiv.org/pdf/2302.06112.pdf

```
class ProjectionHead(nn.Module):
    def __init__(
        self,
        embedding_dim,
        projection_dim=CFG.projection_dim,
        dropout=CFG.dropout
    ):
        super().__init__()
        self.projection = nn.Linear(embedding_dim, projection_dim)
        self.gelu = nn.GELU()
        self.fc = nn.Linear(projection_dim, projection_dim)
        ...
```

DeepLabV3Plus is not compatible with encoder_depth=4 and swin models

So I was working with both swinv2_tiny_window8_256 and swinv2_base_window12to16_192to256 and noticed that they would not load with torchseg.DeepLabV3Plus:

```
model = torchseg.DeepLabV3Plus(
    "swinv2_base_window12to16_192to256",
    in_channels=1,
    classes=2,
    encoder_weights=True,
    encoder_depth=4,
    decoder_channels=256,
    encoder_output_stride=16,
    encoder_params={"img_size":...
```
