ml-cvnets

Model stability problem

Open Xinjie-Wei opened this issue 3 years ago • 3 comments

Hi! Thank you for the great work. I used MobileViT blocks in my model for a low-level task. At first it performed well, but I get different performance when I run it again. My model is stable if I remove the MobileViT blocks. Do you know what could make the model unstable? I use the following basic parameters:

max_lr: 1e-4
min_lr: 1e-6
optim name: adamw
scheduler name: "cosine"
in_channels: 96
transformer_dim: 144
ffn_dim: 288
n_transformer_blocks: 2

Xinjie-Wei avatar Sep 28 '22 12:09 Xinjie-Wei

The MobileViT blocks use num_heads = 4. Could that be too small?

Xinjie-Wei avatar Sep 28 '22 12:09 Xinjie-Wei

How many epochs are you training?

sacmehta avatar Oct 30 '22 02:10 sacmehta

epochs=3000

Xinjie-Wei avatar Nov 18 '22 03:11 Xinjie-Wei
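
For reference, the learning-rate settings quoted in this thread (cosine scheduler, max_lr 1e-4, min_lr 1e-6, epochs=3000) can be sketched as below. This is only a minimal illustration that assumes a plain cosine annealing over the full run with no warmup; the thread does not confirm that this matches the scheduler actually used, and the function name `cosine_lr` is hypothetical.

```python
import math


def cosine_lr(epoch, total_epochs=3000, max_lr=1e-4, min_lr=1e-6):
    """Cosine-annealed learning rate for a given epoch.

    Assumption: no warmup, one half-cosine from max_lr down to min_lr.
    The defaults are the values quoted in the thread.
    """
    # Clamp the epoch into [0, total_epochs] and convert to a 0..1 fraction.
    progress = min(max(epoch, 0), total_epochs) / total_epochs
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Print the learning rate at a few points of the 3000-epoch run.
    for epoch in (0, 750, 1500, 2250, 3000):
        print(f"epoch {epoch:4d}: lr = {cosine_lr(epoch):.3e}")
```

Under these assumptions the rate starts at max_lr, passes the midpoint of the two values halfway through training, and ends at min_lr.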