ml-cvnets
ml-cvnets copied to clipboard
Model stability problem
Hi! Thank you for the great work. I used the mobilevit blocks for my model to low level task. at begin it has good performance , but I get different performance when I run it once again. my model is stable if I remove the mobilevit blocks. Do you know what problem would make the model instability, I use following basic parameter: max_lr:1e-4 min_lr:1e-6 optim name: adamw scheduler: name: "cosine" in_channels:96 transformer_dim : 144 ffn_dim = 288 n_transformer_blocks=2
mobilevit blocks num_heads is 4 would be too small?
How many epochs are you training?
epochs=3000