Sasha Behrouzi
Sasha Behrouzi
my student mAP is 73% on my test dataset. Let me describe my workflow: 1- I trained Retinanet-r101 with my data **(I am using my usecase data which is damage...
Thanks for the reply. 1- Would you please explain more about keeping distillation the setting as 2? 2- I am using inheriting strategy for initializing neck and head of student...
1- In here the initialization of the backlog skipped: `if name.startswith("backbone."): continue` 2- My teacher and student trained with Adam lr=0.001, Do you think should I change the distiller configuration...
I have initialized the backbone of the student and adjusted the optimizer same as baseline. now I am starting with 71% with the first epoch. But in the next epochs...
Thanks. The problem was learning rate. I reduced my learning rate and now the optimization is working. I will share the result under this post for reference. My other question...