RepDistiller icon indicating copy to clipboard operation
RepDistiller copied to clipboard

How do you choose the optimal hyper-parameters?

Open JinYang88 opened this issue 4 years ago • 2 comments

There are several hyper parameters existing:

  1. teacher model hyper parameters
  2. student model hyper parameters
  3. KD hyper parameters (e.g., balance weight for different losses)
  4. Training hyper parameters (e.g., learning rate)

It is hard to enumerate for every combination, because it may explode. How do you find the best (or suboptimal) hyper parameter?

Thanks!

JinYang88 avatar Apr 24 '20 02:04 JinYang88

same question, just do not know how to set weight for different loss

surprisedong avatar Dec 25 '21 14:12 surprisedong

Hi, were you able to figure out a good set of hyperparameters?

ShristiDasBiswas avatar Mar 27 '24 19:03 ShristiDasBiswas