learning-to-reweight-examples
Two issues
1. Does the method still work if the Adam optimizer is used? (Line 7 indicates a standard gradient descent method.) Or does it only fit SGD-based optimizers?
2. I would like to use this reweighting strategy in more complicated neural frameworks, such as LSTM or BERT, for other downstream tasks. Must I rewrite these models as 'meta-style' structures? It seems to be trivial.
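For context, here is a minimal sketch of the reweighting step as I understand it, assuming a one-step SGD lookahead and a toy scalar model `y = w * x` with squared error. Under that assumption, the gradient of the validation loss with respect to each example weight (at zero) reduces to a dot product between the validation gradient and the per-example training gradient, so no special module structure is needed in this toy case. The names `example_weights` and `reweighted_sgd_step` and the parameters `alpha` and `lr` are hypothetical and not taken from this repository.

```python
def example_weights(w, train, val, alpha=0.1):
    """Example weights from a one-step SGD lookahead (toy model y = w * x)."""
    # Per-example training gradients of the squared error (w*x - y)^2.
    g_train = [2.0 * (w * x - y) * x for x, y in train]
    # Gradient of the mean validation loss at the same w.
    g_val = sum(2.0 * (w * x - y) * x for x, y in val) / len(val)
    # d(val loss)/d(eps_i) at eps = 0 is -alpha * g_val * g_i.
    # Negate it and clip negatives to zero.
    raw = [max(0.0, alpha * g_val * g) for g in g_train]
    total = sum(raw)
    # Normalize so the weights sum to 1; if all are zero, keep them zero.
    return [r / total for r in raw] if total > 0 else raw


def reweighted_sgd_step(w, train, val, lr=0.1):
    """One parameter update on the reweighted training loss (plain SGD here)."""
    weights = example_weights(w, train, val)
    grad = sum(wt * 2.0 * (w * x - y) * x for wt, (x, y) in zip(weights, train))
    return w - lr * grad, weights


# Toy data: the second training example has a flipped (noisy) label.
train = [(1.0, 2.0), (1.0, -2.0)]
val = [(1.0, 2.0)]
new_w, weights = reweighted_sgd_step(0.0, train, val)
```

In this sketch the example whose gradient disagrees with the validation gradient gets weight zero. The final update uses plain SGD only for simplicity; whether it can be swapped for Adam, and whether the lookahead itself must stay an SGD step, is what the first question asks.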
@mengye-ren