DIS
DIS copied to clipboard
Final value of training loss
Thanks for your great work! What is the final value of training loss that can be achieved by using intermediate supervision loss?