Juhan Bae

Results 8 comments of Juhan Bae

I thought cross_entropy just combines log_softmax and nll_loss.

Hello, thank you for your interest in the code. Could you share which PyTorch version you are using?

I believe that to reproduce the results, it might be better to use PyTorch 1.5.1. Depending on small changes (e.g., initialization), you might get different results. If you want to...

Sorry for the late reply! I am unsure what the issue might be here - if you are still stuck, please feel free to leave a full code. I can...

Sorry for delay in my responses! 1. Unfortunately, we don’t have block-diagonal approximation implemented. If you are running into memory issues, you could increase ‘_module_partition’ so that we iteratively fit...

1. If you would like to implement block-diagonal approximation, the easiest way would be to keep track of a list of covariances (e.g., https://github.com/pomonam/kronfluence/blob/main/kronfluence/module/tracked_module.py#L131), and modify the code accordingly to...