otfusion icon indicating copy to clipboard operation
otfusion copied to clipboard

How were the two models being created?

Open dogma346 opened this issue 2 years ago • 1 comments

Hi, Sidak. My name is Terai, and I am an undergraduate student studying informatics engineering.

I have read your paper (titled as Model Fusion via Optimal Transport) published on NIPS'20. After reading through the paper, I have tried to reproduce the simulation results using the codes publicly available on GitHub, but I have two simple questions: How were the two models (used in Figure 2 of the paper) being created? How can I leverage your codes to retrain the models by myself?

I know you are busy, but I would greatly appreciate it if you could help me. Thanks.

dogma346 avatar Mar 29 '22 03:03 dogma346

Hi, Sidak. My name is Terai, and I am an undergraduate student studying informatics engineering.

I have read your paper (titled as Model Fusion via Optimal Transport) published on NIPS'20. After reading through the paper, I have tried to reproduce the simulation results using the codes publicly available on GitHub, but I have two simple questions: How were the two models (used in Figure 2 of the paper) being created? How can I leverage your codes to retrain the models by myself?

I know you are busy, but I would greatly appreciate it if you could help me. Thanks.

Hi, Terai. I have also tried to reproduce the results. The resnet model may be trained using cifar/models/resnet.py. Please note that this model structure is different from that in torchvision.models. And the training hyperparameters are listed in cifar/hyperparameters. You can use train_cifar_models.py to train source models.

I have trained resnet models with BN layer and linear layer bias, while the provided models only contain the conv weights and linear weights. So I am not sure the actual model architecture during training.

XueY-L avatar Apr 23 '23 15:04 XueY-L