Teacher-free-Knowledge-Distillation
Teacher-free-Knowledge-Distillation copied to clipboard
Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization
Hello, thanks for your great work! I have a question about a possible mismatch between the Eq.9 in the paper and the real [implementations in the code](https://github.com/yuanli2333/Teacher-free-Knowledge-Distillation/blob/dc7464870b605b8acd303a00aa42084254d7c383/my_loss_function.py#L47). Here are the...
hello ,author. I read the paper, the parameters (temperature and alpha) are obtained by gird search. can you release the code . I want to learn it . Thank you.
Thanks for the great work. I find the pre-trained model for the teacher network. Will you release the Pretrained model for student network? Thanks!
do you have email? I have some trouble with your code.
My loss after distillation is 0, which feels very strange. I want to ask whether there is a problem with the distillation method or the calculation of distillation function in...
Bumps [werkzeug](https://github.com/pallets/werkzeug) from 0.15.4 to 2.2.3. Release notes Sourced from werkzeug's releases. 2.2.3 This is a fix release for the 2.2.x release branch. Changes: https://werkzeug.palletsprojects.com/en/2.2.x/changes/#version-2-2-3 Milestone: https://github.com/pallets/werkzeug/milestone/26?closed=1 This release contains...
Bumps [certifi](https://github.com/certifi/python-certifi) from 2018.8.24 to 2022.12.7. Commits 9e9e840 2022.12.07 b81bdb2 2022.09.24 939a28f 2022.09.14 aca828a 2022.06.15.2 de0eae1 Only use importlib.resources's new files() / Traversable API on Python ≥3.11 ... b8eb5e9 2022.06.15.1...
Hi, it's a very good and interesting work! How do you think the effectiveness of this method in detection tasks?
Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.15.0 to 3.18.3. Release notes Sourced from protobuf's releases. Protocol Buffers v3.18.3 C++ Reduce memory consumption of MessageSet parsing This release addresses a Security Advisory for C++...
I trained the resnet50 baseline model for 500 rounds, but the accuracy obtained was only 71%