Teacher-free-Knowledge-Distillation issues

Mismatch between Eq.9 in the paper and the code

4

Hello, thanks for your great work! I have a question about a possible mismatch between Eq. 9 in the paper and the actual [implementation in the code](https://github.com/yuanli2333/Teacher-free-Knowledge-Distillation/blob/dc7464870b605b8acd303a00aa42084254d7c383/my_loss_function.py#L47). Here are the...

MingSun-Tse

How to search the best temperature and alpha

5

Hello, author. I read the paper; the parameters (temperature and alpha) are obtained by grid search. Can you release the code? I want to learn from it. Thank you.

TimeBear

Pretrained model for student network

2

Thanks for the great work. I found the pre-trained model for the teacher network. Will you release the pre-trained model for the student network? Thanks!

he-y

Do you have an email address? I have some trouble with your code.

2

Do you have an email address? I have some trouble with your code.

TimeBear

KD loss is zero

My loss after distillation is 0, which feels very strange. I want to ask whether there is a problem with the distillation method or the calculation of distillation function in...

minato1000

Bump werkzeug from 0.15.4 to 2.2.3

Bumps [werkzeug](https://github.com/pallets/werkzeug) from 0.15.4 to 2.2.3. Release notes Sourced from werkzeug's releases. 2.2.3 This is a fix release for the 2.2.x release branch. Changes: https://werkzeug.palletsprojects.com/en/2.2.x/changes/#version-2-2-3 Milestone: https://github.com/pallets/werkzeug/milestone/26?closed=1 This release contains...

dependabot[bot]

dependencies

Bump certifi from 2018.8.24 to 2022.12.7

Bumps [certifi](https://github.com/certifi/python-certifi) from 2018.8.24 to 2022.12.7. Commits 9e9e840 2022.12.07 b81bdb2 2022.09.24 939a28f 2022.09.14 aca828a 2022.06.15.2 de0eae1 Only use importlib.resources's new files() / Traversable API on Python ≥3.11 ... b8eb5e9 2022.06.15.1...

dependabot[bot]

dependencies

Does this method work on detection tasks?

Hi, this is very good and interesting work! How effective do you think this method would be on detection tasks?

fmaaf

Bump protobuf from 3.15.0 to 3.18.3

Bumps [protobuf](https://github.com/protocolbuffers/protobuf) from 3.15.0 to 3.18.3. Release notes Sourced from protobuf's releases. Protocol Buffers v3.18.3 C++ Reduce memory consumption of MessageSet parsing This release addresses a Security Advisory for C++...

dependabot[bot]

dependencies

I trained the resnet50 baseline model for 500 rounds, but the accuracy obtained was only 71%

panshoudeng
