PKD-for-BERT-Model-Compression
PyTorch implementation of Patient Knowledge Distillation for BERT Model Compression
A question
The code has a `--teacher_prediction` argument — where does that file come from? Is it saved while training the teacher model? I don't see where that happens.
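For context, the workflow this question refers to — caching the teacher's outputs once so student training can load them instead of re-running the teacher — can be sketched as below. This is a minimal illustration only: `run_teacher`, the file path, and the pickle layout are assumptions for the sketch, not the repository's actual format.

```python
import pickle
import numpy as np

def run_teacher(dataset):
    # Hypothetical stand-in for a forward pass of the fine-tuned
    # teacher BERT; here it just returns random 2-class logits.
    rng = np.random.default_rng(0)
    return rng.normal(size=(len(dataset), 2))

def save_teacher_predictions(dataset, path):
    # Run the teacher once over the training set and cache its logits
    # to disk; a later student run would load this file instead of
    # keeping the full teacher in memory.
    logits = run_teacher(dataset)
    with open(path, "wb") as f:
        pickle.dump({"logits": logits}, f)
    return logits

logits = save_teacher_predictions(["ex1", "ex2", "ex3"], "/tmp/teacher_preds.pkl")
print(logits.shape)  # (3, 2)
```

A file produced this way would then be the value passed to an argument like `--teacher_prediction` at student-training time.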
Hi, thank you for your interesting work! I was just wondering why you don't use the pooler for KD.Full only, and if you do use the pooler, did you initialize the pooler...
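For readers following this thread: the PKD objective combines a soft-label KD term on the logits with a "patient" term on normalized [CLS] hidden states of matched intermediate layers, which is why raw hidden states (rather than the pooler output) can be compared directly. A minimal NumPy sketch of the two terms follows; the function names and temperature value are illustrative, not the repository's code.

```python
import numpy as np

def softmax(x, T=1.0):
    # Numerically stable temperature-scaled softmax.
    z = x / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def kd_loss(student_logits, teacher_logits, T=2.0):
    # Cross-entropy between temperature-softened teacher and student
    # distributions (the soft-label distillation term).
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    return -(p_t * np.log(p_s + 1e-12)).sum(axis=-1).mean()

def patient_loss(student_cls, teacher_cls):
    # MSE between L2-normalized [CLS] hidden states of matched layers;
    # this consumes hidden states directly, so no pooler is required.
    def norm(h):
        return h / (np.linalg.norm(h, axis=-1, keepdims=True) + 1e-12)
    return ((norm(student_cls) - norm(teacher_cls)) ** 2).sum(axis=-1).mean()

logits = np.array([[2.0, -1.0], [0.5, 0.5]])
h = np.ones((2, 4))
print(kd_loss(logits, logits) >= 0.0)   # True
print(patient_loss(h, h) == 0.0)        # True
```

The patient term goes to zero when the student's intermediate [CLS] states align with the teacher's, independent of any pooler weights.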
Hi, thank you for your interesting work! I have recently started learning about BERT and distillation, and I have some general questions on this topic. 1. I want to compare...
First, thank you for releasing your code. I am trying to reproduce the results from your paper. I am running `NLI_KD_training.py` for MRPC with DEBUG=True. The setting I am running is...