Yige Xu

Results: 22 comments of Yige Xu

Thank you for your issue! We have shown some hyperparameter settings in our paper (see Section 5.2). For BERT checkpoints after further pre-training, we share a link in our README (see...

Thank you for your issue! I have just uploaded our code for fine-tuning the model on multiple tasks. Multi-task fine-tuning simply adds additional softmax layers for the other tasks. In...
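For illustration, here is a minimal sketch of that idea: one shared BERT encoder with a separate linear (softmax) head per task. It assumes the Hugging Face `transformers` API and made-up class and variable names, not the repository's actual code.

```python
import torch.nn as nn
from transformers import BertModel

class MultiTaskBert(nn.Module):
    """Hypothetical multi-task model: shared encoder, one classification head per task."""
    def __init__(self, num_labels_per_task):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        # one softmax classification head per task; the encoder is shared
        self.heads = nn.ModuleList(
            nn.Linear(self.bert.config.hidden_size, n) for n in num_labels_per_task
        )

    def forward(self, input_ids, attention_mask, task_id):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        return self.heads[task_id](out.pooler_output)  # logits for the chosen task
```

During training, each batch would select the head for its task via `task_id`, and the per-task cross-entropy losses are summed or alternated.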

Regarding saving models: we did not save checkpoints during fine-tuning. If you need to save your models, we suggest using torch.save.
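A minimal sketch of that suggestion; the file name and the `model` variable below are placeholders:

```python
import torch

# save only the fine-tuned weights
torch.save(model.state_dict(), "finetuned_bert.pt")

# restore them later into a model with the same architecture
model.load_state_dict(torch.load("finetuned_bert.pt"))
```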

Sorry for the late answer. 1. We also use warm-up together with the layer-wise decreasing learning rate, which means they are used simultaneously (a rough sketch is shown below). 2. We did not conduct experiments about learning...
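For illustration, a rough sketch (not the repository's exact code) of combining warm-up with a layer-wise decreasing learning rate in PyTorch, assuming the Hugging Face `transformers` API; the base learning rate and decay factor are placeholder values.

```python
import torch
from transformers import BertModel, get_linear_schedule_with_warmup

model = BertModel.from_pretrained("bert-base-uncased")
base_lr, decay_factor = 2e-5, 2.6  # placeholder values; see the paper for actual settings

# each lower layer gets the learning rate of the layer above it divided by decay_factor
groups = []
num_layers = model.config.num_hidden_layers
for i, layer in enumerate(model.encoder.layer):
    lr = base_lr / (decay_factor ** (num_layers - 1 - i))
    groups.append({"params": layer.parameters(), "lr": lr})
groups.append({"params": model.embeddings.parameters(),
               "lr": base_lr / (decay_factor ** num_layers)})

optimizer = torch.optim.AdamW(groups, lr=base_lr)
# warm-up is applied on top of every group's layer-specific learning rate,
# so the two techniques act simultaneously
scheduler = get_linear_schedule_with_warmup(
    optimizer, num_warmup_steps=100, num_training_steps=1000
)
```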

Hi, thank you for your interest in our work. The config and the vocab file are the same as the original ones, therefore our code does not automatically output the...

Thank you for your issue! 1. The number 2.6 was set for the initial experiments; after that, we use run_classifier_discriminative.py for discriminative fine-tuning. 2. The link to run_classifier_discriminative.py is https://github.com/xuyige/BERT4doc-Classification/blob/master/codes/fine-tuning/run_classifier_discriminative.py...

The further pre-training task is masked language modeling, not (left-to-right) language modeling, so I think perplexity may not be a good metric. Can you set your batch size larger or...

We ran on a single 1080 Ti GPU for about 8-10 hours for 100k steps.

Sorry for the late answer. As shown above, a 960M may have very limited memory. A GPU with 12 GB of memory can only hold batch size = 6 if max_seq_len=512, so please reduce your...

> Hi,
> I tried run_pretraining.py recently; it works fine for me.
> I'm using tensorflow-gpu=1.15.0, cudatoolkit=10.0.
> First, I think that the 960M has very limited VRAM, which could cause your...