Yuxin Jiang 姜宇心

Results 13 comments of Yuxin Jiang 姜宇心

Thank you for your interest in our work. In our research, we utilize the autoregressive language modeling objective to train the student model. This involves using the teacher model's responses...

From lines 116 to 142 in `src/train.py`, we define the dataset used for training, which contains the input as well as the label (target). The training loss is inner defined...

Thanks for your interest in our work! **For your first and second questions**, the errors you are encountering are due to parsing failures in the function `def paring_discriminative_generation(generation, level)` in...