Yixin Liu
Hi Griffin, I have also found this comparison very interesting! My guess is that adjusting the likelihood has a more direct impact on the decoding output than adjusting the latent representation,...
Hi Tanya, I'm really sorry for the late reply. I've uploaded the checkpoint (as a generator) for NYT [here](https://drive.google.com/file/d/1O_XXQJLO7cgAmS0GPueV9SJtLQ8MHX7F/view?usp=sharing). I hope it's not too late :) I wanted to note...
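For anyone picking up the shared checkpoint, a minimal loading sketch may help. This assumes the file is a plain PyTorch state dict saved with `torch.save(model.state_dict(), path)`; the `load_generator` helper and the `checkpoint.bin` filename are illustrative, not part of the released code:

```python
import torch

def load_generator(model, ckpt_path, device="cpu"):
    """Load a saved generator checkpoint into an existing model.

    Hypothetical helper: assumes the downloaded file (e.g. "checkpoint.bin")
    is a standard PyTorch state dict matching the model's architecture.
    """
    state = torch.load(ckpt_path, map_location=device)
    model.load_state_dict(state)
    model.eval()  # switch to inference mode for generation
    return model
```

The actual model class should match what the BRIO README specifies for the NYT generator.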
Hi, could you provide more details on your finding (e.g. how did you notice this example? how did it affect your experiments?) I spot-checked around 10 examples and didn't find...
Hi, @aidejieceng, @thangld201, thanks a lot for bringing this to my attention! I found that there is indeed a misalignment between the tokenized input articles and reference summaries for CNN/DailyMail...
Hi, - Should the file (without the ".tokenized" suffix) be filled with the original sentence? Yes. - Which tokenizer should be used to tokenize the sentence in...
Hi! A checkpoint will be stored in a folder in the `./cache` directory, along with `config.txt`, after the first evaluation step (the default setting should be 1000 update steps)....
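The layout described above can be inspected with a small helper. This is a hypothetical sketch: the per-run sub-folder structure with a `config.txt` follows the comment, while the `*.bin` checkpoint extension and the `list_runs` name are assumptions for illustration:

```python
from pathlib import Path

def list_runs(cache_dir="./cache"):
    """List training runs saved under the cache directory.

    Assumed layout (per the description above): each run gets its own
    sub-folder containing config.txt plus checkpoints written at every
    evaluation step. The "*.bin" pattern is an assumption.
    """
    runs = {}
    for run_dir in sorted(Path(cache_dir).iterdir()):
        if run_dir.is_dir():
            runs[run_dir.name] = {
                "has_config": (run_dir / "config.txt").exists(),
                "checkpoints": sorted(p.name for p in run_dir.glob("*.bin")),
            }
    return runs
```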
Please check out https://github.com/yixinL7/BRIO/tree/main/examples/raw_data (our new work). It contains the example files.
Good questions :) > Considering the Neural model's amazing capacity for memorization, the candidate generation of training set for evaluation model should be nearly perfect. That's not exactly true because...
I'd like to emphasize my point that **if the model is overfitting too much on the training set it would not perform well on the evaluation set**. So it's possible...
Hi! We didn't use the default version (roberta-based) of BERTScore; instead we used the bert-based model, mainly because of different tokenizations. It has been a while, so I couldn't recall...