Jack Morris
Yes, the MSMARCO longer-sequence-length dataset included sequences from 1 to 128 tokens
Hi @carriex -- this looks right! I'm pretty sure that's the right model. Can you share the error with me? Or maybe we can work out of a Colab to...
Ok there was something weird with the pre-trained model from HuggingFace which I will look into. For now, I developed a workaround; here's some code that properly loads the hypothesizer...
(The only line I changed was adding this:)

```python
training_args.corrector_model_from_pretrained = "jxm/vec2text__openai_ada002__msmarco__msl128__hypothesizer"
```
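For anyone following along, the workaround just points the corrector-loading logic at the published hypothesizer checkpoint by name. Here's a minimal, self-contained sketch of what that line does; the `SimpleNamespace` stand-in is a simplification, in the real script `training_args` is the script's own training-arguments object:

```python
from types import SimpleNamespace

# Simplified stand-in for the script's training arguments object.
training_args = SimpleNamespace(corrector_model_from_pretrained=None)

# The workaround: point corrector loading at the published hypothesizer
# checkpoint on the HuggingFace Hub instead of resolving it automatically.
training_args.corrector_model_from_pretrained = (
    "jxm/vec2text__openai_ada002__msmarco__msl128__hypothesizer"
)

print(training_args.corrector_model_from_pretrained)
```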
Hmm, the command looks right and the numbers are close but a little low. Oddly the dataset looks different -- I've never seen that example (`"Toonimo Toonimo is a..."`) before....
Yep it should be the last number in the figure, the one you highlighted. And you're right -- it should be the NQ validation set (not MSMARCO, my mistake). Something...
@startakovsky can you be more specific? Which example, and what did you find confusing?