compare-mt

bootstrap sample size

Open ozancaglayan opened this issue 4 years ago • 0 comments

Hello,

I was checking your notes (http://www.phontron.com/class/mtandseq2seq2018/assets/slides/mt-fall2018.chapter11.pdf) and saw the following, which seems to be applied in this codebase as well:

> In Line 4, we sample a subset of the test data, where in practice we usually use exactly half of the sentences in the test data.

If I understand correctly, with n sentences in the test set, every bootstrap resample contains only 0.5 * n sentences. What is the intuition behind using half of the sentences rather than all n?
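To make the question concrete, here is a minimal sketch of the procedure as I understand it. This is not the compare-mt implementation: the names `resample_ids`, `paired_bootstrap`, and `sample_ratio` are hypothetical, I am assuming the draws are made with replacement, and I average per-sentence scores for simplicity, whereas a corpus-level metric such as BLEU would be recomputed on each resample.

```python
import random


def resample_ids(n, sample_ratio, rng):
    """Draw int(n * sample_ratio) sentence indices, with replacement (assumed)."""
    k = max(1, int(n * sample_ratio))
    return [rng.randrange(n) for _ in range(k)]


def paired_bootstrap(scores_a, scores_b, num_samples=1000, sample_ratio=0.5, seed=0):
    """Compare two systems on the same resampled sentences.

    scores_a / scores_b: per-sentence scores for systems A and B (illustrative;
    a corpus-level metric would be recomputed per resample instead).
    Returns (fraction A wins, fraction B wins, fraction ties).
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both systems must be scored on the same sentences")
    n = len(scores_a)
    rng = random.Random(seed)
    wins_a = wins_b = ties = 0
    for _ in range(num_samples):
        # The same indices are used for both systems, so the comparison is paired.
        ids = resample_ids(n, sample_ratio, rng)
        total_a = sum(scores_a[i] for i in ids)
        total_b = sum(scores_b[i] for i in ids)
        if total_a > total_b:
            wins_a += 1
        elif total_b > total_a:
            wins_b += 1
        else:
            ties += 1
    return wins_a / num_samples, wins_b / num_samples, ties / num_samples
```

With `sample_ratio=0.5` each resample has n / 2 sentences; with `sample_ratio=1.0` it would have n, which is the resample size I expected.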

Thanks

ozancaglayan, Mar 07 '21 17:03