clang8 icon indicating copy to clipboard operation
clang8 copied to clipboard

Hyperparameters for prediction

Open YovaKem opened this issue 3 years ago • 1 comments

Can you tell me what hyperparameters were used for the beam search at inference time and anything concerning penalty for length and repetition? Thanks!

YovaKem avatar Nov 08 '21 19:11 YovaKem

Hi, we used greedy decoding for inference.

ekQ avatar Nov 11 '21 16:11 ekQ