fairseq-detect-hallucination
fairseq-detect-hallucination copied to clipboard
The amount of synthetic data
Hi, I personally find your work very interesting!
There is a little question though. I wonder how much synthetic data did you generate to train the final predictor? Are the synthetic data based on the $D_{train}$, which contains around 4.77M sentences in total?