
Question about baseline reward in `caption_mplug_scst.py`

Open czy-orange opened this issue 1 year ago • 0 comments

The code in this repo computes the baseline reward by averaging the rewards of the generated captions. However, the original version of SCST, as well as some other SCST implementations (e.g., in VALOR), computes the baseline reward from the caption generated by greedy search. Is there a reference or explanation for the current implementation in this repo? I would really appreciate any help.
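To make the question concrete, here is a minimal sketch of the two baselines as I understand them. The function names and reward values are hypothetical and are not taken from `caption_mplug_scst.py` or VALOR; the rewards stand in for a caption metric such as CIDEr.

```python
from statistics import mean


def advantages_greedy_baseline(sample_rewards, greedy_reward):
    """Original SCST style: the baseline is the reward of the
    greedy-search caption for the same image."""
    return [r - greedy_reward for r in sample_rewards]


def advantages_mean_baseline(sample_rewards):
    """Averaging style: the baseline is the mean reward of the
    generated captions for the same image."""
    baseline = mean(sample_rewards)
    return [r - baseline for r in sample_rewards]


# Hypothetical rewards for four generated captions of one image.
sample_rewards = [0.5, 1.0, 1.5, 1.0]
# Hypothetical reward of the greedy-search caption for that image.
greedy_reward = 1.25

adv_greedy = advantages_greedy_baseline(sample_rewards, greedy_reward)
adv_mean = advantages_mean_baseline(sample_rewards)

print("greedy baseline advantages:", adv_greedy)
print("mean baseline advantages:  ", adv_mean)
```

In this sketch the mean-baseline advantages always sum to zero within an image, while the greedy-baseline advantages are positive only for captions that beat the greedy caption. That difference is what I would like to understand.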

czy-orange · Dec 21 '23 05:12