mm-cot
Question about two-stage training
Hi, I wonder whether the second-stage fine-tuning starts from the fine-tuned first-stage T5 model or from the initial T5 model?
It starts from the initial T5 model. I did not find obvious performance gains from using the fine-tuned first-stage T5 model.
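
For anyone who wants to compare the two options, here is a minimal sketch of the choice in Python. It is not the repository's actual training code: the checkpoint names and the `init_checkpoint` helper are hypothetical and only show which weights each stage would start from.

```python
# Hypothetical checkpoint identifiers, used only for illustration.
BASE_CHECKPOINT = "base-t5-checkpoint"
STAGE1_OUTPUT = "outputs/stage1-finetuned"


def init_checkpoint(stage: int, reuse_stage1: bool = False) -> str:
    """Return the checkpoint a given stage would start fine-tuning from."""
    if stage not in (1, 2):
        raise ValueError("stage must be 1 or 2")
    if stage == 2 and reuse_stage1:
        # Alternative setup: continue from the fine-tuned first-stage weights.
        return STAGE1_OUTPUT
    # Setup from the answer above: each stage starts from the initial model.
    return BASE_CHECKPOINT
```

With the default `reuse_stage1=False`, both stages resolve to the same initial checkpoint, which matches the answer above. Setting it to `True` gives the alternative that reportedly brought no obvious gains.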