Dual-Contrastive-Learning icon indicating copy to clipboard operation
Dual-Contrastive-Learning copied to clipboard

Why did the Dual gradient collapse on my own Chinese dataset?

Open wangqian97 opened this issue 2 years ago • 1 comments

Dear author, your framework is valid on the English dataset, but when I used dual-loss deficiency on my Chinese dataset, gradient collapse occurred. My Chinese label is two characters, is it related to this? Or do I have to adjust somewhere? Thank you very much. Look forward to hearing from you soon

wangqian97 avatar Oct 25 '22 12:10 wangqian97

Hi, we should use a single word to tokenize each label in DualCL. I conjecture that if the two-character Chinese label is encoded by two or more tokens, the DualCL loss will perform abnormally. Consider adding the whole label to the dictionary or alerting the label to a single (Chinese) character.

hiyouga avatar Oct 26 '22 03:10 hiyouga