Variational-Transformer
Variational-Transformer copied to clipboard
Is PRETRAIN necessary?
Hi, I notice that SVT is trained after loading the parameters of a pretrained model (including encoder & decoder). I am curious about if pretraining is necessary. Had you tried train SVT from scratch? What's the difference between these two training schemes?
如果您能看懂中文,希望可以加微信高效沟通,我的微信号 He_2262,感激不尽!