Mimick icon indicating copy to clipboard operation
Mimick copied to clipboard

Transformer Models

Open mertozlutiras opened this issue 3 years ago • 1 comments

Hi, is it possible to integrate it with transformer-based models, such as a variation of BERT?

mertozlutiras avatar Aug 26 '22 11:08 mertozlutiras

Hi, can you please give more details? Are you referring to replacing the Mimick LSTM with a transformer, or applying the Mimick idea within a BERT-like model?

For the latter, this would include solving some matters which are far from trivial, such as pre-training MLM objective and multi-token words. One solution is recorded in this preprint, for which code release is still unfortunately delayed.

yuvalpinter avatar Aug 29 '22 07:08 yuvalpinter