transformer-pointer-generator
Have you considered using a pre-trained model, such as BERT, for the embeddings?
That is, would the structure be BERT embeddings + Transformer decoder + PGN?
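
To make the question concrete, here is a minimal sketch of what I mean by the PGN step on top of the decoder. It is not taken from this repository. The function name `final_distribution` and the toy numbers are made up for illustration, and it assumes every source token id already lies inside the vocabulary (no extended vocabulary for OOV words). With BERT embeddings, I would expect `source_ids` to be the BERT tokenizer's ids, but that is my assumption.

```python
def final_distribution(p_vocab, attention, source_ids, p_gen):
    """Mix generation and copy distributions, pointer-generator style.

    p_vocab:    decoder softmax over the vocabulary (sums to 1)
    attention:  attention weights over source positions (sums to 1)
    source_ids: vocabulary id of the token at each source position
    p_gen:      probability of generating instead of copying, in [0, 1]
    """
    # Generation part: scale the vocabulary distribution by p_gen.
    mixed = [p_gen * p for p in p_vocab]
    # Copy part: add each source position's attention mass to its token id.
    for token_id, weight in zip(source_ids, attention):
        mixed[token_id] += (1.0 - p_gen) * weight
    return mixed


# Toy example (hypothetical values): 4-word vocabulary, 3 source tokens.
p_vocab = [0.1, 0.2, 0.3, 0.4]
attention = [0.5, 0.25, 0.25]
source_ids = [2, 0, 2]
dist = final_distribution(p_vocab, attention, source_ids, p_gen=0.5)
print(dist)
```

Token id 2 appears twice in the source, so it collects the attention mass from both positions. The result is still a valid probability distribution.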