unilm icon indicating copy to clipboard operation
unilm copied to clipboard

Running the EdgeLM model on XSum

Open pramodith opened this issue 3 years ago • 2 comments

Hi I am trying to use the EdgeLM model and was wondering

  1. how I can run the model with pre-trained weights in interactive mode, I'd like to run it for a summarization task.
  2. I was also wondering if the pre-trained weights can be used to replicate the results reported on the XSum dataset.
  3. How to make use of the pre-trained 2k vocab file and sentencepiece model while attempting 1 and 2.
  4. How can I run the model on CPU only.

pramodith avatar Aug 22 '22 13:08 pramodith

Hi,

(1) you can try fairseq-interactive to use the model in the interactive mode. (2) Yes, but you need to fine-tune the model for XSum. The pretrained weights cannot be directly used for summarization (3) It is the same way as shown in the instructions, just need to replace fairseq-generate with fairseq-interactive (you can refer to fairseq-interactive to see the detailed instructions) (4) You need to add the flag --cpu while running fairseq-interactive.

getao avatar Aug 24 '22 02:08 getao

Hi @getao i was wondering what to use for --task for pertaining edgeformer for summarization?

inderpreetsingh01 avatar Aug 25 '22 09:08 inderpreetsingh01