DialoGPT icon indicating copy to clipboard operation
DialoGPT copied to clipboard

Large-scale pretraining for dialogue

Results 64 DialoGPT issues
Sort by recently updated
recently updated
newest added

Hello all, i would like to start by thanking the DialoGPT team for open sourcing this - it has greatly aided my work. I have been playing with the decoder...

Hi, I would like to be able to do 2 things that I think should be straight forward, but am having trouble figuring them out in the Hugging Face model....

I don't really want to generate any text for my use case I just want the embeddings. But when I used model.forward() with dialoggpt-small, it gave me a tuple with...

Is there a way to send in requests for the decoding script? I understand the nature of the challenges surrounding reddit toxicity, we just want to try it out privately,...

Hi there, thanks for sharing your amazing work on github. Just wanted to point out that the link (https://convaisharables.blob.core.windows.net/lsp/DSTC/medium_ft.pkl) shared in readme for DSTC medium model is the small GPT-2...

line 154 fails ``` output_dir = join(args.output_dir, 'GPT2.{}.{}.{}gpu.{}'.format(args.learning_rate, args.train_batch_size, n_gpu, timestamp)) ``` with: ´´´ Traceback (most recent call last): File "./LSP_train.py", line 154, in args.train_batch_size, n_gpu, timestamp)) File "/opt/conda/envs/pytorch-py3.6/lib/python3.6/posixpath.py", line...

What would be needed to make this work for other languages?

I have two questions about training reversed model. The first one is about training data. I can't see objective reason why prepro.py cuts off big part of training data. I...

Hi, thanks for making the work available and for the explanations. From the paper I understand that a training instance is a dialogue session, made up of several dialogue turns...

When i run `python demo.py` a got an advice how to train model with own data: `python LSP_train.py --model_name_or_path /home/joo/Docs/LocalRepository/DialoGPT/models/small --init_checkpoint /home/joo/Docs/LocalRepository/DialoGPT/models/small/pytorch_model.bin --train_input_file /home/joo/Docs/LocalRepository/DialoGPT/data/train.128len.db --eval_input_file ./data/dummy_data.tsv --output_dir /home/joo/Docs/LocalRepository/DialoGPT/models/output_model --seed 42...