transfer-learning-conv-ai
🦄 State-of-the-Art Conversational AI with Transfer Learning
Hello. I am trying to run your model and have some confusion about your pre-trained model. It seems that ``train.py`` trains the model with the double-head model, but in the...
Hello! I tried different settings of your model, for example changing the token-level loss to a sentence-level loss, and used beam search as you mentioned. But the...
Hi, I am using Python 3.6, and I run `python train.py --model_checkpoint pretrained_transformers/gpt --dataset_path datasets/personachat_self_original.json`. Thanks.

```
INFO:/dev/ccn/generation/transfer-learning-conv-ai/utils.py:Tokenize and encode the dataset
Traceback (most recent call last):
  File "train.py", line 271, ...
```
Previously, each sequence was padded to the length of the longest sequence in the *dataset*. In this PR, each *batch* is padded to the length of the longest sequence in...
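A minimal sketch of what batch-level padding can look like with a custom `collate_fn` (the helper name `pad_batch` and the `pad_id` default are illustrative, not the PR's actual code):

```python
import torch

def pad_batch(sequences, pad_id=0):
    # Pad every sequence only to the longest length *within this batch*,
    # rather than to the longest length in the whole dataset.
    max_len = max(len(seq) for seq in sequences)
    padded = [seq + [pad_id] * (max_len - len(seq)) for seq in sequences]
    return torch.tensor(padded, dtype=torch.long)

# Usage: pass it as the collate_fn of a DataLoader over lists of token ids.
# loader = DataLoader(dataset, batch_size=8, collate_fn=pad_batch)
```

Padding per batch wastes far less compute on batches of short sequences, at the cost of tensor shapes that vary from batch to batch.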
Why is `num_candidates` set to `min(args.num_candidates, len(dataset[0]["utterances"][0]["candidates"]))`?
In train.py, starting from line number 81:

```python
for dataset_name, dataset in personachat.items():
    num_candidates = len(dataset[0]["utterances"][0]["candidates"])
    if args.num_candidates > 0 and dataset_name == 'train':
        num_candidates = min(args.num_candidates, num_candidates)
```

Please explain to me...
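For what it's worth, the `min` appears to cap the number of distractor candidates used for training at `args.num_candidates` while never exceeding what the dataset actually provides. A toy illustration (the values here are hypothetical):

```python
# Hypothetical values: the dataset ships 20 candidate replies per utterance,
# but double-head training typically only needs a few distractors.
dataset_candidates = 20   # len(dataset[0]["utterances"][0]["candidates"])
requested = 4             # args.num_candidates

# Respect the user's request, but never ask for more candidates than exist.
num_candidates = min(requested, dataset_candidates)
assert num_candidates == 4
```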
Hi team, I spoke with @thomwolf about possibly using Lightning as your backend! This would remove the need to do your own distributed computing and 16-bit stuff. Check out the simple...
Hi team, thank you very much for the great work and the clean code! I ran into a problem while running the code and was wondering if you could give me...
get_dataset.tokenize() on a single CPU is very slow, so this pull request upgrades it to use multiprocessing by implementing the multiprocessing target function worker_tokenize(args_list). Additionally, a multiprocessing debug logger...
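A rough sketch of that approach, assuming the corpus is split into per-worker chunks mapped over a `multiprocessing.Pool` (the chunking and tokenizer calls are illustrative, not necessarily the PR's exact implementation, and assume the tokenizer is picklable):

```python
import multiprocessing as mp

def worker_tokenize(args_list):
    # Target function run in each worker process: tokenize one chunk of strings.
    tokenizer, texts = args_list
    return [tokenizer.convert_tokens_to_ids(tokenizer.tokenize(t)) for t in texts]

def tokenize_parallel(tokenizer, texts, num_workers=4):
    # Split the corpus into one chunk per worker and tokenize the chunks in parallel.
    chunk_size = (len(texts) + num_workers - 1) // num_workers
    chunks = [(tokenizer, texts[i:i + chunk_size])
              for i in range(0, len(texts), chunk_size)]
    with mp.Pool(num_workers) as pool:  # call from under `if __name__ == "__main__":`
        results = pool.map(worker_tokenize, chunks)
    # Flatten the per-chunk results back into one list of token-id lists.
    return [ids for chunk in results for ids in chunk]
```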
1. When you call get_dataset_personalities(tokenizer, args.dataset_path, args.dataset_cache), it parses personachat_self_original.json, which contains the whole training set and takes a long time. I think it would be better to sample from a smaller file....
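One possible workaround along those lines, assuming the standard PersonaChat layout (a top-level "train" split whose entries each carry a "personality" list); the function below is a hypothetical sketch, not code from the repo:

```python
import json
import random

def sample_personalities(dataset_path, n=100, seed=42):
    # Load the full PersonaChat file once, then keep only a small random
    # sample of personalities so interactive startup stays fast.
    with open(dataset_path, "r", encoding="utf-8") as f:
        data = json.load(f)
    personalities = [dialog["personality"] for dialog in data["train"]]
    random.seed(seed)
    return random.sample(personalities, min(n, len(personalities)))
```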
This [chat](https://convai.huggingface.co/persona/my-favorite-jello-is-the-blue-one-i-ve-long-red-hair-i-don-t-eat-asparagus-i-work-at-home-on-my-computer) doesn't look right to me.