customer-support-chatbot icon indicating copy to clipboard operation
customer-support-chatbot copied to clipboard

Reproducing results

Open ldulcic opened this issue 3 years ago • 4 comments

Following up on discussion from #1

@aitmlouk can you give me more details on how did you train the model? Parameters, number of epochs, batch size, etc. ?

ldulcic avatar May 07 '21 16:05 ldulcic

@ldulcic I used the code as it is, in the first step I used the default values from args and the number of epochs vary between 30 and 130 but always no result when it comes to prediction! To speed up the training process I used a training subset from apple support (train ~ 12000 records, val~5000, test~6000). I tried also the default args from pre-trained models and trained more than 40 epochs but always the pre-trained model is not working. How did u get these pre-trained models?

aitmlouk avatar May 07 '21 16:05 aitmlouk

@aitmlouk Well, if you used only a subset of dataset then you will get worse results. Pre-trained models contain args object which has values of all the arguments I used. Also, you can see in predict.py how many epochs it took me to train those models. Training should take quite a lot of time. I had two GeForce GTX 1080 GPUs → training ran 24-48 hours.

What is you end goal with training this model? What do you want to achieve?

ldulcic avatar May 07 '21 16:05 ldulcic

@ldulcic thanks for this clarification, I want to split the dataset into sub-dataset and do distribute training. what you suggest if I want to train the model on small dataset?

aitmlouk avatar May 07 '21 16:05 aitmlouk

@aitmlouk you won't get far with this model if you don't have lot of data. Try fine-tuning BERT.

ldulcic avatar May 07 '21 17:05 ldulcic