Training the biencoder

Open anshks opened this issue 4 years ago • 4 comments

I am trying to reproduce the results from the code, but I am having trouble training the biencoder. Could you please explain how to fine-tune the biencoder on the datasets mentioned in the paper? Also, could you provide the WikilinksNED Unseen-Mentions dataset that you used? @fabiopetroni

anshks avatar Jan 17 '21 15:01 anshks

@anshks We will add instructions on how to fine-tune the model shortly. We obtained the WikilinksNED Unseen-Mentions dataset from the authors of this paper: https://ojs.aaai.org/index.php/AAAI/article/view/6380

ledw avatar Jan 27 '21 04:01 ledw

+1 Waiting for fine-tuning note :)

nlp4whp avatar Mar 11 '21 07:03 nlp4whp

@ledw any progress on this? I'm trying to train the bi-encoder, but can't find the details of how you trained it (e.g. scripts, hyperparameters).

shzamanirad avatar Aug 16 '21 07:08 shzamanirad

It is indeed possible to train a model on your own custom dataset. I would recommend first training a model on zeshel (the Zero-Shot Entity Linking dataset), then converting your data to the same format as zeshel and training your model on it.

Scripts to download and pre-process the zeshel dataset are in the repository.
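
To illustrate the conversion step, here is a minimal sketch of turning a custom annotated mention into one JSON-lines record. The field names (`context_left`, `mention`, `context_right`, `label_id`, `label_title`, `label`, `world`) are my assumption about what the pre-processed zeshel files look like, and the helpers `to_record` and `write_jsonl` are hypothetical, not part of the repository. Check the keys against the actual output of the zeshel pre-processing script before training.

```python
import json


def to_record(text, start, end, entity_id, entity_title, entity_text, world="custom"):
    """Split `text` around the mention span [start, end) and attach the gold entity.

    ASSUMPTION: the key names below mirror the pre-processed zeshel records;
    verify them against the files produced by the repository's scripts.
    """
    if not (0 <= start < end <= len(text)):
        raise ValueError("mention span is out of bounds")
    return {
        "context_left": text[:start].strip(),   # text before the mention
        "mention": text[start:end],             # surface form of the mention
        "context_right": text[end:].strip(),    # text after the mention
        "label_id": entity_id,                  # id of the gold entity
        "label_title": entity_title,            # title of the gold entity
        "label": entity_text,                   # description of the gold entity
        "world": world,                         # domain name (hypothetical default)
    }


def write_jsonl(records, path):
    """Write one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")


example = to_record(
    "Paris is the capital of France.", 0, 5,
    entity_id=42,
    entity_title="Paris",
    entity_text="Paris is the capital city of France.",
)
```

Calling `write_jsonl([example], "train.jsonl")` would then produce a file with one record per line; the file name here is only a placeholder.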

abhinavkulkarni avatar May 05 '22 03:05 abhinavkulkarni