BLINK icon indicating copy to clipboard operation
BLINK copied to clipboard

How to use bi-encoders to achieve Chinese entity linking on your own data set (including entity recognition)

Open Grace-Wang-Lab opened this issue 3 years ago • 1 comments

The function I want to achieve is to give a piece of Chinese text and find the best candidate entity from my entity database.Can you write a document describing how to implement this process. As a novice, I don't know much about the process and I hope to get help. @fabiopetroni

Grace-Wang-Lab avatar Feb 04 '21 01:02 Grace-Wang-Lab

@Grace-Wang-Lab: It is indeed possible to train a model on your own custom dataset. I would recommend first training a zeshel (Zero-shot learning) model, then bring your data in the same format as zeshel and train your model.

Scripts to download and pre-process zeshel data set are in the repository.

abhinavkulkarni avatar May 05 '22 03:05 abhinavkulkarni