LINE icon indicating copy to clipboard operation
LINE copied to clipboard

How to evaluate the output embeddings?

Open long4glasgow opened this issue 7 years ago • 3 comments

long4glasgow avatar Sep 03 '17 21:09 long4glasgow

It depends on the task you test. Try using the keywords such as:

  • word embeddings
  • link prediction
  • network reconstruction

chihming avatar Sep 04 '17 15:09 chihming

I'm trying to reproduce the result of the LINE paper, but fail to find the code for classification and word analogy tasks. For the youtube dataset, I would like to try the classification experiment. For wordembedding task, I would like to try semantic/syntactic accuracy. Do you guys have any idea where can I get the Wikipedia network as well? Many Thanks, Long

long4glasgow avatar Sep 04 '17 15:09 long4glasgow

A common way is to use libfm to do one-versus-rest classification. For word analogy, this Python toolkit could help you. As to the wiki dataset, you can get the dump data from here or find the preprocess one here. In addition, I maintain a list related to embedding models, which may provide you some info. Check from here.

chihming avatar Sep 05 '17 01:09 chihming