TPlinker-joint-extraction
TPlinker-joint-extraction copied to clipboard
implementation questions
Hi, Thanks for your good idea and paper, have some questions about TPLinkerPlus
- I see you use logsum to calculate the loss, is it better than BCE?
- Is there any reference/paper for "conditional" layernorm which is used for "cln" shaking_type? Thanks.
- It brought a slight improvement. For more details, see Extend "softmax+cross entropy" to multi-label classification problem
- For all I know, no. But you can refer to this blog Conditional text generation based on Conditional Layer Normalization.
Thanks for your quick response and guidance. Looking forward to your paper to know the gains of these 2 component. The first run of my implementation only have about 90.x on the NYT dataset without these 2 component(no hyper parameter tuning, just converge) which looks too low. I will double check the code and tuning a little. Thanks.