guantao18 comments

Results 16 comments of


                                            guantao18

seeking STC dataset

> > The download link can be found in the README file > > https://pan.baidu.com/s/1GKwGDV-0e6dcRR-hVrrKGw?pwd=rev5 or https://drive.google.com/file/d/1jsTyvOz0y_6UIAkaibvvxf6bw0REqAlO/view?usp=sharing > > thx! But I've noticed that in the paper that proposed this...

inference

> hello! Has the problem been solved?

inference

@447428054 @SMinL hello! Has the problem been solved?

inference

hi i get it ! here code : return [(pos[1], pos[2],pseudo_tag) for pos in match_pos_pairs] index is wrong in method "extract_nested_spans" from query_span_f1.py

换成自己的数据集报错，不能训练

哈哈哈，我也是这个问题，解决了吗？我感觉是中文的问题。

换成自己的数据集报错，不能训练

@Josson 是的，这是bert的wordpiece导致的问题，英文和数字bert是按照最长匹配的，如果标注不是按照这个原则标的话就会导致分词前后的pos错位。解决办法是把标注数据按照bert分词规则重新分一遍做标注或者给带字母或数字的前面都加#，这样可以训练但不知道会不会引起新的问题。

换成自己的数据集报错，不能训练

@Josson 脚本中不指定显卡id,直接删除掉，程序会自动找可用显卡的。要是不用多卡训练就设置参数gpus="1"即可。

换成自己的数据集报错，不能训练

@Josson 把max-length改小一些，200以下；还不行就减小batch-size。还不行就等别人用完。

inference 遇到错误

@xiaoya-li @YuxianMeng @littlesulley 请问这可能是什么原因导致的？去年的issues就有这个提问。感谢！

源码分享

> 您好，可以分享下源码不？非常感谢一年了，估计悬了，自己复现一下吧