DSXiangLi comments

Results: 25 comments by DSXiangLi

Performance issue in utils.py (by P3)

@DLPerf Thanks for pointing this out! Honestly, I haven't paid much attention to performance before >< I just took a look at that performance doc and found there are actually...

Error when running main.py --model bert_bilstm_crf_adv --data msra,msr:

@fengxuefx Could you please post your run script?

ctb.50d.vec file not found. data/msra/preprocess.py

Please read the README in each subfolder carefully. The one in pretrain_model lists the models you need to download, along with their links.

Reproduction problems

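@ZR5932 https://github.com/DSXiangLi/ChineseNER/blob/main/requirement.txt The requirements were exported directly from my current environment, so you can set up a virtual environment and try installing from that file.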

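@ZR5932 I'm not sure about the first problem. The word embedding you downloaded may be in binary format. If it is in GloVe format, try setting binary=True in the KeyedVectors.load_word2vec_format call that loads the word vectors in glove_2_wv. For word enhance, see this blog post: https://www.cnblogs.com/gogoSandy/p/14965711.html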
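A minimal sketch of the binary=True suggestion above, not the repository's glove_2_wv code. The helper name load_word_vectors and its loader parameter are hypothetical; it assumes gensim is installed and that reading a binary file as text fails with a ValueError (UnicodeDecodeError is a subclass), which is the usual symptom but not guaranteed for every file.

```python
def load_word_vectors(path, loader=None):
    """Try the text word2vec format first, then fall back to binary=True.

    `loader` is a hypothetical hook so the fallback logic can be exercised
    without gensim; by default it is gensim's KeyedVectors.load_word2vec_format.
    """
    if loader is None:
        # Imported lazily so this module can be imported without gensim.
        from gensim.models import KeyedVectors
        loader = KeyedVectors.load_word2vec_format
    try:
        return loader(path, binary=False)
    except ValueError:
        # Assumption: a binary file read as text fails to decode or parse.
        # UnicodeDecodeError is a subclass of ValueError, so it lands here too.
        return loader(path, binary=True)
```

Calling load_word_vectors("your_embedding_file") returns whatever the loader returns (a KeyedVectors object with the default loader). If both attempts fail, the second exception propagates, which usually means the file is in neither word2vec format.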

bert_bilstm_crf_adv: ValueError: Shape must be rank 2 but is rank 1 for 'task1_msra/crf_layer/Slice_2' (op: 'Slice') with input shapes: [?], [2], [2].

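@LinJingOK The problem is in data generation. giga and bert are two different tokenizers: the former works at word granularity, the latter at token granularity. All BERT models use the bert tokenizer, so their tfrecord file is bert_train.tfrecord. The other, non-BERT models use giga_train.tfrecord, and the vocabulary-enhancement files have names like giga_softword.tfrecord.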
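The naming convention described above can be sketched as follows. This is an illustration only, not code from the repository: the function names tokenizer_for and tfrecord_name are hypothetical, and deciding by whether "bert" appears in the model name is an assumption.

```python
def tokenizer_for(model_name: str) -> str:
    """Pick the tokenizer prefix for a model.

    Assumption: a model counts as a BERT model when its name contains "bert".
    """
    return "bert" if "bert" in model_name.lower() else "giga"


def tfrecord_name(model_name: str, split: str = "train") -> str:
    """Build the expected tfrecord file name, e.g. bert_train.tfrecord."""
    return f"{tokenizer_for(model_name)}_{split}.tfrecord"


print(tfrecord_name("bert_bilstm_crf_adv"))  # bert_train.tfrecord
print(tfrecord_name("bilstm_crf"))           # giga_train.tfrecord
```

If a BERT model is pointed at a giga_*.tfrecord file (or the reverse), the features no longer match what the model expects, which is one plausible way to end up with a shape error like the one reported in this issue.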

bert_bilstm_crf_adv: ValueError: Shape must be rank 2 but is rank 1 for 'task1_msra/crf_layer/Slice_2' (op: 'Slice') with input shapes: [?], [2], [2].

@LinJingOK The corresponding ckpt files are generated under checkpoint. You can run tensorboard --logdir ./checkpoint/your_model_path to check the model's current training progress.

DSXiangLi

Performance issue in utils.py (by P3)

Error when running main.py --model bert_bilstm_crf_adv --data msra,msr:

ctb.50d.vec file not found. data/msra/preprocess.py

Reproduction problems

Reproduction problems

bert_bilstm_crf_adv: ValueError: Shape must be rank 2 but is rank 1 for 'task1_msra/crf_layer/Slice_2' (op: 'Slice') with input shapes: [?], [2], [2].

bert_bilstm_crf_adv: ValueError: Shape must be rank 2 but is rank 1 for 'task1_msra/crf_layer/Slice_2' (op: 'Slice') with input shapes: [?], [2], [2].

Hello, I want to reproduce this with my own data. What steps are required?

HELP

tensorboard --logdir ./checkpoint/ner_msra_bert_bilstm_crf