nlp-paper icon indicating copy to clipboard operation
nlp-paper copied to clipboard

自然语言处理领域下的相关论文(附阅读笔记),复现模型以及数据处理等(代码含TensorFlow和PyTorch两版本)

Results 1 nlp-paper issues
Sort by recently updated
recently updated
newest added

Hello! I've found a performance issue in preprocess_tfrecord.py: `dataset.batch(batch_size, drop_remainder=drop_remainder)`[(line 179)](https://github.com/DengBoCong/nlp-paper/blob/60dfa0688d9e95d5dcce9586efcb045d8d47c7ac/paper-code/tensorflow_src/tools/preprocess_tfrecord.py#L179) should be called before `dataset.map(map_func=_parse_dataset_item, num_parallel_calls=mt.cpu_count())`[(line 173)](https://github.com/DengBoCong/nlp-paper/blob/60dfa0688d9e95d5dcce9586efcb045d8d47c7ac/paper-code/tensorflow_src/tools/preprocess_tfrecord.py#L173), which could make your program more efficient. Here is [the tensorflow document](https://tensorflow.google.cn/guide/data_performance?hl=zh_cn#vectorized_mapping)...