BBPE
BBPE copied to clipboard
BBPE 底层实现
Traceback (most recent call last): File "/home/quiana/work/python-toturial/Tokenization/BBPE/train.py", line 11, in BBPETokenizer.train_tokenizer(data, vocab_size, vocab_outfile=vocab_outfile, merges_outfile=merges_outfile) File "/home/quiana/work/python-toturial/Tokenization/BBPE/bbpe.py", line 170, in train_tokenizer most_common_pair = pair_freq.most_common(1)[0][0] ~~~~~~~~~~~~~~~~~~~~~~~~^^^ IndexError: list index out of range
https://github.com/OctopusMind/BBPE/blob/357382279e0f66b0e67acdd700898087f372f1db/bbpe.py#L124 如题,这里使用python多线程的作用是?