terminator123 issues

Results: 8 issues of terminator123

Model dimension issue

With n_positions=513 in the GPT-2 model config, loading fails with: size mismatch for transformer.h.0.attn.bias: copying a param with shape torch.Size([1, 1, 512, 512]) from checkpoint, the shape in current model is torch.Size([1, 1, 513, 513]). After changing it to 512, if use_gpt2=True, it reports size mismatch for...
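A mismatch like the one reported above can be located by comparing parameter shapes before loading. The following is only a minimal sketch: it assumes the shapes have already been collected into plain dicts (in PyTorch they would typically come from each tensor's `.shape` in a `state_dict()`), and `find_shape_mismatches` is a hypothetical helper, not part of any library.

```python
def find_shape_mismatches(checkpoint_shapes, model_shapes):
    """Return {name: (checkpoint_shape, model_shape)} for parameters whose shapes differ."""
    mismatches = {}
    for name, ckpt_shape in checkpoint_shapes.items():
        model_shape = model_shapes.get(name)
        # Only compare parameters present on both sides.
        if model_shape is not None and tuple(ckpt_shape) != tuple(model_shape):
            mismatches[name] = (tuple(ckpt_shape), tuple(model_shape))
    return mismatches


# Shapes copied from the error message in the issue.
checkpoint_shapes = {"transformer.h.0.attn.bias": (1, 1, 512, 512)}
model_shapes = {"transformer.h.0.attn.bias": (1, 1, 513, 513)}

mismatches = find_shape_mismatches(checkpoint_shapes, model_shapes)
for name, (ckpt, model) in mismatches.items():
    print(f"{name}: checkpoint {ckpt} vs model {model}")
```

Listing every mismatched parameter at once may make it easier to see whether a single config value (here n_positions) accounts for all of them.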

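Prediction issue

Using CDial-GPT2_LCCC-base directly for prediction. I had to modify part of the prediction code, otherwise it would not run: output = model(input_ids, token_type_ids=token_type_ids); logits = output.logits; logits = logits[0, -1, :] / args.temperature. No matter what the input is, the result is always the following: [12997, 7635, 12997, 7635, 12997, 12997, 12997, 12997, 7635, 12997, 7635, 12997,...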
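For readers unfamiliar with the `logits[0, -1, :] / args.temperature` step quoted above, here is a small pure-Python sketch of what temperature scaling does to the last-step logits before a token is chosen. It does not diagnose the repeated-output problem; the logit values are made up and `softmax_with_temperature` is a hypothetical helper.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Divide logits by the temperature, then apply a numerically stable softmax."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max so exp() cannot overflow
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up logits for the last position of a sequence (illustration only).
last_step_logits = [2.0, 1.0, 0.1]
probs = softmax_with_temperature(last_step_logits, temperature=0.7)

# Greedy choice: index of the most probable token.
next_token = max(range(len(probs)), key=probs.__getitem__)
print(probs, next_token)
```

Lower temperatures sharpen the distribution toward the top token, while higher temperatures flatten it.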

question about inference output

### Description When I use tst-decoder, I want to get each timestep's probability, such as 'a' [0.1, 0.1, 0.2, ...]. I tried adding a log parameter to decode_hparams, but it didn't work. I...

cnn-hierarchy code error

When I run this mode, it throws an error saying that 'X_tst' and 'Y_tst' are not defined. They should be 'x_tst' and 'y_tst' in cnn_train.py.

How to use the GPU with PaddleHub

I have both paddlepaddle and paddlepaddle-gpu installed. model = hub.Module(directory='./baidu_translate') runs on the CPU, which is rather slow. If I uninstall paddlepaddle and keep only paddlepaddle-gpu, hub raises an error: File "/home/chenqun/.conda/envs/end2endtrans/lib/python3.9/site-packages/paddlehub/__init__.py", line 20, in _paddle_version = Version(paddle.__version__) AttributeError: module 'paddle' has no attribute '__version__'
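When investigating a mixed install like the one described above, it can help to first list which paddlepaddle distributions the environment actually contains. This is a diagnostic sketch using only the standard library (`importlib.metadata`, Python 3.8+); it is not a fix, and `installed_paddle_packages` is a hypothetical helper name.

```python
from importlib import metadata


def installed_paddle_packages():
    """Return {distribution name: version} for installed paddlepaddle* distributions."""
    found = {}
    for dist in metadata.distributions():
        name = dist.metadata.get("Name")
        # Matches both "paddlepaddle" and "paddlepaddle-gpu".
        if name and name.lower().startswith("paddlepaddle"):
            found[name] = dist.version
    return found


packages = installed_paddle_packages()
print(packages or "no paddlepaddle distribution found")
```

Both distributions provide the same `paddle` import package, so seeing both listed (or seeing one listed while `paddle.__version__` is missing) may point to a partially overwritten install.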

[Question] how to merge the middle checkpoint file with lora

downloading wget https://msmarco.blob.core.windows.net/msmarcoranking/collection.tar.gz -O msmarco.tsv

Question about the beam setting