romain-rsr
romain-rsr
Hi, while trying to run inference rationale generation, I encountered this first issue : ``` self.mha_layer = torch.nn.MultiheadAttention(embed_dim=config.hidden_size, kdim=config.hidden_size, vdim=config.hidden_size, num_heads=1, batch_first=True) TypeError: __init__() got an unexpected keyword argument 'batch_first'...
When running the indicated command for rational training : ``` CUDA_VISIBLE_DEVICES=0,1 python main.py \ > --model allenai/unifiedqa-t5-base \ > --user_msg rationale --img_type detr \ > --bs 8 --eval_bs 4 --eval_acc...