Jianfeng Liu
Results: 2 issues of Jianfeng Liu
I'm interested in your paper, 'Input Combination Strategies for Multi-Source Transformer Decoder'. Would you mind telling me how I can reproduce this work? I want to cite this paper. Thanks.
I found that the parameter initialization in **reset_parameters()** of the **Embedding** class differs from the LoRA paper and from the other implementations in **layers.py**. I initialized lora_A with **nn.init.normal_()** while lora_B with...
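For context on the initialization question, here is a minimal sketch of the scheme described in the LoRA paper: A is drawn from a Gaussian and B is set to zero, so the low-rank update BA is zero when training starts. It uses plain Python lists rather than PyTorch tensors to stay self-contained. The helper names `init_lora_factors` and `matmul`, the shapes, and the standard deviation of 0.02 are assumptions made for illustration; this does not reproduce the code in **layers.py**.

```python
import random


def init_lora_factors(in_features, out_features, rank, std=0.02, seed=0):
    """Hypothetical helper: build LoRA factors the way the paper describes.

    A (rank x in_features) gets Gaussian entries; B (out_features x rank)
    is all zeros. The std value is an assumption, not taken from the source.
    """
    rng = random.Random(seed)
    A = [[rng.gauss(0.0, std) for _ in range(in_features)] for _ in range(rank)]
    B = [[0.0] * rank for _ in range(out_features)]
    return A, B


def matmul(X, Y):
    """Plain-Python matrix product of two list-of-lists matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


A, B = init_lora_factors(in_features=4, out_features=3, rank=2)

# The low-rank update added to the frozen weight: out_features x in_features.
delta_W = matmul(B, A)
```

Whichever factor is zeroed, the product starts at zero, so the adapted layer initially behaves like the frozen one. The issue is about which of the two factors receives the random initialization.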