Seungjae Park
Results
1
issues of
Seungjae Park
## Description The self.scaling parameter is created using torch.empty and is then used without being initialized. This is not a problem during finetuning, because pretrained checkpoints provide valid scaling parameters....