Seungjae Park

Results: 1 issue by Seungjae Park

## Description

The `self.scaling` parameter is created using `torch.empty` and is then used without being initialized. This is not a problem during finetuning, because pretrained checkpoints provide valid scaling parameters....
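
As a rough illustration of the pattern described above, the sketch below shows a parameter allocated with `torch.empty` and then given an explicit initial value, so the module does not depend on a checkpoint to overwrite it. The class name `ScaledLayer`, the `reset_parameters` helper, and the choice of 1.0 as the default are assumptions made for this example; the excerpt does not say which module is affected or what fix was adopted.

```python
import torch
import torch.nn as nn


class ScaledLayer(nn.Module):
    """Hypothetical module used only to illustrate the issue."""

    def __init__(self, dim: int):
        super().__init__()
        # torch.empty allocates storage without writing to it, so the
        # parameter holds arbitrary values until something fills it in.
        self.scaling = nn.Parameter(torch.empty(dim))
        # Explicit initialization, so no checkpoint is needed for valid values.
        self.reset_parameters()

    def reset_parameters(self) -> None:
        # Assumption: 1.0 (identity scaling) is a reasonable default.
        nn.init.ones_(self.scaling)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Element-wise scaling along the last dimension.
        return x * self.scaling


layer = ScaledLayer(4)
x = torch.randn(2, 4)
y = layer(x)  # with scaling initialized to ones, y equals x
```

Loading a pretrained checkpoint afterwards would still overwrite `scaling` through `load_state_dict`, so an explicit default like this should not change finetuning behavior.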