lpyhdzx

Results 2 issues of lpyhdzx

Very good work. I was looking at the paper and noticed more inconsistencies between the results in Table 1 and the fine-tuning results in github, for example, llama-7B has gsm8k...

Hi! Great work! I have a question about the default value of the lambda params. I've noticed that they are initialized to zero by default: `lambda_1_layer = torch.nn.Parameter(torch.tensor(0.0, device=self.device))` Given...