lyd126

Results 6 comments of lyd126

Thank you for your reply. To clarify, while running the example code (# Input Example: import torch x = torch.ones([6, 1024], device='cuda:0')......), when I set the 'expert' value to 1...

Thank you very much for your prompt reply. I set the gate_noise to 0 according to what you said, but the result is still the same as before, and it...

Thank you again for your reply. I saved the results according to your prompt, please see the attachment. As you said, **self.is_postscore always equals True**. In addition, I would also...

I made the changes you mentioned and the problem was solved perfectly. Also, I'd like to ask if I want to ignore the score at this point in the **_top-k=1_**...

I hope to use scores to determine which expert to work with, i.e. **_y=expert_n(x), n=softmax(score1, score2....)_**, but I want to ignore the scores i.e. _**y=expert_n(x) instead of y=score_n*expert_n(x)**_. Is this...

Thank you very much, the problem has been solved perfectly~~