Jamesgender
Results
2
issues of
Jamesgender
Thanks for your nice work! But I have a question about dropblock.The original paper writes "We only sample mask from shaded green region in which each sampled entry can expanded...
When I want to train mamba in other downstream tasks, it is hard to get good results. Any ideas?