Jamesgender

Results 2 issues of Jamesgender

Thanks for your nice work! But I have a question about dropblock.The original paper writes "We only sample mask from shaded green region in which each sampled entry can expanded...

When I want to train mamba in other downstream tasks, it is hard to get good results. Any ideas?