Yichen Feng

Results 2 comments of Yichen Feng

> @Tangshengku > > Bi-Mamba seems amazing! > > > The ppl is pretty bad with more than 3500+. So, have you ever tested the performance of your implementation before?...

@compilade Thanks for your explanation. When using [unofficial hf mamba-2 model](https://huggingface.co/AntonV/mamba2-130m-hf) or [official hf mamba-1 model](https://huggingface.co/state-spaces/mamba-130m-hf), I face a problem that the model cannot generate the EOS properly, as shown...