mamba
This looks like a cool project, but the name seems problematic because of the pre-existing package manager [Mamba](https://github.com/mamba-org/mamba). The Mamba SSM installation instructions currently suggest installing with pip, but Mamba...
Greetings! Thanks for your great work! When I ran the benchmark code, I got the error below. Could you please share some possible solutions? ``` python benchmarks/benchmark_generation_mamba_simple.py --model-name "/home/x/VisionProjects/mamba/ckpts/mamba-130m" --prompt...
Dear Mamba Contributors, I hope this message finds you well. I am in the process of utilising the Mamba state space architecture for a language modelling task and have been...
Amazing work, and I'm inspired by the connections to dynamical systems. Would you mind showing us a minimal example of training or finetuning this?
I see a good amount of focus on how to perform full training of Mamba, but what about PEFT, i.e. adapter/LoRA finetuning? The base models are in fact "Ready" for a...
Installing causal-conv1d and mamba-ssm failed. [Screenshot: 2023-12-06 163001]
What would be the best way to derive embeddings from Mamba models? Is there a straightforward approach, or would we need a new architecture?
Hi, I recently looked into your code (csrc/selective_scan/*), and I guess the largest part of the speedup comes from Ktraits::BlockScanT(smem_scan).InclusiveScan. But I see that besides that, you have also implemented...
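For readers unfamiliar with the scan mentioned in this issue, here is a minimal sketch of why an inclusive scan can compute a linear recurrence of the form h_t = a_t * h_{t-1} + b_t. It is plain Python, not the repository's CUDA kernel, and the names `combine`, `scan_recurrence`, and `sequential_recurrence` are hypothetical, chosen only for illustration. The idea is that each step is an affine map, composing affine maps is associative, and any associative operator can be evaluated with a scan (which a GPU can parallelize).

```python
from itertools import accumulate


def combine(left, right):
    """Compose two affine steps h -> a*h + b, applying `left` first."""
    a1, b1 = left
    a2, b2 = right
    # right(left(h)) = a2*(a1*h + b1) + b2 = (a1*a2)*h + (a2*b1 + b2)
    return (a1 * a2, a2 * b1 + b2)


def scan_recurrence(a, b):
    """Return h_1..h_T for h_t = a_t*h_{t-1} + b_t with h_0 = 0,
    computed as an inclusive scan over (a_t, b_t) pairs."""
    # accumulate runs the scan sequentially here; because `combine` is
    # associative, the same result could be computed in parallel.
    prefixes = accumulate(zip(a, b), combine)
    # With h_0 = 0, the state is just the offset of the composed map.
    return [offset for _, offset in prefixes]


def sequential_recurrence(a, b):
    """Reference implementation: the plain step-by-step loop."""
    h, out = 0.0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


a = [0.5, 2.0, 1.0]
b = [1.0, 1.0, 3.0]
print(scan_recurrence(a, b))        # matches the sequential loop
print(sequential_recurrence(a, b))
```

Both functions return the same sequence; the scan form only matters because its operator can be applied in any grouping, which is what makes a block-level parallel scan applicable.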
[image] Hi, author, where can I find the module named 'selective_scan_cuda'?