mamba issues

When we should set d_state=64?

2

According to most cv paper(like vim), they commonly set the d_state=16. But I wonder if d_state=64 should be attached with the number of tokens

CacatuaAlan

Wheel file errors: The cu118 wheel files would install some CUDA 12.1 version packages.

at least including following files: **mamba_ssm-1.2.0.post1+cu118torch2.2cxx11abiTRUE-cp312-cp312-linux_x86_64.whl** **mamba_ssm-1.2.0.post1+cu118torch2.3cxx11abiTRUE-cp312-cp312-linux_x86_64.whl** **mamba_ssm-1.2.0.post1+cu118torch2.3cxx11abiFALSE-cp312-cp312-linux_x86_64.whl**

JayNine25

ImportError: /home/yida/miniconda3/envs/mambair/lib/python3.9/site-packages/selective_scan_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN3c105ErrorC2ENS_14SourceLocationENSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE

8

![image](https://github.com/state-spaces/mamba/assets/54884887/fb05bf20-4bd5-43de-90f4-a4dd1805b3df) Have some one meet this question，how to deal this problem. Thanks you.

yidamyth

Issue loading finetuned mamba model into MambaLMHeadModel

3

I am getting the following error trying to load a Mamba model: `TypeError: MambaConfig.__init__() got an unexpected keyword argument '_name_or_path'` This is due to the config.json having this as its...

thomas-bartlett

Are there plans to release mamba for torch 2.3?

Hello, currently I can get mamba working on 2.2 but that breaks flash-attn, which works on 2.3 but breaks on 2.2. Both have the same error of unrecognized symbol importing...

s-rog

Issue: Forward pass of mamba block results in 0s on a Quadro P6000 GPU

12

Once I had my mamba environment set up and mamba-ssm installed successfully, I ran the example code provided by the repository and the output of x and y was as...

ICharlotteI

How can I know what the variables in the code refer to?

1

I studied the paper of mamba and got this code. but I still do not know how to implement it. Because I cannot understand what mean the variable in the...

Liyuansongdsb

Share the environment that worked.

1

I have been successfully run. Environment follows: cuda 11.8 python 3.10.13 pytorch 2.1.1 causal_conv1d 1.1.1 mamba-ssm 1.2.0.post1 ``` pip install torch==2.1.1 torchvision==0.16.1 torchaudio==2.1.1 --index-url https://download.pytorch.org/whl/cu118 pip install causal_conv1d==1.1.1 pip install...

uxhao-o

CUDA version "causal-conv1d installation error"

![image](https://github.com/state-spaces/mamba/assets/130962757/060e4238-1eeb-464a-8409-1627a2bcd96b) "I want to install causal-conv1d, but I encountered the following issue." "How can I ensure that the CUDA version and nvcc version are the same, and which should I...

BNUWUU

What is the reasoning behind adding causal_conv +silu before SSM?

4

I try 2 variants: remove causal conv and remove both causal conv and silu, and they both seem to destabilize training and give me NaN. Is it normal?

yxchng

mamba
mamba copied to clipboard

Metadata

When we should set d_state=64?

Wheel file errors: The cu118 wheel files would install some CUDA 12.1 version packages.

ImportError: /home/yida/miniconda3/envs/mambair/lib/python3.9/site-packages/selective_scan_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN3c105ErrorC2ENS_14SourceLocationENSt7__cxx1112basic_stringIcSt11char_traitsIcESaIcEEE

Issue loading finetuned mamba model into MambaLMHeadModel

Are there plans to release mamba for torch 2.3?

Issue: Forward pass of mamba block results in 0s on a Quadro P6000 GPU

How can I know what the variables in the code refer to?

Share the environment that worked.

CUDA version "causal-conv1d installation error"

What is the reasoning behind adding causal_conv +silu before SSM?

← Metadata

Owner

Metadata

mamba mamba copied to clipboard

Metadata

← Metadata

Owner

Metadata

mamba
mamba copied to clipboard