Tri Dao


No, we have no plan for Mac.

We've trained from scratch without the causal conv and it's still fine, just with worse quality.

Yes, worse validation loss without the conv1d.

I'm not familiar with the implementation in HF. In any case, it's just model code, so I think training from scratch should work.
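
For concreteness, a minimal, untested sketch of from-scratch training with the HF model code, assuming a transformers version that ships the Mamba classes; the config values here are illustrative, not a recommendation:

```python
import torch
from transformers import MambaConfig, MambaForCausalLM

# Building the model from a config (instead of from_pretrained) gives
# randomly initialized weights, i.e. a from-scratch model.
config = MambaConfig(
    vocab_size=50280,        # illustrative values; match your tokenizer/data
    hidden_size=768,
    num_hidden_layers=24,
)
model = MambaForCausalLM(config)

# Standard causal-LM step: passing labels=input_ids yields the LM loss.
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
input_ids = torch.randint(0, config.vocab_size, (2, 128))
loss = model(input_ids=input_ids, labels=input_ids).loss
loss.backward()
optimizer.step()
```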

Counting params should just be `sum(p.numel() for p in model.parameters())`. I'm not familiar with thop. I assume there's some way to specify how many flops a custom operation takes.
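
As a sketch of how that could look with thop: the parameter count is exactly the one-liner above, and thop's `custom_ops` lets you attach a counting hook to a module type it doesn't know about. The `SelectiveScanStub` class and the FLOP formula below are placeholders (not the real Mamba module or an official count):

```python
import torch
import torch.nn as nn
from thop import profile

# Exact parameter count, as in the one-liner above.
def count_params(model):
    return sum(p.numel() for p in model.parameters())

# Stand-in for a custom module thop can't count on its own
# (e.g. a selective-scan block); not the real Mamba module.
class SelectiveScanStub(nn.Module):
    def __init__(self, d_model, d_state=16):
        super().__init__()
        self.d_state = d_state
        self.proj = nn.Linear(d_model, d_model)

    def forward(self, x):
        return self.proj(x)

# thop calls this hook on the module's forward pass and accumulates
# whatever you add to module.total_ops; the formula is a placeholder.
def count_selective_scan(module, inputs, output):
    b, l, d = inputs[0].shape            # (batch, seqlen, d_model)
    module.total_ops += torch.DoubleTensor([9.0 * b * l * d * module.d_state])

model = nn.Sequential(SelectiveScanStub(d_model=256))
x = torch.randn(1, 128, 256)
print("params:", count_params(model))
ops, params = profile(model, inputs=(x,), custom_ops={SelectiveScanStub: count_selective_scan})
print("thop estimate:", ops)
```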

The initial value is set in the prefix_op. You probably want to change this line: https://github.com/state-spaces/mamba/blob/12d855003ba92c8a15d1739ce65a14c6fb16e254/csrc/selective_scan/selective_scan_fwd_kernel.cuh#L239 to something like (I haven't tested this):

```
if (chunk == 0) { running_prefix...
```

You can print stuff out with printf to see if you're accessing the right indices.