mamba icon indicating copy to clipboard operation
mamba copied to clipboard

Replace mamba1 with mamba2 and training becomes very slow!

Open YQ-097 opened this issue 1 year ago • 2 comments
trafficstars

@torch.compile(options={"triton.cudagraphs": True}, fullgraph=True) generates an error. Is there any other way?

YQ-097 avatar Aug 01 '24 14:08 YQ-097

If you use a large model the triton overhead will be neglibile.

tridao avatar Aug 01 '24 23:08 tridao

@torch.compile(options={“triton.cudagraphs”: True}, fullgraph=True) 生成错误。还有其他方法吗?

I encounted some questions when i chose to value mamba2 instead of mamba. image Dose it mean that i should vary the MambaConfig?

dragonBrother1 avatar Aug 14 '24 03:08 dragonBrother1