mamba icon indicating copy to clipboard operation
mamba copied to clipboard

Question about support for sequence parallel

Open zigzagcai opened this issue 1 year ago • 7 comments
trafficstars

Hi,

I recently learnt about this selective SSM architecture, and it was awesome! But I have some questions. We know that the Transformer architecture supports sequence parallelism, so does Mamba (the potential alternative of Transformer) support sequence parallelism?

zigzagcai avatar Feb 19 '24 06:02 zigzagcai

In general, yes. Which flavor of sequence parallelism are you referring to? The one in Megatron-LM?

tridao avatar Feb 19 '24 06:02 tridao

In general, yes. Which flavor of sequence parallelism are you referring to? The one in Megatron-LM?

Thanks for your timely response! Sure. I am referring to the one in Megatron-LM. I am wondering does Mamba has built-in support for this kind of sequence parallel, or we need to implement it manually?

zigzagcai avatar Feb 19 '24 06:02 zigzagcai

Nothing is built-in, but it'll be implemented in the future.

tridao avatar Feb 19 '24 06:02 tridao

Got it. Thanks!

zigzagcai avatar Feb 19 '24 07:02 zigzagcai