Nadav Schneider

Results 1 issues of Nadav Schneider

Hi! I'm using SFTTrainer (inherited from Transformers Trainer) to fine-tune Mamba2. When using cuda_kernels_forward in Mamba2 on multiple GPUs the following error appears (full traceback in the end): ``` config.pre_hook({**self.nargs,...