Sourab Mangrulkar
Sourab Mangrulkar
Without using the env variable `TORCHDYNAMO_DEBUG_FUNCTION=forward`, I get the following error: ``` File "/raid/sourab/transformers/src/transformers/models/bert/modeling_bert.py", line 286, in forward mixed_query_layer = self.query(hidden_states)result = forward_call(*args, **kwargs) File "/raid/sourab/miniconda3/envs/hf/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl...
Hello @aohan237 , DeepSpeed isn't compatible with bitsandbytes 4-bit quantization. It is also not compatible with torch compile.
Hello @stas00, @tjruwase, @muellerzr and @BenjaminBossan; Would be interested in knowing your thoughts.
Gentle ping @ashwinb as you had raised the related commit. Please look into it, Thank you!