Jeff Rasley
I've repro'd your issue, will let you know when I have a fix. Our support on Windows is unfortunately not as thoroughly tested as on Linux. I recognize how funny...
I repro'd this on a Windows box that does not have a GPU. Can you confirm that torch sees your GPU from Windows? Can you share the results of `torch.cuda.is_available()`...
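Something like this minimal check would tell us what torch sees (device index 0 is just an example):

```python
import torch

# Confirm torch was built with CUDA support and can see a GPU from Windows.
print(torch.cuda.is_available())   # True if a CUDA device is visible
print(torch.cuda.device_count())   # number of visible GPUs

# If a GPU is visible, print its name (index 0 assumed here).
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```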
This is definitely on our upcoming TODO list to investigate. Are you saying you've tried your own custom kernel injection policy and it (partially?) works?
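For context, a custom policy is passed via the `injection_policy` argument of `deepspeed.init_inference`. A rough sketch of the shape, using GPT-2 purely as an example (`mp_size=2` and the specific layer names are my assumptions, not a verified recipe):

```python
import torch
import deepspeed
from transformers import AutoModelForCausalLM
from transformers.models.gpt2.modeling_gpt2 import GPT2Block

model = AutoModelForCausalLM.from_pretrained("gpt2")

# injection_policy maps a transformer block class to the linear layers whose
# outputs need an all-reduce when the model is sharded across GPUs.
# The attribute paths below follow GPT-2's module layout (attn.c_proj / mlp.c_proj).
model = deepspeed.init_inference(
    model,
    mp_size=2,
    dtype=torch.half,
    injection_policy={GPT2Block: ("attn.c_proj", "mlp.c_proj")},
)
```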
Thank you for reporting this! I've verified we can repro this on our side as well, but only when using more than one GPU. There's a gap currently in our CI tests...
Hi @lanking520, I just tried all your repro steps above and was not able to repro the stack trace. Can you confirm what `transformers` version you are using? I tried...
Also, just to double check, you can run fine if you remove `deepspeed.init_inference`, right?
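In other words, something like this sketch, where the only change is whether the `init_inference` wrap happens (the model name and args are just examples):

```python
import torch
import deepspeed
from transformers import pipeline

# Baseline: plain HF pipeline, no DeepSpeed involved.
pipe = pipeline("text-generation", model="gpt2", device=0)

# DeepSpeed path: wrap the underlying model with init_inference.
# Commenting out these lines should give the working baseline above.
pipe.model = deepspeed.init_inference(
    pipe.model, mp_size=1, dtype=torch.half, replace_with_kernel_inject=True
)

print(pipe("DeepSpeed is"))
```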
We've definitely been watching if/when PyTorch will support M1; it sounds like it's planned, though. https://github.com/pytorch/pytorch/issues/47702 Specifically, see this comment: https://github.com/pytorch/pytorch/issues/47702#issuecomment-965625139 In terms of DeepSpeed support for M1, I suspect...
I should also note that I think we have similar complexities with `communication_data_type`, which we only support in some configs, but I'm not sure if we error out explicitly in cases...
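For concreteness, a sketch of the kind of config I mean (the values and optimizer choice are illustrative, and this assumes you launch with the `deepspeed` launcher):

```python
import torch
import deepspeed

# "communication_data_type" sets the dtype used for gradient communication;
# which config combinations actually support it is the open question above,
# so treat this as a sketch, not a verified recipe.
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
    "communication_data_type": "fp32",
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

model = torch.nn.Linear(16, 16)  # toy model just to make the call concrete
engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)
```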
@joehoover, can you give it a try now with this PR linked above? I think we have this fixed now.
Hi @TianhaoFu, can you share `ds_report` with me? I am curious what DeepSpeed version or commit hash you were on. I am trying to reproduce your issue. Also if...
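If `ds_report` is awkward to run, even just the version info from Python would help, e.g.:

```python
import torch
import deepspeed

# Quick environment summary when the full `ds_report` output is unavailable.
print("deepspeed:", deepspeed.__version__)
print("torch:", torch.__version__, "| cuda:", torch.version.cuda)
```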