Daniel Han comments

Results 781 comments of


                                            Daniel Han

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!

@skmanzg Could you try before running Unsloth: ```python import os os.environ["CUDA_DEVICE_ORDER"] = "PCI_BUS_ID" os.environ["CUDA_VISIBLE_DEVICES"] = "0" ```

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1!

Great!

ImportError: cannot import name 'is_bfloat16_supported' from 'unsloth' on local GPU

@regstuff Could you try uninstalling then installing Unsloth ie: ```python pip uninstall unsloth -y pip install --upgrade --force-reinstall --no-cache-dir git+https://github.com/unslothai/unsloth.git ```

ImportError: cannot import name 'is_bfloat16_supported' from 'unsloth' on local GPU

Oh interesting on packaging

Apply unsloth optimizations

If this is a major request by the OSS community - I'm more than happy to include some of the changes from Unsloth!

Apply unsloth optimizations

Yes 0% loss in accuracy - we do actual FLOP reductions via our manual autograd engine. I'm actually working with @casper-hansen and some other Axolotl people to put some methods...

Apply unsloth optimizations

@casper-hansen Oh cool - I'll have a look! Ye I'll try to make a PR to axolotl!!

Apply unsloth optimizations

@fakerybakery Sorry not yet - I'll take a look at the PR Casper made, but it might take some time

Can't use output_attentions when using unsloth

I don't think that'll work :( We use FA2 and SDPA so the attention output is actually never constructed