Sylvain Gugger
cc @muellerzr Could you have a look into it?
> but, even the naive Data Parallel with AllReduce, shouldn't it be like this?

`device_map="auto"` is not data parallelism, it's model parallelism (your model is split across the GPUs). It is...
Not sure what the issue is here. You have a new tensor with no corresponding weight in the checkpoint, so it does not work.
> this new tensor needs to be initialized randomly during training

So make sure you properly initialize that weight in the `_init_weights` function of your custom model.
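For reference, a minimal sketch of the initialization pattern, using plain PyTorch to mimic what `_init_weights` typically does in a `PreTrainedModel` subclass (the class and value names here are illustrative, not from this thread):

```python
import torch
import torch.nn as nn

# Illustrative module standing in for a custom head with a new tensor.
class MyHead(nn.Module):
    def __init__(self, hidden=16, labels=4):
        super().__init__()
        self.classifier = nn.Linear(hidden, labels)

def init_weights(module, initializer_range=0.02):
    # Same recipe most transformers models use in _init_weights:
    # normally distributed weights, zero biases.
    if isinstance(module, nn.Linear):
        module.weight.data.normal_(mean=0.0, std=initializer_range)
        if module.bias is not None:
            module.bias.data.zero_()

head = MyHead()
head.apply(init_weights)
print(torch.all(head.classifier.bias == 0).item())  # True
```

In a real custom model you would put the body of `init_weights` inside your model's `_init_weights(self, module)` so `from_pretrained` can initialize any weight missing from the checkpoint.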
You also cannot call `to(xxx)` or `cuda()` on a model loaded with `device_map='auto'`. The model will already be loaded on the GPUs you have available.
Why does it not work? You have enough room to fit your whole model on the GPU, so this is what `infer_auto_device_map` does.
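A rough sketch of the logic: `infer_auto_device_map` effectively places each layer on the first device whose remaining memory budget can hold it, so if one GPU fits everything, everything lands there. The layer sizes and budgets below are made-up numbers for illustration, not the real accelerate algorithm:

```python
# Greedy placement sketch: assign each layer to the first device with room.
def naive_device_map(layer_sizes, budgets):
    device_map, remaining = {}, dict(budgets)
    for name, size in layer_sizes.items():
        for device, free in remaining.items():
            if size <= free:
                device_map[name] = device
                remaining[device] -= size
                break
        else:
            device_map[name] = "cpu"  # offload when nothing fits
    return device_map

layers = {"embed": 4, "block0": 3, "block1": 3, "head": 2}
# A single GPU with budget 12 holds the whole model, so no splitting happens.
print(naive_device_map(layers, {0: 12}))
# {'embed': 0, 'block0': 0, 'block1': 0, 'head': 0}
```

With two smaller budgets (e.g. `{0: 6, 1: 6}`) the same function would spread the layers across devices, which is the model-parallel split `device_map="auto"` produces when one GPU is not enough.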
To train your model using data parallelism and model parallelism, you need to use DeepSpeed or FSDP.
I am not able to reproduce any of the bugs you mention, can you try installing from source?
You forget that one parameter takes 4 bytes of space. With 1000*1000*6 set as max space, you cannot fit your whole model. Also it needs to make sure you will...