Benjamin Bossan comments

Results 794 comments of


                                            Benjamin Bossan

Transfer skorch models from CUDA to CPU for inference

Hmm, I can't reproduce this. Here is the script that I used: ```python import pickle import sys import numpy as np import torch from sklearn.datasets import make_classification from torch import...

Transfer skorch models from CUDA to CPU for inference

Thanks for the reproducer. I could verify that this fails at loading the model on a CPU machine. I tried to debug a little bit and it appears that when...

Transfer skorch models from CUDA to CPU for inference

Thanks for providing further information. Without digging deeper: When pickling, skorch checks attributes with a CUDA-dependency, pops them from the pickle state, and saves them in a way that allows...

DoRA implementation differs from PEFT DoRA

Thanks for the quick response. It can indeed be a bit confusing on what axis the DoRA scaling should be applied, especially with the transpose operation that's implicit in the...

DoRA implementation differs from PEFT DoRA

I agree that breaking the existing method is not a good idea. Whether adding a new option to use the other axis is worth it, I don't know. I dug...

DoRA implementation differs from PEFT DoRA

Maybe @nbasyl can comment on the notation and if it would make sense to have an option to swap the axis.

Error running notebook launcher in google Colab

I have very little experience with google colab or XLA, but to me this looks like a PyTorch-XLA error and not something specific to accelerate notebook launcher or even accelerate...

Error running notebook launcher in google Colab

Thanks for testing again. I agree that it's strange that the errors are random and that this could be caused by a race condition. I asked internally if there is...

_prepare_deepspeed fail to capture correct kwargs with DummyOptim or DummyScheduler when calling prepare() multiple times

Do you really need to call `prepare` multiple times? You should be able to run `prepare` in a single call, right? ```python return_values = self.accelerator.prepare(*accelerator_to_prepare.values()) for k, val in zip(accelerator_to_prepare.keys(),...

_prepare_deepspeed fail to capture correct kwargs with DummyOptim or DummyScheduler when calling prepare() multiple times

The deepspeed init logic is probably not easy to fix, but I'll wait for Zach's return to comment on that. Regarding the docs, yes, probably it should be highlighted that...