Jeremy Cochoy comments

Results: 8 comments by Jeremy Cochoy

`Error while converting op of type: Conv. Error message: provided number axes -1 not supported`

> @kossnick Can you please try converting the model using the change in PR #524? I had exactly this problem with my "home made" model. Forcing the rank...

`Error while converting op of type: Conv. Error message: provided number axes -1 not supported`

That may be the source of the problem, but I don't know whether the previous implementation (`rank = len(graph.shape_dict[node.inputs[0]])`) would work with the current code base. Unfortunately, I don't have the...

Conversion of an fft and ifft Lambda layer

@breizhn Unfortunately, I got a little overwhelmed by work and didn't make progress on the fft/ifft operators PR. I should definitely resume this. But this is only for the specification part....

Missing import from train/train.py for LORA training

That's indeed what I have done, but this seems to be insufficient to run the original LORA configuration. I was able to reproduce the original LORA training from the original...

Missing import from train/train.py for LORA training

Thanks. I will have a look this evening and keep you updated 👍

Missing import from train/train.py for LORA training

I tried the latest HEAD. The code does seem to run (i.e. what I got when I copy-pasted the missing functions into the file); however, I immediately get an...

Missing import from train/train.py for LORA training

> If there is only one GPU, maybe you can directly run `train_lora.py` without FSDP (in case it's FS-Data-Parallel). Besides, as mentioned [here](https://github.com/lm-sys/FastChat/blob/main/fastchat/train/train_lora.py#L97-L101), gradient checkpointing with LoRA needs a monkey patch...

undefined reference error

I just tried to compile it on an Ubuntu VM and didn't get any error. But I noticed that the README.md was unclear. Did you create a `build`...