Baoyuan Qi

6 comments by Baoyuan Qi

I have made a pull request to fix this bug. See https://github.com/wurstmeister/storm-docker/pull/18

I came up with the code below. The `id` of `optimizer` changes when `on_train_epoch_start` is called. Drawbacks: it still needs the `lightning` package installed and can only be performed in a...
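The idea above can be sketched without the `lightning` dependency. This is a minimal plain-PyTorch illustration, not the original code: rebuilding the optimizer at the start of each epoch (as one would do inside a Lightning `on_train_epoch_start` hook) produces a fresh object with a new `id` and, importantly, fresh optimizer state. The model, learning rate, and `make_optimizer` helper here are hypothetical.

```python
import torch
import torch.nn as nn

# Toy model standing in for the real one.
model = nn.Linear(4, 2)

def make_optimizer(model):
    # Build a fresh optimizer over the currently trainable parameters.
    params = [p for p in model.parameters() if p.requires_grad]
    return torch.optim.AdamW(params, lr=1e-3)

optimizers = []
for epoch in range(3):
    # Rebuilding each epoch discards stale momentum / Adam moments,
    # which simply toggling `requires_grad` would not do.
    optimizer = make_optimizer(model)
    optimizers.append(optimizer)

    x = torch.randn(8, 4)
    loss = model(x).pow(2).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()

# Each epoch got a distinct optimizer instance (distinct `id`s).
assert len({id(o) for o in optimizers}) == 3
```

In Lightning itself the equivalent would be constructing the new optimizer inside the `on_train_epoch_start` hook and reassigning it through the trainer, which is why the approach is tied to the `lightning` package.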

I have conducted experiments on llama2-7b using the full, lisa_2, and lisa_32 methods. From the image above, you can see that the training loss curve decreases and full is the same as...

> Hi @geronimi73 I think you are right. > > I also think the main issue comes from the optimizer. Directly toggling `requires_grad` to dynamically select layers cannot achieve dynamically...
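To make the point above concrete, here is a hypothetical sketch of LISA-style layer selection via `requires_grad`: a random subset of layers is unfrozen each epoch. The `select_layers` helper and layer sizes are illustrative assumptions. Note that if a single optimizer was built once over all parameters, its Adam state for previously active layers survives the switch, which is why toggling `requires_grad` alone does not fully reselect layers.

```python
import random
import torch
import torch.nn as nn

torch.manual_seed(0)
random.seed(0)

# Toy stack of layers standing in for transformer blocks.
layers = nn.ModuleList([nn.Linear(4, 4) for _ in range(8)])

def select_layers(layers, n_active):
    # Unfreeze a random subset of layers; freeze the rest.
    active = set(random.sample(range(len(layers)), n_active))
    for i, layer in enumerate(layers):
        for p in layer.parameters():
            p.requires_grad = i in active
    return active

active = select_layers(layers, n_active=2)
trainable = sum(p.requires_grad for layer in layers for p in layer.parameters())

# 2 active layers x (weight + bias) = 4 trainable tensors.
assert len(active) == 2
assert trainable == 4
```

Gradients now flow only into the selected layers, but a stale optimizer would still hold (and could still apply, e.g. via weight decay) state for the frozen ones; rebuilding the optimizer after each selection avoids that.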

> Hello, Just stumbled into this while trying to figure out how I could change the optimizer / scheduler in the middle of the training. I have managed to do...