pytorch_optimizer

optimizer & lr scheduler & loss function collections in PyTorch

16 pytorch_optimizer issues, sorted by most recently updated

Hi, thank you so much for your repo. I am using the SAM optimizer but I am facing this error; how can I fix it? `RuntimeError: [-] Sharpness Aware Minimization (SAM) requires...` (see the usage sketch after this entry)

question
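
The truncated RuntimeError above most likely points to SAM needing a closure, or explicit `first_step`/`second_step` calls, since SAM performs two forward/backward passes per update. Below is a minimal sketch of that two-pass pattern, assuming the constructor and step methods follow the reference SAM implementation; the exact signature in pytorch_optimizer may differ, so check its docs.

```python
import torch
from pytorch_optimizer import SAM  # assuming SAM is exported at the package root

model = torch.nn.Linear(10, 2)
criterion = torch.nn.CrossEntropyLoss()

# SAM wraps a base optimizer; rho controls the perturbation neighborhood.
# Constructor signature assumed from the reference implementation.
optimizer = SAM(model.parameters(), base_optimizer=torch.optim.SGD, lr=0.1, rho=0.05)

x = torch.randn(8, 10)
y = torch.randint(0, 2, (8,))

# first forward/backward: gradients at the current weights
criterion(model(x), y).backward()
optimizer.first_step(zero_grad=True)   # perturb weights toward the worst case

# second forward/backward: gradients at the perturbed weights
criterion(model(x), y).backward()
optimizer.second_step(zero_grad=True)  # undo the perturbation, take the real step
```

In the reference implementation, `step(closure)` runs both passes for you when a closure that re-computes the loss and calls `backward()` is provided; the wording of the error suggests the same contract here.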

Hi, I just discovered your repo and I would like to try it to fine-tune my ParlAI blenderbot2 model (see https://github.com/facebookresearch/ParlAI). However, I am running the model in FP16 precision... (a generic mixed-precision sketch follows this entry)

feature request
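
ParlAI manages FP16 through its own wrappers, so whether a third-party optimizer drops in cleanly depends on that integration. As a generic reference point, here is a minimal mixed-precision loop using PyTorch's native AMP with an optimizer from this collection; AdamP is only an example, and the model and data are placeholders.

```python
import torch
from torch.cuda.amp import autocast, GradScaler
from pytorch_optimizer import AdamP  # any optimizer from the collection

model = torch.nn.Linear(10, 2).cuda()
criterion = torch.nn.CrossEntropyLoss()
optimizer = AdamP(model.parameters(), lr=1e-3)

scaler = GradScaler()  # keeps FP16 gradients in a representable range

x = torch.randn(8, 10, device="cuda")
y = torch.randint(0, 2, (8,), device="cuda")

optimizer.zero_grad()
with autocast():                  # forward pass in mixed precision
    loss = criterion(model(x), y)
scaler.scale(loss).backward()     # backward on the scaled loss
scaler.step(optimizer)            # unscales grads, skips the step on inf/nan
scaler.update()
```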

Paper or Code: the REX LR scheduler, from https://arxiv.org/abs/2107.04197. Implementation is based on https://github.com/Nerogar/OneTrainer/blob/2c6f34ea0838e5a86774a1cf75093d7e97c70f03/modules/util/lr_scheduler_util.py#L66 (a LambdaLR prototype follows this entry)

feature request
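
Until a native scheduler lands in the package, a REX-style decay can be prototyped with `torch.optim.lr_scheduler.LambdaLR`. The multiplier below follows the schedule's shape as described in the paper (slower-than-linear decay early, faster near the end); treat it as a sketch, not a copy of the referenced OneTrainer code.

```python
import torch
from torch.optim.lr_scheduler import LambdaLR

model = torch.nn.Linear(10, 2)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)

total_steps = 1000

def rex_lambda(step: int) -> float:
    # progress z in [0, 1]; REX multiplier (1 - z) / (0.5 + 0.5 * (1 - z))
    z = min(step / total_steps, 1.0)
    return (1.0 - z) / (0.5 + 0.5 * (1.0 - z))

scheduler = LambdaLR(optimizer, lr_lambda=rex_lambda)

for step in range(total_steps):
    # ... forward / backward ...
    optimizer.step()
    scheduler.step()
```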

SAM as an Optimal Relaxation of Bayes: "Sharpness-aware minimization (SAM) and related adversarial deep-learning methods can drastically improve generalization, but their underlying mechanisms are not yet fully understood...."

feature request

In `pytorch-optimizer v3`, loss functions will be added, so optimizers, lr schedulers, and loss functions will finally all be in one package. Feature checklist: [x] support at least... (a hedged usage sketch follows this entry)

feature
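
A hedged sketch of what the all-in-one usage could look like once v3 ships. `FocalLoss` is an assumed example name and `load_optimizer` is assumed to resolve optimizers by string; check the v3 release notes for the actual exports and signatures.

```python
import torch
from pytorch_optimizer import load_optimizer  # assumed: returns an optimizer class by name
from pytorch_optimizer import FocalLoss       # assumed example loss export in v3

model = torch.nn.Linear(10, 1)

optimizer = load_optimizer('adamp')(model.parameters(), lr=1e-3)
criterion = FocalLoss()  # default arguments assumed

x = torch.randn(8, 10)
y = torch.randint(0, 2, (8, 1)).float()

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
optimizer.step()
```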

#params = 151111638 #non emb params = 41066400 | epoch 1 step 50 | 50 batches | lr 0.06 | ms/batch 1378.43 | loss 7.85 | ppl 2570.784 | epoch...

bug

Paper and Code. Paper: [Memory Efficient Optimizers with 4-bit States](https://arxiv.org/abs/2309.01507). Code: https://github.com/thu-ml/low-bit-optimizers/blob/main/lpmm/optim/optimizer.py (a toy quantization sketch follows this entry)

feature request
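
The core trick in the paper is to keep optimizer state (e.g. Adam's moments) in 4-bit blocks and dequantize on the fly inside `step()`. Below is a toy sketch of block-wise absmax quantization of a single state tensor; a real implementation packs two 4-bit codes per byte and uses the paper's non-linear quantization maps rather than this linear one.

```python
import torch

BLOCK = 128  # elements per quantization block

def quantize_4bit(state: torch.Tensor):
    """Block-wise absmax 4-bit quantization (toy version, linear code map)."""
    flat = state.flatten().float()
    pad = (-flat.numel()) % BLOCK
    flat = torch.cat([flat, flat.new_zeros(pad)])
    blocks = flat.view(-1, BLOCK)
    scales = blocks.abs().amax(dim=1, keepdim=True).clamp_min(1e-12)
    # map [-1, 1] onto the 15 signed levels {-7, ..., 7}
    # (stored as int8 here for simplicity; real 4-bit storage packs two codes per byte)
    q = torch.round(blocks / scales * 7).clamp(-7, 7).to(torch.int8)
    return q, scales, state.shape, pad

def dequantize_4bit(q, scales, shape, pad):
    flat = (q.float() / 7 * scales).flatten()
    if pad:
        flat = flat[:-pad]
    return flat.view(shape)

# round-trip example: pretend this tensor is Adam's exp_avg
m = torch.randn(300)
q, s, shape, pad = quantize_4bit(m)
m_hat = dequantize_4bit(q, s, shape, pad)
print((m - m_hat).abs().max())  # quantization error
```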

I just swapped the Nero optimizer out of my Lightning AI loop and gave the new Shampoo a try. There is something going on with it, as this card is typically...

performance

https://arxiv.org/abs/2211.09760: "While deep learning models have replaced hand-designed features across many domains, these models are still trained with hand-designed optimizers. In this work, we leverage the same scaling approach..."

feature request