parap1uie-s
parap1uie-s
如题,为什么不直接调用nn.LayerNorm?
As the issue title. My base module is a 3-layers GRU, and the synthetic module is another RNN. I want to training base module in `BPTT` mode without synthetic gradients,...
Hi, we re-implemented ACNet based on PaddlePaddle, which is included in official models. [models](https://github.com/PaddlePaddle/models/blob/develop/PaddleCV/image_classification/models/resnet_acnet.py) [weight utils](https://github.com/PaddlePaddle/models/tree/develop/PaddleCV/image_classification/utils/acnet) Nice work to ACNet!
### Describe the issue Issue: I am finetuning LLaVA 1.5 13b using `scripts/v1.5/finetune_task_lora.sh` on my custom dataset. Training process looks normal (~0.4) until a iteration (randomly, no pattern found yet),...