PommesPeter

Results 39 comments of PommesPeter

I would like to do some translation work first because I am also new to the DI-engine. Meanwhile, I also try to learn something about RL or MARL for my...

Hi Bruce, Could you give more error messages when you are using our code? @BruceLeeeee

> TypeError: DiT_Llama.**init**() got an unexpected keyword argument 'max_seq_len' Okay, we will check the training code again to fix this problem.

> Hi, @PommesPeter I tried to finetune the model, but no matter how I set it up the image size or batch size or precision (I remove the 'max_seq_len' argument),...

@BruceLeeeee we use `precision: bf16` and `grad_precision: fp32`, you can try these hyper-parameters, and we will check the `model.py` code whether is correct.

> > Traceback (most recent call last): > > File "/mnt/new_sfs_turbo/lsh2/train_projects/Lumina-T2X/lumina_t2i/train.py", line 753, in > > main(args) > > File "/mnt/new_sfs_turbo/lsh2/train_projects/Lumina-T2X/lumina_t2i/train.py", line 514, in main > > loss_dict = transport.training_losses(model,...

Hi @JincanDeng , We have supported Lumina-Next in the diffusers, check the instructions at [here](https://github.com/Alpha-VLLM/Lumina-T2X/blob/main/README.md#fast-demo)

``` Miss Mexico portrait of the most beautiful mexican woman, Exquisite detail, 30-megapixel, 4k, 85-mm-lens, sharp-focus, f:8, ISO 100, shutter-speed 1:125, diffuse-back-lighting, award-winning photograph, small-catchlight, High-sharpness, facial-symmetry, 8k --q 2...

你好,感谢您的贡献,这里是需要使用 apex 的 FusedRMSNorm,不然训练的时候会爆显存,推理的时候可不用。

> > 你好,感谢您的贡献,这里是需要使用 apex 的 FusedRMSNorm,不然训练的时候会爆显存,推理的时候可不用。 > > 您好! > > * 现在的model.py是只引用了apex。因为readme里面写的apex是可选的,如果没装apex的话,这样直接推理会报错 > ![image](https://private-user-images.githubusercontent.com/77225830/330254416-b324708b-7201-4b83-b970-356b078ee1eb.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTU2ODQ4MDEsIm5iZiI6MTcxNTY4NDUwMSwicGF0aCI6Ii83NzIyNTgzMC8zMzAyNTQ0MTYtYjMyNDcwOGItNzIwMS00YjgzLWI5NzAtMzU2YjA3OGVlMWViLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA1MTQlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNTE0VDExMDE0MVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPWNkMThlNTQzZGRmNjMxMThhODNiZDdiOWJhMjRlMDc3ZDNmZGJjNjNkNzNjNmIzZjdjNjdjZDMxZTk0Yzk4MjMmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.c_Fw-ni2-_lkVj0dAef-pjm3FcrRdftoKU8Md6UwrJ4) > * components.py里是这样import的方式才是正确的。所以改成在model里直接import components就行了。 > ![image](https://private-user-images.githubusercontent.com/77225830/330252795-05f5c9b6-e8a1-4252-a842-bdc6f39b6e28.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTU2ODQ4MDEsIm5iZiI6MTcxNTY4NDUwMSwicGF0aCI6Ii83NzIyNTgzMC8zMzAyNTI3OTUtMDVmNWM5YjYtZThhMS00MjUyLWE4NDItYmRjNmYzOWI2ZTI4LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA1MTQlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNTE0VDExMDE0MVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTY3ZDE1MmMxNjViMDg4MmM5NDJmOTM5NjM4NWNlNWMxYjE4NTE2NDkwYWRjYjE5OTI2MGM3NzAyZjQ4ZGU3NzImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.Y5jMMQjcDbkWBLepynjvOB0BLFzzvYCLExBmxcGF9l0) > > 或者您可能需要修改readme,或者加是否引用apex的参数 好的,感谢您修复代码的问题,这是我们 release 时候的问题,稍后再补充一些修改就可以 merge 了。