Duc Quang Nguyen

Results 3 comments of Duc Quang Nguyen

I have not faced this issue. Can you give me the reproducing command.

Have you merged my pull request about adding mixtral? If not, you can use my modified repo here: https://github.com/martinakaduc/LLaVA My pretraining script: `deepspeed llava/train/train_mem.py --deepspeed ./scripts/zero3_offload.json --model_name_or_path mistralai/Mixtral-8x7B-Instruct-v0.1 --version plain...

I think it is possible. However I have not tested yet.