Duc Quang Nguyen
Results: 3 comments of Duc Quang Nguyen
I have not run into this issue. Can you share the command that reproduces it?
Have you merged my pull request that adds Mixtral support? If not, you can use my modified repo here: https://github.com/martinakaduc/LLaVA
My pretraining script: `deepspeed llava/train/train_mem.py --deepspeed ./scripts/zero3_offload.json --model_name_or_path mistralai/Mixtral-8x7B-Instruct-v0.1 --version plain...
I think it is possible. However, I have not tested it yet.