mces89
mces89
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Can LLaMA-Factory support the cohere's command r plus mode? https://huggingface.co/CohereForAI/c4ai-command-r-plus ### Expected behavior _No...
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm trying to do full sft for mixtral 8x22B, I used 2 8xa100(80g) instances....
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm using the latest llmtuner 0.7.0 with following libraries versions: transformers>=4.39.1 accelerate>=0.28.0 bitsandbytes>=0.43.0 and...
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm using the following command which can do the fsdp+qlora for mistral 8x22B in...
### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Currently the fsdp+qlora only supports int4, is there any reason not supporting int8 too?...
### Reminder - [X] I have read the README and searched the existing issues. ### System Info n/a ### Reproduction n/a ### Expected behavior _No response_ ### Others Hi, I...
### Your current environment The output of `python collect_env.py` ```text Your output of `python collect_env.py` here ``` ### Model Input Dumps _No response_ ### 🐛 Describe the bug Hi, currently...