mces89

7 issues by mces89

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Can LLaMA-Factory support Cohere's Command R+ model? https://huggingface.co/CohereForAI/c4ai-command-r-plus ### Expected behavior _No...

in-progress

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm trying to do full SFT for Mixtral 8x22B; I used two 8xA100 (80 GB) instances....

pending

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm using the latest llmtuner 0.7.0 with the following library versions: transformers>=4.39.1 accelerate>=0.28.0 bitsandbytes>=0.43.0 and...

solved

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction I'm using the following command, which can run FSDP+QLoRA for Mixtral 8x22B in...

pending

### Reminder - [X] I have read the README and searched the existing issues. ### Reproduction Currently FSDP+QLoRA only supports int4. Is there any reason int8 is not supported too?...

### Reminder - [X] I have read the README and searched the existing issues. ### System Info n/a ### Reproduction n/a ### Expected behavior _No response_ ### Others Hi, I...

pending

### Your current environment The output of `python collect_env.py` ```text Your output of `python collect_env.py` here ``` ### Model Input Dumps _No response_ ### 🐛 Describe the bug Hi, currently...

bug
stale