Results 2 issues of Ge.Gao

**Describe the bug** I encountered AttributeError: 'MMBTConfig' object has no attribute 'use_return_dict' when using MultiModalClassificationModel. ``` # Create a ClassificationModel args = {'reprocess_input_data': True, 'overwrite_output_dir': True, 'fp16': False, 'n_gpu': 2,...

stale

Hi, awesome project! I am experimenting with using "unsloth/Meta-Llama-3.1-405B-Instruct-bnb-4bit" for inference. I am using 1 A100 GPU with 16 core CPU. However, inference time for one sentence takes 20+ minutes....