fsdp_qlora
Request for Scripts to Merge QDoRA Adapters with Base Model for vLLM Inference
Hello,
I've successfully fine-tuned Llama-3 8B with QDoRA and would now like to run inference with vLLM. Could you provide guidance or a script for merging the QDoRA adapters into the original base model? Additionally, does this process involve dequantizing the quantized base model before the merge (and possibly re-quantizing afterwards)?
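For reference, here is a rough sketch of what I imagined the merge would look like if the adapters were in standard PEFT format. I'm not sure how the QDoRA checkpoints produced by this repo map onto this, and the model/adapter paths below are just placeholders, so please correct me if the actual procedure is different:

```python
# Rough sketch only -- assumes the adapters load as a standard PEFT adapter,
# which may not be true for the QDoRA checkpoints saved by this repo.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Meta-Llama-3-8B"     # base model I fine-tuned
adapter_dir = "path/to/qdora_adapters"     # placeholder path to my saved adapters
out_dir = "llama3-8b-qdora-merged"         # merged model directory for vLLM

# Load the base model in bf16 (i.e. dequantized weights), since vLLM expects
# plain HF weights rather than 4-bit quantized ones.
base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype=torch.bfloat16)

# Attach the adapters and fold them into the base weights.
model = PeftModel.from_pretrained(base, adapter_dir)
model = model.merge_and_unload()

# Save in a format vLLM can load directly.
model.save_pretrained(out_dir)
AutoTokenizer.from_pretrained(base_id).save_pretrained(out_dir)
```

If the QDoRA adapters need a custom conversion step before something like the above can work, a pointer to the right script or format would be much appreciated.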
Thank you!