TensorRT-LLM Use 2 Lora in one request

I have a base mode: model_0. I created a Lora which is corresponding to instruction tuning: lora_1 The we merged model_0 + lora_1 to create: model_1. Then we created a Lora based on model_1 for DPO: lora_2.

final model is model_0 + lora_1 + lora_2.

But in inference time i can only use only one lora in lora_config:

lora_config = trtllm.LoraConfig(task_id=task_id, weights=weights, config=config)
trtllm.Request(input_token_ids=input_ids, ..., lora_config=lora_config)

I want to set that use lora_1 and lora_2 together.

Jul 27 '24 08:07 Alireza3242

Can you just use your base model to be model_1? Or are you needing to call model_0 still?

Might be worth going through the matrix operations to determine if you can come up with lora_3 which is the equivalent of lora_1 + lora_2 (doesn't seem likely to me though)

Jul 31 '24 14:07 TheCodeWrangler

Can you just use your base model to be model_1? Or are you needing to call model_0 still?

Might be worth going through the matrix operations to determine if you can come up with lora_3 which is the equivalent of lora_1 + lora_2 (doesn't seem likely to me though)

I need to call model_0. Because I may need to add other Loras. Probably lora_3=lora_1+lora_2 will solve my problem. I will try it.

Aug 01 '24 06:08 Alireza3242

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."

Sep 01 '24 02:09 github-actions[bot]

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 15 days."

Oct 03 '24 02:10 github-actions[bot]