text-generation-inference
text-generation-inference copied to clipboard
Allow multi-lora in Messages API
Feature request
Multi lora support in TGI has been around since 2.0.6, but it is not compatible with the Messages API using the openai package.
Motivation
The openai chat completion approach is an industry standard at this point, so having it be compatible with multi lora would be great.
Your contribution
.