tensorrtllm_backend icon indicating copy to clipboard operation
tensorrtllm_backend copied to clipboard

[request] Add example of custom LLM model not based on huggingface

Open michaelnny opened this issue 1 year ago • 0 comments

Hi,

I'm wondering if it's possible to add example (or general guideline) of how to serving custom LLM model that's not based on huggingface.

As an example, we could use the original Llama3 chat model with the native Tiktoken tokenizer, which are not based on huggingface transformers: https://github.com/meta-llama/llama3

This will be great for people that are working with custom LLM models that are decoupled from the huggingface ecosystem, thanks!

michaelnny avatar May 20 '24 12:05 michaelnny