Error message has incorrect model engine name of nemo-infer instead of ai-playground

Open nathankw opened this issue 1 year ago • 4 comments

RetrievalAugmentedGeneration.common.utils.get_llm() reports an incorrect model_engine name in the error string on the last line of the function. It says "Supported engines are triton-trt-llm and nemo-infer", but it should say "Supported engines are triton-trt-llm and ai-playground".

In the config.yaml file, we must specify llm.model_engine as one of [triton-trt-llm, ai-playground].
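One way to keep the message from drifting out of sync with the accepted values is to build it from a single list of engine names. The following is only a minimal sketch: the actual body of get_llm() is not shown in this issue, so SUPPORTED_ENGINES and validate_model_engine are hypothetical names used for illustration, not the repository's code.

```python
# Hypothetical names for illustration; not taken from the repository.
SUPPORTED_ENGINES = ("triton-trt-llm", "ai-playground")


def validate_model_engine(model_engine):
    """Return model_engine if it is supported, otherwise raise an error.

    The error text is generated from SUPPORTED_ENGINES, so it cannot
    mention an engine name that the check itself does not accept.
    """
    if model_engine not in SUPPORTED_ENGINES:
        supported = " and ".join(SUPPORTED_ENGINES)
        raise RuntimeError(
            f"Unsupported model engine {model_engine!r}. "
            f"Supported engines are {supported}."
        )
    return model_engine
```

With this shape, renaming an engine only requires changing the tuple, and the error string follows automatically.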

nathankw, Jan 02 '24 22:01

Hi, I am new here. I have corrected the error message, but now I am confused about config.yaml. Previously it was model_engine="triton-trt-llm"; should I now add both triton-trt-llm and ai-playground in a list, or just separate them with a comma? (Screenshot attached: 2024-01-07 122139)

mohammedpithapur, Jan 07 '24 06:01

@mohammedpithapur Thanks for fixing the error message! It should be either model_engine: "triton-trt-llm" or model_engine: "ai-playground", that is, a single value rather than a list or a comma-separated pair.
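To illustrate the "one value, not both" rule, here is a small sketch that checks an already-parsed config. It assumes the YAML has been loaded into a nested dict with an llm section containing model_engine, as the issue describes; read_model_engine and ALLOWED_ENGINES are hypothetical names, not part of the repository.

```python
# Hypothetical helper for illustration; the config is assumed to be
# already parsed from config.yaml into a nested dict.
ALLOWED_ENGINES = {"triton-trt-llm", "ai-playground"}


def read_model_engine(config):
    """Return llm.model_engine, requiring exactly one supported string."""
    value = config.get("llm", {}).get("model_engine")
    # A YAML list such as [triton-trt-llm, ai-playground] parses to a
    # Python list, which is rejected here with a clear message.
    if not isinstance(value, str):
        raise TypeError("llm.model_engine must be a single string")
    # A comma-separated string is one string, but not a supported name.
    if value not in ALLOWED_ENGINES:
        raise ValueError(f"unsupported llm.model_engine: {value!r}")
    return value


# Usage: a single supported string is accepted.
engine = read_model_engine({"llm": {"model_engine": "ai-playground"}})
```

Both a list and a string like "triton-trt-llm, ai-playground" are rejected by this sketch, which matches the either/or requirement stated above.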

shubhadeepd, Jan 08 '24 19:01

Can you suggest how I can do it? Sorry, I am new.

mohammedpithapur, Jan 09 '24 16:01

@mohammedpithapur We now have a dedicated example that showcases AI Playground in the latest codebase! No config file changes are needed, which avoids the confusion you faced. Please check out the example here: https://github.com/NVIDIA/GenerativeAIExamples/blob/main/RetrievalAugmentedGeneration/README.md#1-qa-chatbot----nvidia-ai-foundation-inference-endpoint

shubhadeepd, Jan 29 '24 15:01