GenerativeAIExamples RAG using Llama2 7b model

RAG using Llama2 7b model

Open saivineethabvns opened this issue 1 year ago • 2 comments

In 05_dataloader.ipynb in examples folder https://github.com/NVIDIA/GenerativeAIExamples/blob/v0.4.0/notebooks/05_dataloader.ipynb is there a way to use Llama2 7b model instead of default Llama2 13b model

Apr 08 '24 09:04 saivineethabvns

Hey @saivineethabvns, yes this is possible!

As a prerequisite step to run this notebook, you need to deploy this example following the steps here https://nvidia.github.io/GenerativeAIExamples/latest/local-gpu.html#using-local-gpus-for-a-q-a-chatbot

Now in this step where you download the model https://nvidia.github.io/GenerativeAIExamples/latest/local-gpu.html#download-the-llama-2-model-and-weights , just download the llama-7b checkpoint instead of llama13b and you should be good to go.

Let me know if you face issues.

Apr 22 '24 13:04 shubhadeepd

https://nvidia.github.io/GenerativeAIExamples/latest/local-gpu.html#using-local-gpus-for-a-q-a-chatbot @shubhadeepd this link is not working now 404 error, does address moved. Can you share the newurl for same local gpu deployment

Jul 11 '24 13:07 rrpatil-1

GenerativeAIExamples GenerativeAIExamples copied to clipboard

RAG using Llama2 7b model

GenerativeAIExamples
GenerativeAIExamples copied to clipboard