GenerativeAIExamples
Generative AI reference workflows optimized for accelerated infrastructure and microservice architecture.
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.3 to 3.1.4. Release notes Sourced from jinja2's releases. 3.1.4 This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise...
Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.2 to 3.1.4. Release notes Sourced from jinja2's releases. 3.1.4 This is the Jinja 3.1.4 security release, which fixes security issues and bugs but does not otherwise...
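To see whether a given environment already has the patched release targeted by these two bumps, a check along the following lines could be used. This is a minimal sketch: the helper names (`parse_version`, `is_patched`, `installed_jinja2_is_patched`) are hypothetical, and the parser assumes plain numeric versions such as `3.1.4`, so pre-release strings are not handled.

```python
from importlib import metadata

# Minimum release named in the bump PRs above.
PATCHED = (3, 1, 4)


def parse_version(text):
    """Turn '3.1.4' into (3, 1, 4). Assumes purely numeric dotted versions."""
    return tuple(int(part) for part in text.split("."))


def is_patched(version_text, minimum=PATCHED):
    """Return True when version_text is at least the patched release."""
    return parse_version(version_text) >= minimum


def installed_jinja2_is_patched():
    """Return True/False for the installed jinja2, or None if it is not installed."""
    try:
        return is_patched(metadata.version("jinja2"))
    except metadata.PackageNotFoundError:
        return None
```

Comparing tuples rather than raw strings avoids ordering mistakes such as `"3.1.10" < "3.1.4"`.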
I am trying the LoRA training notebook at https://github.com/NVIDIA/GenerativeAIExamples/blob/main/models/Gemma/lora.ipynb. When running the code segment below:
```
from nemo.collections.nlp.parts.megatron_trainer_builder import MegatronLMPPTrainerBuilder
from nemo.utils.exp_manager import exp_manager

trainer = MegatronLMPPTrainerBuilder(cfg).create_trainer()
exp_manager(trainer, cfg.exp_manager)
```
...
I have gone through the notebooks but was not able to stream tokens from TensorRT-LLM. Here's the issue: ![image](https://github.com/NVIDIA/GenerativeAIExamples/assets/150957746/b8d50a66-5acc-4fa0-a206-a36b6f8eb418) Code used: ```python from langchain_nvidia_trt.llms import TritonTensorRTLLM import time import...
Example 10_RAG_for_HTML_docs_with_Langchain_NVIDIA_AI_Endpoints fails with error: ValueError: The de-serialization relies loading a pickle file. Pickle files can be modified to deliver a malicious payload that results in execution of arbitrary code...
Hi, I am getting 500s with description ``` chat messages must alternate roles between 'user' and 'assistant'. Message may have a leading 'system' role message ``` and ``` "Internal Server...
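The error text above describes a constraint on the chat history: roles alternate between `user` and `assistant`, with an optional leading `system` message. A minimal sketch of a client-side check follows, assuming messages are dicts with `role` and `content` keys and that the first non-system message comes from the user (the error text does not state this explicitly). The function names are hypothetical and not part of any NVIDIA or LangChain API.

```python
from typing import Dict, List

Message = Dict[str, str]


def find_role_violation(messages: List[Message]) -> int:
    """Return the index of the first message that breaks alternation, or -1."""
    # An optional leading 'system' message is skipped.
    start = 1 if messages and messages[0]["role"] == "system" else 0
    expected = "user"  # assumption: the conversation opens with the user
    for index in range(start, len(messages)):
        if messages[index]["role"] != expected:
            return index
        expected = "assistant" if expected == "user" else "user"
    return -1


def merge_consecutive_roles(messages: List[Message]) -> List[Message]:
    """Join adjacent messages that share a role, separated by a newline."""
    merged: List[Message] = []
    for message in messages:
        if merged and merged[-1]["role"] == message["role"]:
            previous = merged[-1]
            merged[-1] = {
                "role": previous["role"],
                "content": previous["content"] + "\n" + message["content"],
            }
        else:
            merged.append(dict(message))
    return merged
```

Running `find_role_violation` before sending a request makes it easier to tell whether a 500 comes from the message history or from something else on the server. `merge_consecutive_roles` is one possible repair for the common case of two user turns in a row.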
The code has been restructured without adding complexity. Multiple user sessions are implemented so that each user can query within their own context. Implemented vector DB aggregation when a user adds a new...
In the [RAG in 5 minutes example](https://github.com/NVIDIA/GenerativeAIExamples/blob/main/examples/README.md#rag-in-5-minutes-example) ![Screenshot 2024-04-07 221636](https://github.com/NVIDIA/GenerativeAIExamples/assets/52562864/e5028d17-5208-4d04-94b2-6ebf4221e3ad) the hyperlink points to [this page](https://github.com/NVIDIA/GenerativeAIExamples/blob/main/docs/ai-foundation-models.md#get-an-api-key-for-the-mixtral-8x7b-instruct-api-endpoint), which does not exist. ![Screenshot 2024-04-07 221809](https://github.com/NVIDIA/GenerativeAIExamples/assets/52562864/c561b749-e3be-489c-ada8-f8c18476d7ce)
1. Visit https://github.com/NVIDIA/GenerativeAIExamples/tree/main/examples
2. Click on the link embedded in step 3 ("Follow the steps mentioned here to get this.").

Notice the 404 for the referenced file: https://github.com/NVIDIA/GenerativeAIExamples/blob/main/docs/ai-foundation-models.md#get-an-api-key-for-the-mixtral-8x7b-instruct-api-endpoint