NeMo-Guardrails issues

Using Lynx 70B Cuda out of memory

1

Hello! I'm running Nemo Guardrails on Google Colab using the T4 GPU. However, when I deploy Lynx 70b using this code: `!python -m vllm.entrypoints.openai.api_server --port 5000 --model 'PatronusAI/Patronus-Lynx-70B-Instruct'` I have...

sjay8

question

Nemo/Langchain with Chainlit

1

Hello, I am building a RAG bot that utilizes Langchain and Nemo Guardrails. I currently have the chain portion of the code configured as the following: ```python from langchain.chains import...

BWardell15

Concept: Asyncronously passing user/session information to a guardrails action

3

Here's a scenario: we are running guardrails server, which uses FastAPI. Let's imagine that we'd like to keep track of each user that is using our guardrails server. For this,...

serhatgktp

1 validation error for HuggingFaceHub

1

I run it in colab, config.yml: ```yaml models: - type: main engine: huggingface_hub model: baichuan-inc/Baichuan2-7B-Chat rails: input: flows: - self check input output: flows: - self check output ``` jupyter...

qifuxiao

question

TypeError: cannot pickle '_thread.RLock' object

12

# Summary I started working with Nemo and instantly adopted it for wrapping our RAG chain which is written using langchain. It works great when i am working in notebook...

DixitAdh

bug

Feature - Ollama Embeddings

1

Hi, thank you for all the work around guardrails, I am using NeMo in my project & it is really awesome. This is my first contribution to the repo, happy...

kauabh

Error configuring NeMo Guardrails with a TensorRT LLM serving on TRT-LLM server

Has anyone successfully integrated TensorRT LLM with NeMo Guardrails? There is a lack of documentation on utilizing TensorRT LLM for Nemo Guardrails so I am hoping for some guidance here....

wesngoh

Fix/nvidia ai endpoints streaming

This PR fixes the streaming issue introduced with langchain_nvidia_ai_endpoints package. - It creates a patched version of `ChatNVIDIA` based on the version of the `langchain_nvidia_ai_endpoints` package. Two patched versions are...

Pouyanpi

Streams intermediate LLM calls in custom action.

1

I am using a custom action which is make up of several chains from langchain. I am using only self-check-input in configuration and I am streaming the output. ``` rag_chain...

mikeolubode

status: needs triage

passing in relevant_chunks as a message's context

2

[In the Rag example, passing in the relevant_chunks as a message is possible](https://github.com/NVIDIA/NeMo-Guardrails/blob/26304fd0bbf51f2c0c8ca0cb1ed242775f45dfe3/docs/getting_started/7_rag/rag.ipynb#L203) ``` response = rails.generate(messages=[{ "role": "context", "content": { "relevant_chunks": """ Employees are eligible for the following time...

johnyoonh

NeMo-Guardrails
NeMo-Guardrails copied to clipboard

Metadata

Using Lynx 70B Cuda out of memory

Nemo/Langchain with Chainlit

Concept: Asyncronously passing user/session information to a guardrails action

1 validation error for HuggingFaceHub

TypeError: cannot pickle '_thread.RLock' object

Feature - Ollama Embeddings

Error configuring NeMo Guardrails with a TensorRT LLM serving on TRT-LLM server

Fix/nvidia ai endpoints streaming

Streams intermediate LLM calls in custom action.

passing in relevant_chunks as a message's context

← Metadata

Owner

Metadata

NeMo-Guardrails NeMo-Guardrails copied to clipboard

Metadata

← Metadata

Owner

Metadata

NeMo-Guardrails
NeMo-Guardrails copied to clipboard