NVTabular
NVTabular is a feature engineering and preprocessing library for tabular data, designed to quickly and easily manipulate the terabyte-scale datasets used to train deep-learning-based recommender systems.
Makes minor adjustments to fix (some) pytest failures with RAPIDS 24.04. In local testing, the only remaining failures seem to be related to TensorFlow.
### Description When running an NVTabular workflow with Categorify operations in Triton Inference Server, performance is very slow on high-cardinality data. ### Environment - Merlin Tensorflow...
Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 2 to 4.1.7. Release notes Sourced from actions/download-artifact's releases. v4.1.7 What's Changed Update @actions/artifact dependency by @bethanyj28 in actions/download-artifact#325 Full Changelog: https://github.com/actions/download-artifact/compare/v4.1.6...v4.1.7 v4.1.6 What's Changed updating @actions/artifact...
**Describe the bug** I am following the code here (https://github.com/pytorch/torchrec/tree/main/torchrec/datasets/scripts/nvt) to preprocess the Criteo 1TB dataset using NVTabular. The `workflow.fit` step fails with the error "RuntimeError: Failed to categorical encode column...
# Motivation Public datasets are generally provided with negative samples to make it easier to train and compare results for different algorithms. Although, the most common for industry use cases...
This repository has been configured to receive notifications when access to NVIDIA's self-hosted runners is within 30 days of expiration. See this [link](https://github.com/nv-gha-runners/enterprise-runner-configuration/blob/main/docs/renewing-access.md) for more information on how to renew...
**Describe the bug** Applying `ops.GroupBy(...)` after `ops.Filter(...)` causes unexpected behaviour. Some rows are filled with lists of `nan`s, and rows are not grouped correctly. It seems like the problem...
Hi, I am running into a 130 SIGSEGV error when trying to run my NVTabular + Transformers4Rec training script in a container using the following base image: `nvcr.io/nvidia/merlin/merlin-pytorch:23.12` I believe...
Dear NVTabular team, I'm following your instructions for the NVTabular demo at this link: https://nvidia-merlin.github.io/NVTabular/v0.5.0/examples/advanced-ops-outbrain/03-Training-with-TF.html. However, it seems that the Outbrain dataset is no longer downloadable from Kaggle. Do you...
Hi All, Is container support dropped at this point? The latest container from the NGC registry is from 09/05/2024. It says that the definition was updated on July 31, 2025,...