NVTabular icon indicating copy to clipboard operation
NVTabular copied to clipboard

NVTabular is a feature engineering and preprocessing library for tabular data designed to quickly and easily manipulate terabyte scale datasets used to train deep learning based recommender systems.

Results 172 NVTabular issues
Sort by recently updated
recently updated
newest added

Makes minor adjustments to fix (some) pytest failures with RAPIDS 24.04. In local testing, the only remaining failures seem to be related to tensorflow.

bug

### Description When running an NVTabular workflow with Categorify operations in Triton Inference Server, the performance is significantly slow when dealing with high cardinality data. ### Environment - Merlin Tensorflow...

bug

Bumps [actions/download-artifact](https://github.com/actions/download-artifact) from 2 to 4.1.7. Release notes Sourced from actions/download-artifact's releases. v4.1.7 What's Changed Update @​actions/artifact dependency by @​bethanyj28 in actions/download-artifact#325 Full Changelog: https://github.com/actions/download-artifact/compare/v4.1.6...v4.1.7 v4.1.6 What's Changed updating @actions/artifact...

dependencies
github_actions

**Describe the bug** I am following the code here (https://github.com/pytorch/torchrec/tree/main/torchrec/datasets/scripts/nvt) to preprocess criteo 1tb dataset using Nvtabular. I cannot execute the step workflow.fit, error "RuntimeError: Failed to categorical encode column...

bug

# Motivation Public datasets are generally provided with negative samples to make it easier to train and compare results for different algorithms. Although, the most common for industry use cases...

Sampling

This repository has been configured to receive notifications when access to NVIDIA's self-hosted runners is within 30 days of expiration. See this [link](https://github.com/nv-gha-runners/enterprise-runner-configuration/blob/main/docs/renewing-access.md) for more information on how to renew...

nvidia-runners

**Describe the bug** Applying `ops.GroupBy(...)` after `ops.Filter(...)` causes some weird behaviour. Some rows are filled with lists of `nan`s, and rows are not groupped correctly. It seems like the problem...

bug

Hi I am running into an 130 SIGSEGV Error when trying to run my NVTabular + Transformers4Rec training script in a container using the following base image `nvcr.io/nvidia/merlin/merlin-pytorch:23.12` I believe...

question

Dear NVTabular team, I'm following your instruction on NVTabular demo in this link: https://nvidia-merlin.github.io/NVTabular/v0.5.0/examples/advanced-ops-outbrain/03-Training-with-TF.html. However, it seems that the Outbrain's data set is no longer downloadable from Kaggle. Do you...

question

Hi All, Is container support dropped at this point? The latest container from the NGC registry is from 09/05/2024. It says that the definition was updating on July 31, 2025,...

question