Ayush Dattagupta
Hey @NielsRogge. Looking through the PDF version of the paper, I do see updated links to the [models](https://huggingface.co/nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16) and the v2 dataset. The link to the code in Curator https://github.com/NVIDIA-NeMo/Curator/tree/experimental/experimental/nvpdftex seems...
Thanks for confirming. Looks like the PDF/arXiv paper does link to the dataset [here](https://huggingface.co/datasets/nvidia/Nemotron-VLM-Dataset-v2), but the dataset card is missing the citation that links it to the papers page. I've...
Thanks for opening this, @federico-dambrosio. Quick update: I have been able to reproduce the issue but don't have a concrete root cause yet. We'll share an update as soon as we have something.