Yen-Ting Lin

Results 5 issues of Yen-Ting Lin

**Describe the bug** When attempting to train a model using Nemo 2403, a `ValueError` is raised indicating a conflict between the `precision` argument and the `plugins` argument being passed to...

bug

Hello, I recently read your interesting paper. The results look very promising and I'm excited to try out the COEDIT models. In Section 4 "Experimental Setup" of the paper, several...

Hi @pacman100 , Firstly, thank you for the well-detailed article! I am writing to provide some feedback and seek clarification. 1. **Optimizer Selection:** - The blog post demonstrates the use...

Thank you for the cool project! Could you please elaborate more on how to come up with `13.3`? My understanding is that Number of GPU needed is `Training FLOPs /...

This PR adds support for the TMLU (["Measuring Taiwanese Mandarin Language Understanding"](https://doi.org/10.48550/arXiv.2403.20180) by Chen et al) benchmark dataset. ## Summary - Adds a new dataset `tmlu` with 2,981 multiple-choice questions...