Saaketh Narayan

96 comments by Saaketh Narayan

Hey @AugustDev, in your case, you have a whole lot of samples and storing the sample partition array is taking up a good amount of space. However, according to your...

Hey @AugustDev, we've been able to train on datasets that have that many (or more) samples -- this is likely an issue particular to your dataset. Are you trying to...

@XiaohanZhangCMU Mind adding the tests mentioned above so we can get this one in?

Hey @TAYmit, that seems indicative of a bug on our side. Would it be possible for you to share a shard file or a small repro of this behavior with...

Hey @TAYmit, any luck repro-ing on a smaller dataset?

Hey @janEbert, this seems sensible! We have chosen not to cache the epoch sample id tensor mainly because persistent storage may not be available in many training setups. So reading...