Saaketh Narayan
Hey @AugustDev, in your case, you have a whole lot of samples and storing the sample partition array is taking up a good amount of space. However, according to your...
Hey @AugustDev, we've been able to train on datasets that have that many (or more) samples -- this is likely an issue particular to your dataset. Are you trying to...
@XiaohanZhangCMU Mind adding the tests mentioned above and we can get this one in?
Hey @TAYmit, that seems indicative of a bug on our side. Would it be possible for you to share a shard file or a small repro of this behavior with...
Hey @TAYmit, any luck repro-ing on a smaller dataset?
Hey @janEbert, this seems sensible! We have chosen not to cache the epoch sample id tensor mainly because persistent storage may not be available in many training setups. So reading...