Sidd Karamcheti

16 comments by Sidd Karamcheti

Well, that's not good - let me take a look and see what's going on.

Hey @karan6181 -- the `ChainDataset` solution means that I lose any proportional sampling behavior I'd get by loading multiple streams in a single `StreamingDataset()`. Is there no other way to...
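To make the concern in the comment above concrete, here is a minimal pure-Python sketch of the difference between chaining streams and sampling from them proportionally. It does not use `torch` or Mosaic Streaming: the helpers `chained` and `proportional`, the toy streams, and the 0.75/0.25 weights are all hypothetical stand-ins for illustration, not either library's actual implementation.

```python
import itertools
import random
from collections import Counter


def chained(*streams):
    """Exhaust each stream in order (the behavior chaining gives you)."""
    return itertools.chain(*streams)


def proportional(streams, weights, num_samples, seed=0):
    """Pick the source stream for each sample with probability proportional to its weight."""
    rng = random.Random(seed)
    # Cycle each stream so a heavily weighted stream never runs dry in this toy example.
    iterators = [itertools.cycle(s) for s in streams]
    for _ in range(num_samples):
        idx = rng.choices(range(len(iterators)), weights=weights, k=1)[0]
        yield next(iterators[idx])


# Two toy "streams" of (source_name, index) samples.
stream_a = [("a", i) for i in range(100)]
stream_b = [("b", i) for i in range(100)]

# Chaining: the first 100 samples all come from stream "a".
chained_counts = Counter(
    name for name, _ in itertools.islice(chained(stream_a, stream_b), 100)
)

# Proportional sampling: both streams are interleaved, roughly 3:1.
mixed_counts = Counter(
    name for name, _ in proportional([stream_a, stream_b], [0.75, 0.25], 1000)
)

print(chained_counts)
print(mixed_counts)
```

With chaining, any window of the output reflects only whichever stream is currently being read, so the mixture ratio is not preserved within a batch or an epoch prefix; the weighted version keeps the ratio approximately constant throughout.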

Just following up on this; @karan6181 @lhoestq -- my understanding is that the HF Hub exposes dataset repositories via an `fsspec` API: https://huggingface.co/docs/huggingface_hub/main/en/guides/hf_file_system From the Mosaic Streaming perspective -- can...

Hey @karan6181 -- I'm a bit swamped with upcoming paper deadlines right now, but would love to see this supported. I can try carving out time to work on things...

Chiming in here to +1 the feature request (hey @VictorSanh)! Having support for lists/timeseries is something that would be amazing for some ongoing projects we have at Stanford around imitation...

Follow-up question for @HamidShojanazeri and others maintaining this codebase; can you help explain the difference between using "traditional BF16 mixed precision" and "pure BF16" with/without the AnyPrecisionOptimizer? Specifically, let's say...