Erjia Guan

Results 170 comments of Erjia Guan

Yeah. And, for this issue, we should definitely unify the buffer_size. For the `infinite` one, I agree `None` is better than `-1`. WDYT whether or not we should raise Error...

Agree on the priority. We can spend a few days on this for BE next half~

I want to add another potential improvement for pyi gen. Currently, the type hint for return value of each functional API is either `IterDataPipe` or `MapDataPipe`. We could change the...

cc: @NivekT Does `parquet` support loading dataframe from binary stream? If so, we might change the behavior of `ParquetDataFrameLoader` from loading by file pathes to loading by binary streams. Then,...

Another feedback from our a downstream library (ray): They previously provided an interface for users to transform `ray.Dataset` into `IterableDataset`. As we plan to collaborate to switch it to `IterDataPipe`,...

I almost forget one of the most important benefits by making TorchData standalone. If we have some changes landed into PyTorch Core that, a dependent changes in TorchData have to...

Want to track the list of features we are currently depending on PyTorch Core: - Dataset/IterableDataset - Profiler - default_collate_fn - Sampler The features that can be moved to TorchData...