Kevin Tse
It seems that StreamingASR doesn't work with Python 3.10 due to an issue within PyAudio. Executing `python run_sasr.py` leads to the error `[PY_SSIZE_T_CLEAN macro must be defined for '#' formats]`....
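A minimal reproduction sketch, assuming PyAudio <= 0.2.11 on Python 3.10; the exact call where the error surfaces may vary, and upgrading PyAudio (0.2.12 reportedly adds Python 3.10 support) is one possible workaround:

```python
# Repro sketch (assumes PyAudio is installed and a microphone is available).
# On Python 3.10 with PyAudio <= 0.2.11, opening/reading the stream is where
# "PY_SSIZE_T_CLEAN macro must be defined for '#' formats" tends to surface.
import sys
import pyaudio

if sys.version_info >= (3, 10):
    print("warning: PyAudio <= 0.2.11 is known to fail on Python 3.10")

pa = pyaudio.PyAudio()
stream = pa.open(format=pyaudio.paInt16, channels=1, rate=16000,
                 input=True, frames_per_buffer=1024)
chunk = stream.read(1024)   # the error is reportedly raised from the C extension here
stream.stop_stream()
stream.close()
pa.terminate()
```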
Stack from [ghstack](https://github.com/ezyang/ghstack): * __->__ #734 This PR is primarily focused on adding more datasets for benchmarking. Notable changes that are in progress: - Using `PrototypeMultiprocessingReadingService` as that will become...
Stack from [ghstack](https://github.com/ezyang/ghstack): * __->__ #724 This PR adds RandomSplitter without a buffer. The upside is that this uses less memory (good for memory-bound cases), but the downsides are 1)...
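As an illustration of the trade-off only (not the torchdata implementation), a buffer-free random split can keep memory constant by re-walking the source once per split and keeping just the items a seeded draw assigns to that split:

```python
# Illustrative sketch: each split re-iterates the source, so memory stays O(1)
# but the source is walked once per split. The same seed guarantees every item
# is assigned to exactly one split across passes.
import random

def split_pass(source, weights, seed, target):
    """Yield the items of `source` that a seeded draw assigns to `target`."""
    names = list(weights)
    rng = random.Random(seed)  # same seed => identical assignments per pass
    for item in source:
        pick = rng.choices(names, weights=[weights[n] for n in names])[0]
        if pick == target:
            yield item

weights = {"train": 0.8, "valid": 0.2}
train = list(split_pass(range(10), weights, seed=0, target="train"))
valid = list(split_pass(range(10), weights, seed=0, target="valid"))
assert sorted(train + valid) == list(range(10))  # disjoint and exhaustive
```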
Stack from [ghstack](https://github.com/ezyang/ghstack): * __->__ #728 * #746 Adding `__len__` to `DataLoader2`. See inline comments. We should discuss the details and whether this makes sense. Fixes #549 Differential Revision: [D38999743](https://our.internmc.facebook.com/intern/diff/D38999743)
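A rough sketch of what the delegation could look like; the attribute names here are assumptions for illustration, not the actual `DataLoader2` internals:

```python
# Hypothetical sketch only: forwards len() to the wrapped DataPipe, raising
# TypeError (as len() does) when the pipe has no valid length.
class DataLoader2:
    def __init__(self, datapipe, reading_service=None):
        self.datapipe = datapipe
        self.reading_service = reading_service

    def __len__(self):
        return len(self.datapipe)
```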
Add Examples of Common Preprocessing Steps with IterDataPipe (such as splitting a data set into two)
### 📚 The doc issue There are a few common steps that users often want to perform while preprocessing data, such as [splitting their data set](https://pytorch.org/docs/stable/data.html#torch.utils.data.random_split) into train and...
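For reference, the kind of snippet such examples could start from: splitting a map-style dataset with the linked `torch.utils.data.random_split` (the sizes and seed below are just examples):

```python
import torch
from torch.utils.data import TensorDataset, random_split

# Toy dataset: 100 (feature, label) pairs.
dataset = TensorDataset(torch.arange(100).float().unsqueeze(1), torch.arange(100))

# Reproducible 80/20 split.
generator = torch.Generator().manual_seed(0)
train_set, valid_set = random_split(dataset, [80, 20], generator=generator)
print(len(train_set), len(valid_set))  # 80 20
```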
### 🚀 The feature This issue proposes the addition of a linter for DataPipes and DataLoader2. The linter can analyze the graph of DataPipes and input arguments to DataLoader2, and...
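As a sketch of one rule such a linter could enforce, the snippet below warns when a `sharding_filter` appears upstream of a `shuffle` (shuffle-before-shard being the usual recommendation); the attribute walk is an assumption about how DataPipes reference their sources, not a stable API:

```python
from torch.utils.data import IterDataPipe
from torch.utils.data.datapipes.iter import IterableWrapper

def upstream(dp):
    """Yield DataPipes reachable from `dp` by following instance attributes."""
    for value in vars(dp).values():
        if isinstance(value, IterDataPipe):
            yield value
            yield from upstream(value)

def lint_shuffle_before_shard(dp):
    # Warn if any Shuffler has a ShardingFilter somewhere upstream of it.
    for node in (dp, *upstream(dp)):
        if type(node).__name__ == "ShufflerIterDataPipe":
            if any(type(n).__name__ == "ShardingFilterIterDataPipe" for n in upstream(node)):
                print("lint: shuffle appears after sharding_filter; "
                      "samples may be distributed unevenly across workers")

pipe = IterableWrapper(range(8)).sharding_filter().shuffle()
lint_shuffle_before_shard(pipe)
```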
### 📚 The doc issue As we further develop `DataLoader2`, we should add a separate page under "API Reference" for `DataLoader2` documentation. The page should cover topics such as adapters,...
### 🚀 The feature **Note that this is an RFC solely to discuss the design. There is currently no plan to implement this feature.** This issue serves as a...
When a new iterator is created, `DataLoader2` currently resumes from where it left off rather than resetting and starting from the beginning (see code snippet below). This is...
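An illustrative reconstruction of the described behavior (not the snippet from the issue; exact output depends on the torchdata version):

```python
from torchdata.dataloader2 import DataLoader2
from torchdata.datapipes.iter import IterableWrapper

dl = DataLoader2(IterableWrapper(range(10)))

it1 = iter(dl)
print(next(it1), next(it1), next(it1))  # 0 1 2

it2 = iter(dl)
print(next(it2))  # 3 if iteration resumes (the behavior this issue describes), 0 if it resets
```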
Currently, `in_batch_shuffle` doesn't use the RNG shared across processes. This can be problematic if it is used prior to sharding, resulting in certain samples not being used in training...
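A sketch of the ordering concern, assuming the usual datapipe functionals; the "risky" pipeline is the pattern this issue flags, and `shuffle()` before `sharding_filter()` is the usual safe alternative:

```python
from torchdata.datapipes.iter import IterableWrapper

# Ordering the issue flags: in_batch_shuffle runs before sharding, but (per this
# issue) it does not draw from the RNG shared across worker processes, so workers
# can shuffle batches differently and then keep inconsistent shards of them.
risky = (
    IterableWrapper(range(16))
    .batch(4)
    .in_batch_shuffle()
    .unbatch()
    .sharding_filter()
)

# The usual safe pattern: a global shuffle (shared, seedable RNG) before sharding.
safe = (
    IterableWrapper(range(16))
    .shuffle()
    .sharding_filter()
    .batch(4)
)
```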