haystack
haystack copied to clipboard
Support dataset streaming
Is your feature request related to a problem? Please describe. When I want to train a reader, my dataset uses 40Gb RAM. I only have 12Gb available. Streaming (loading only necessary part of dataset into memory for each batch) would fix this.