True lazy reading to open EMD files
Currently lazy reading of EDX spectra in EMD files is not truly lazy. The data is stored in a compressed format in the file. HyperSpy reads the compressed data in memory and uncompresses it lazily.
This lead to error messages if the number of frame is consequent.
Implementing pure lazy reading could permit to work with big datasets (i.e. cartography with too much frames)
This turned out to be partially a bug, see hyperspy/hyperspy#2008.
@ZanettaPM, hyperspy/hyperspy#2008 does not implement true lazy reading, but is it enough for your purposes?
I don't know i have to check this. But the dask array might be too large if the number of frames is high right ?
Yes, it keeps the stream array in memory. I've just opened with 40GB of data (uncompressed) whose stream array only took 1GB of memory. Performing a sum over the whole navigation axes took 20s.
Wow, so it should be enough for me !
Could you tested it?
On Fri, 13 Jul 2018 at 13:57, Zanetta Pierre-marie [email protected] wrote:
Wow, so it should be enough for me !
— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/hyperspy/rosettasciio/issues/18, or mute the thread https://github.com/notifications/unsubscribe-auth/AA8aF0zqOqg_OzKC_O5Hjvhj9ffql7cTks5uGIsRgaJpZM4URWLa .
Actually not now, but i'll let you know. I'll try this asap !
Reopening because hyperspy/hyperspy#2008 only partially fixes this as it doesn't implement true lazy loading.
The situation should improve further (without fully fixing the issue) with hyperspy/hyperspy#2012