das icon indicating copy to clipboard operation
das copied to clipboard

zarr temp store files filling up local drive

Open eintzandt opened this issue 1 year ago • 6 comments

I have an external drive (D:) on my desktop PC that I am using to store my large datasets (typically a directory containing 15 1-hour long wavfiles) that I want to predict using a trained DAS model. However, I am often running into a storage error "No space left on device". image My D: drive has ~1TB of available space, but my local drive is filling up with temp files when DAS is running the predictions. Is there something I did during installation that is making the temp file path be on my local C: drive instead of my larger external D: drive? Is there a way to reroute these temp files to the D: drive so my local drive doesn't fill up? My temporary solution is to go into my temp directory (C:/.../AppData/Local/Temp/) and manually delete the directories created ('zarr...'), which are ~5GB per 1 hour file I am predicting on. I think the code that is creating these temp files is the TempStore class in das/lib/site-packages/zarr/storage.py, I just don't know if there is a way in DAS to redirect these files. image Thank you!

eintzandt avatar Feb 20 '24 14:02 eintzandt

Hi, I can see that this is a bit of an issue. We create these temporary files to enable processing of large files that do not fit in memory - everything is processed in chunks taken from the temp file. However, the temp files should be deleted when you close the audio file in DAS or at the very least when you close DAS. But I guess you did that and the temp files persisted?

postpop avatar Feb 21 '24 16:02 postpop

I am using the GUI to predict on a whole folder, so DAS isn't closing between opening files. The zarr temp files do not delete after finishing predictions for one file in the folder and starting the next file.

eintzandt avatar Feb 29 '24 22:02 eintzandt

Hi, in the newest version 0.32.1, we now explicitly delete the temporary directory after each file was predicted. Can you give this a try and let me know whether it fixes your issue? Thanks.

postpop avatar Mar 11 '24 13:03 postpop

Hello, I am experiencing the same issue of accumulating large zarr folders in Temp when predicting across a folder of wav files. I updated to 0.32.2 from 0.31.0 and the issue persists. The folders are deleted when I close DAS but I am hoping to process large amounts of data overnight if possible. I am using Windows 11.

Thank you

jynkjp avatar Apr 16 '24 07:04 jynkjp

We now manually delete the temporary folders after each file was predicted but maybe we've missed sth. Are you processing a folder full of files? Are you using the command line or the graphical user interface to do that? Thanks!

postpop avatar Apr 16 '24 08:04 postpop

Thank you for the quick reply. I am processing a folder full of files that is on a Network-Attached Storage server (not OS drive). I am using command line to process with increased batch size but I just tested it on the GUI and the issue is there also.

jynkjp avatar Apr 16 '24 09:04 jynkjp