Tony Hutter

241 comments by Tony Hutter

Issue 2: when I log in to a machine for the first time and run my test, I always hit this error: ``` SCR v1.2.0: rank 0: Initiating flush of dataset...

> If I re-run the test the problem always goes away. Looks like a stale transfer handle issue. I'll try it with axl_cp and see if the same thing happens....

Fun fact: BBAPI appears to be way slower than a vanilla copy. I timed how long it took to copy a 10GB file of random data from SSD to GPFS...

It's possible that the copy was faster because the whole file was in the page cache. That would mean the regular copies were just reading the data from memory rather...
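One way to rule out the page-cache effect when timing a plain copy is to ask the kernel to drop the source file's cached pages first. The following is a minimal sketch, assuming Linux and Python 3; the helper names (`evict_from_page_cache`, `timed_copy`, `demo`) and the small file size are illustrative and not part of SCR, AXL, or BBAPI. `POSIX_FADV_DONTNEED` is only advisory, so the kernel may still keep some pages.

```python
import os
import shutil
import tempfile
import time


def evict_from_page_cache(path):
    """Ask the kernel to drop cached pages for `path` (advisory, Linux/Unix only)."""
    fd = os.open(path, os.O_RDONLY)
    try:
        # Dirty pages cannot be dropped, so write them out first.
        os.fsync(fd)
        # offset=0, length=0 means "the whole file".
        os.posix_fadvise(fd, 0, 0, os.POSIX_FADV_DONTNEED)
    finally:
        os.close(fd)


def timed_copy(src, dst):
    """Copy src to dst and return elapsed seconds, including flushing dst to storage."""
    start = time.perf_counter()
    shutil.copyfile(src, dst)
    fd = os.open(dst, os.O_RDONLY)
    try:
        os.fsync(fd)
    finally:
        os.close(fd)
    return time.perf_counter() - start


def demo(size=1 << 20):
    """Hypothetical small-scale run; the comment above used a 10GB file."""
    with tempfile.TemporaryDirectory() as d:
        src = os.path.join(d, "src.bin")
        dst = os.path.join(d, "dst.bin")
        with open(src, "wb") as f:
            f.write(os.urandom(size))
        evict_from_page_cache(src)
        elapsed = timed_copy(src, dst)
        same_size = os.path.getsize(dst) == size
    return elapsed, same_size
```

Calling `demo()` returns the elapsed time and a sanity check on the copied size. For a fair comparison against a BBAPI transfer, the same eviction step would need to run before each timed copy.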

The 0B rank_0 file issue arises because it is a sparse file.

```
$ fiemap /mnt/bb_131fa614a608da727b038ed08e6eaad4/tmp/hutter2/scr.defjobid/scr.dataset.5/rank_0
ioctl success, extents = 0
```

Since BBAPI transfers extents, it would make sense...
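To illustrate what "sparse file with zero extents" means, here is a small sketch, assuming Linux and Python 3. It does not use the FIEMAP ioctl from the comment above. Instead it swaps in `lseek` with `SEEK_DATA`, a different interface that answers a similar question: does the file contain any data region at all? The helper names are hypothetical, and filesystems without hole support may report the whole file as data.

```python
import errno
import os
import tempfile


def make_sparse(path, size):
    """Create a file with logical length `size` without writing any data."""
    with open(path, "wb") as f:
        f.truncate(size)


def has_any_data(path):
    """Return True if lseek(SEEK_DATA) finds a data region at or after offset 0."""
    fd = os.open(path, os.O_RDONLY)
    try:
        os.lseek(fd, 0, os.SEEK_DATA)
        return True
    except OSError as e:
        if e.errno == errno.ENXIO:  # no data at or after the requested offset
            return False
        raise
    finally:
        os.close(fd)


def describe(path):
    """Return (logical size, bytes allocated on disk, has_any_data) for `path`."""
    st = os.stat(path)
    # st_blocks is counted in 512-byte units on Linux.
    return st.st_size, st.st_blocks * 512, has_any_data(path)


def demo(size=1 << 20):
    """Compare a hole-only file against a file with real data."""
    with tempfile.TemporaryDirectory() as d:
        sparse = os.path.join(d, "sparse.bin")
        dense = os.path.join(d, "dense.bin")
        make_sparse(sparse, size)
        with open(dense, "wb") as f:
            f.write(b"x" * 4096)
        return describe(sparse), describe(dense)
```

On a filesystem that tracks holes, the sparse file typically reports its full logical size with little or no allocated space and no data region. That matches the idea that an extent-based transfer would find nothing to move.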

> Can you create an issue describing the problem here: https://github.com/ibm/cast/issues

Done: https://github.com/IBM/CAST/issues/918

Another BBAPI observation: I used `test_ckpt` to create a 9GB checkpoint, and then killed it while it was flushing the checkpoint from SSD to GPFS (using BBAPI). The process...

@tgooding out of curiosity, can the throttling be adjusted or disabled?

@adammoody regarding "should we cancel/not-cancel existing transfers on AXL restart", I forgot about this thread from around a year ago: https://github.com/ECP-VeloC/AXL/issues/57

I'll put my comments in there since it already...