Adam Moody
Drops ```SCR_Start_checkpoint``` and ```SCR_Complete_checkpoint```. Adds an output flag to complete functions to return status to caller rather than using the return code. ``` SCR_Complete_restart(int valid, int* allvalid) SCR_Complete_output(int valid, int*...
A common pattern in some codes is to have rank 0 write a root file and then have every process, rank 0 included, write a per-process checkpoint file. The root...
SCR maintains an internal counter to track the order of checkpoints an application has written. With each new checkpoint, it increments the counter and records that value in the metadata...
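A minimal sketch of the counter behavior described above, assuming a simple in-memory state. The names `ckpt_state`, `ckpt_meta`, and `begin_checkpoint` are hypothetical and are not part of the SCR API; the real metadata layout may differ.

```c
#include <stdio.h>

/* Hypothetical per-checkpoint metadata record (not SCR's actual layout). */
typedef struct {
    int  checkpoint_id;  /* value of the counter when this checkpoint began */
    char name[64];       /* illustrative label for the checkpoint */
} ckpt_meta;

/* Hypothetical library state holding the running counter. */
typedef struct {
    int counter;         /* number of checkpoints started so far */
} ckpt_state;

/* Increment the counter for a new checkpoint and record the new value
 * in that checkpoint's metadata. Returns the assigned id. */
int begin_checkpoint(ckpt_state *s, ckpt_meta *m, const char *name)
{
    s->counter++;
    m->checkpoint_id = s->counter;
    snprintf(m->name, sizeof m->name, "%s", name);
    return m->checkpoint_id;
}
```

Because the id only ever grows, comparing two ids is enough to tell which checkpoint was written later.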
We may want to add support to give up restarting from a checkpoint if it is attempted too many times in a row. Consider a case where some checkpoint file...
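One way the retry limit proposed above might look, as a rough sketch only. The `restart_tracker` type, the `restart_attempt_allowed` function, and the `max_attempts` parameter are all hypothetical; nothing here is an existing SCR call or setting.

```c
#include <stdbool.h>

/* Hypothetical bookkeeping for consecutive restart attempts. */
typedef struct {
    int checkpoint_id;  /* checkpoint most recently attempted */
    int attempts;       /* consecutive attempts made from that checkpoint */
} restart_tracker;

/* Record one attempt to restart from checkpoint_id.
 * Returns true if the attempt is within the limit, or false once
 * max_attempts consecutive attempts have already been made, in which
 * case the caller would give up on this checkpoint. */
bool restart_attempt_allowed(restart_tracker *t, int checkpoint_id,
                             int max_attempts)
{
    /* Switching to a different checkpoint resets the count. */
    if (t->checkpoint_id != checkpoint_id) {
        t->checkpoint_id = checkpoint_id;
        t->attempts = 0;
    }
    if (t->attempts >= max_attempts) {
        return false;
    }
    t->attempts++;
    return true;
}
```

In practice the tracker would have to persist across job runs, for example alongside the checkpoint metadata, since each restart happens in a new run; this sketch keeps it in memory for brevity.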
Originally, SCR_ENABLE was a way to keep the SCR API from having any effect. It was most useful in the scripts, but in the API it essentially turned all...
Support more general node naming schemes in scr_hostlist processing. In particular, merge the extensions from Intel with those required for SDSC. Obtain requirements from ORNL while we're at it.
Because C provides a standard ABI, libscr can be compiled once and then linked to an app built with any compiler. However, we must compile libscrf for each Fortran compiler....
Add test cases to ensure that users can specify CNTL and CACHE directories via config files as mentioned in https://github.com/LLNL/scr/pull/109
Want to verify that things work in at least the following cases: 1) job ends normally (no failure) 2) job exits early while an async flush was ongoing and it...
If using scr_srun and the like, the user will need the install path corresponding to the executable they're using. That could be a pain. Perhaps a single system install of...