David Schaich
This may take some work, but should give a worthwhile performance boost. A good first step would be to check the benefits in MILC's [staggered multi-CG using QUDA](https://github.com/milc-qcd/milc_qcd/blob/master/generic_ks/d_congrad5_fn_gpu.c)...
This should be a straightforward (if minor) performance improvement for I/O: In */generic/io_lat.c, there are currently lots of `NUMLINK * sizeof(fmatrix)` multiplications inside loops over the lattice volume, which can...
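A minimal sketch of the hoisting described above, not the actual io_lat.c code. The `fmatrix` layout, the value of `NUMLINK`, and both function names are assumptions made up for illustration. Since `NUMLINK * sizeof(fmatrix)` is a compile-time constant when `NUMLINK` is a macro, an optimizing compiler will often fold it already, which is consistent with the gain being minor.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define NUMLINK 5  /* assumed value, for illustration only */

/* Hypothetical stand-in for the code's fmatrix type */
typedef struct { float e[2][2][2]; } fmatrix;

/* Before: the per-site byte count is recomputed inside the volume loop */
size_t copy_sites_naive(char *dst, const fmatrix *src, size_t volume) {
  size_t i, total = 0;
  assert(dst != NULL && src != NULL);
  for (i = 0; i < volume; i++) {
    memcpy(dst + i * NUMLINK * sizeof(fmatrix), src + i * NUMLINK,
           NUMLINK * sizeof(fmatrix));
    total += NUMLINK * sizeof(fmatrix);
  }
  return total;
}

/* After: the product is computed once, outside the loop */
size_t copy_sites_hoisted(char *dst, const fmatrix *src, size_t volume) {
  const size_t site_bytes = NUMLINK * sizeof(fmatrix);
  size_t i;
  assert(dst != NULL && src != NULL);
  for (i = 0; i < volume; i++)
    memcpy(dst + i * site_bytes, src + i * NUMLINK, site_bytes);
  return volume * site_bytes;
}
```

Both versions write identical bytes, so the change can be checked by comparing output buffers before and after.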
The current method of reading in runtime parameters from the command line seemed to lead to some trouble interfacing with performance monitoring tools. It may be more modern to read...
To help users keep track of how much memory is really being used, the usage due to the following FIELD_ALLOCS should be reported along the lines of the main fields...
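One way such reporting could work is to route allocations through a wrapper that keeps a running byte count. This is only a sketch under assumptions: `tracked_malloc`, `tracked_bytes`, and `report_field_usage` are hypothetical names, not part of the existing code, and the real FIELD_ALLOC macros may be structured differently.

```c
#include <stdio.h>
#include <stdlib.h>

/* Running total of bytes handed out through the wrapper (hypothetical) */
static size_t field_bytes = 0;

/* Allocate count elements of elem_size bytes and record the size */
void *tracked_malloc(size_t count, size_t elem_size) {
  void *p = malloc(count * elem_size);
  if (p != NULL)
    field_bytes += count * elem_size;  /* count successful allocations only */
  return p;
}

/* Total bytes recorded so far */
size_t tracked_bytes(void) {
  return field_bytes;
}

/* Print the running total in MB */
void report_field_usage(FILE *fp) {
  fprintf(fp, "Tracked field allocations: %.1f MB\n",
          (double)field_bytes / (1024.0 * 1024.0));
}
```

A call to `report_field_usage(stdout)` after setup would then print the total alongside whatever is already reported for the main fields.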
The eigensolver currently works with PRIMME v1.2.2. Changes to the PRIMME interface since then are incompatible with eig.c/complex.h.
Once the truncated links are deprecated we can implement a bit more code reuse. A bit more will also be allowed by implementing 2d_Q04 dimensional reduction center stabilization, as well...
This is mainly cosmetic, but becoming more worthwhile as the number of Twist_Fermion routines grows.
Is there a way to rearrange the stochastic mode number computation so that it does the inversions for all Omega simultaneously? Currently we have separate calls to the multi-shift solver...