Evan Weinberg
Evan Weinberg
Per discussion in #1056 , after the GK merge we should decide on a consistent naming convention for `Float` types, etc, used throughout the library, and make the global change....
There's currently an all-to-all of to, from precisions for the MG `copy_color_spinor_mg.*` files. Historically this split was done for compile time reasons. We should investigate if this is still a...
The routines to: * Add the identity to the coarse clover * Add a staggered mass to the coarse clover * Add a twist to the coarse clover Are all...
Revert #1152 (or re-write the files as appropriate) once the multi-rhs work is done.
Right now it saves a `double` precision gauge field regardless of the input precision---I need to fix that.
Described in title
Instead of using a traditional implementation of classical or modified Gram-Schmidt (or a hybrid thereof), (block-)orthonormalization can be formulated as a thin QR, which is implemented in practice via a...
* Modify dirac_[improved_]staggered.cpp to use the full operator for calling `MdagM` as opposed to separate even/odd parts. In theory this does the right thing: ``` Dslash(*tmp1, in, QUDA_INVALID_PARITY); DslashXpay(out, *tmp1,...
The routine `computeCoarseClover`: https://github.com/lattice/quda/blob/develop/include/kernels/coarse_op_kernel.cuh#L1014 Does not exploit a huge amount of parallelism as implemented, which turns into a bit of a nightmare when autotuning and could be a blocker in...
Double, recon 12 sees a boost. Half, recon 8 sees a regression. I don't have an apples-to-apples comparison for single (different recons), but they're included for posterity. ### With dynamic...