Arnaud A

Results 2 issues of Arnaud A

Very limited proof of concept PR to wrap around LAPACK's `dpstrf` and carry out the Cholesky Factorization of PSD matrices. In its current state of `ndarray_linalg` does not handle this...

I think I am missing something when going over the Triton implementation of LayerNorm in the tutorial. In the forward, we compute the mean as: ```python _mean = tl.zeros([BLOCK_SIZE], dtype=tl.float32)...