Robert van de Geijn
I’d like to be part of that discussion. > On Aug 29, 2021, at 10:48 AM, Devin Matthews wrote: > @fgvanzee let's talk about this; IIRC carouseling...
Jacob, Investigating bfloat16 is on our priority list. We are waiting for word on funding from a sponsor, which may bump it higher on the priority list. Robert
Sounds like ARM should sponsor this effort, so we can bump it up on our priority list! :-) Thank you for sharing.
Great project for an undergrad? > On Sep 24, 2021, at 3:19 PM, Devin Matthews wrote: > Looks like you have to use the PMU, so PAPI...
I introduced invscal in libflame, and that operation is now in BLIS as well. It performs x := (1/alpha) x. Much cleaner than passing 1/alpha into scal. About 10 years...
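To illustrate the interface point, here is a minimal sketch in plain Python. The helpers `invscal` and `scal` below are hypothetical stand-ins written for this example; they are not the libflame or BLIS routines, and the choice to divide each element (rather than multiply by a precomputed reciprocal) is an assumption of the sketch, not a statement about how either library implements the operation.

```python
def invscal(alpha, x):
    """Hypothetical helper: overwrite x with (1/alpha) * x, in place."""
    if alpha == 0:
        raise ZeroDivisionError("invscal: alpha must be nonzero")
    for i in range(len(x)):
        x[i] = x[i] / alpha  # assumption: divide element by element
    return x


def scal(alpha, x):
    """Hypothetical helper: overwrite x with alpha * x, in place."""
    for i in range(len(x)):
        x[i] = alpha * x[i]
    return x


x = [3.0, 6.0, 9.0]
invscal(3.0, x)        # the call states the intent directly

y = [3.0, 6.0, 9.0]
scal(1.0 / 3.0, y)     # the caller has to form the reciprocal first
```

With `invscal`, the caller never forms 1/alpha, so the intent is visible at the call site and any check for a zero alpha lives in one place.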
I am not one of the developers, although I have worked with them for many years. They are brilliant, but there are techniques that can be shared. May I suggest...
Thank you for reporting. I have fixed the link. When you get to GitHub, it gives an "Unable to render code block" error, but you can then download the file...
I think the point Nick is making: the implementation in a BLAS library is likely to be higher performing than the reference implementation in LAPACK or ScaLAPACK (or some other...