Damien L-G
Please post atomic_{fetch_op, op_fetch} timings before and after as well.
OK then please address Maarten's comments and we should be good.
Please add this to the 5.0 changelog and update the documentation as appropriate.
I only looked at one somewhat random build (Fedora+HPX+RelWithDebInfo) in the CI and the benchmark takes 90s more than what I could see in another PR. I am not thrilled...
> Each scalar type takes about 8s to run, and there are 16 such tests, for a total of about 2 minutes of additional tests. @crtrott please confirm that you...
You could post the diff with a patch that adds error handling on the host side for posterity.
I am glad you are looking, Stan, because the big elephant in the room will be execution spaces... Do you store these as static data members like you were doing...
We need to add a constructor and an `init()` overload that take an execution space argument.
> If it's a simple matter of adding new constructors, I'd be willing to...
> @dalg24 do we have a clear idea of how we name these though? Some of the ARM ones are just `ARMVX_PRODUCT` others are `ARMVX`? On the other hand most...
Retest this please (worried about the format)