thecppzoo

Results 16 comments of thecppzoo

I believe the essay is ready at this point. I ended up digressing and expanding points but hopefully without harming the main point.

Please prove the claim there's no advantage to using the older mechanism, for all cases.

Perhaps we can also make "widening multiplication", that doubles the lane size. For example, in x86-64, there are the instructions to multiply two register-size values and get a result of...

I can not resist to comment about how elegant this is all looking. The primitives of doubling/halving precision were a success.

This implementation might be simple enough, sure, but it can only accept lane sizes that have a power of two number of bits. Let's review if the implementation I made...

> @thecppzoo https://godbolt.org/z/5jdfffb1M I just did this: https://godbolt.org/z/cE1eoKM3d I am very surprised and disappointed that the generated code for powers of two is basically identical, we have now a good...