Lensman
Will have to be careful with __m64 datatypes. Probably have to convert them to __m128 equivalents, which may mean wasted ops unless the algorithms are also converted to take advantage of...
Nothing is wrong with it, but intrinsics that use that datatype are not supported on x86_64, so it will mean hand-coded assembly. I have converted hrendzsse to intrinsics, and...
I'm pretending I didn't see this one.
Just a quick note: if you want the C version of these functions to run faster, use this (inverse sqrt): inline float f_rsqrt( const float number ) { long i; float...
You don't need the second iteration of the Newton-Raphson approximation. It's adequate with one iteration in the renderer as the inputs are quantized from an original full precision sqrt, and...
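For anyone following along, here is a rough sketch of the single-iteration version being discussed. It is the widely published form of the trick, not necessarily the exact code from the post above (which is cut off). The name `rsqrt_one_iter` is made up for illustration, and it uses a fixed-width integer with `memcpy` because `long` is 64 bits on some 64-bit platforms.

```cpp
#include <cstdint>
#include <cstring>

// Approximate 1/sqrt(x) for positive, finite x.
// Hypothetical helper name; a sketch of the well-known bit trick,
// with only ONE Newton-Raphson refinement step.
inline float rsqrt_one_iter(float x) {
    static_assert(sizeof(float) == sizeof(std::uint32_t),
                  "this sketch assumes a 32-bit IEEE-754 float");

    // Reinterpret the float's bits as an integer without aliasing problems.
    std::uint32_t bits;
    std::memcpy(&bits, &x, sizeof bits);

    // Initial guess: the magic constant minus half the bit pattern.
    bits = 0x5f3759dfu - (bits >> 1);

    float y;
    std::memcpy(&y, &bits, sizeof y);

    // One Newton-Raphson step: y = y * (1.5 - 0.5 * x * y * y).
    return y * (1.5f - 0.5f * x * y * y);
}
```

With one step the relative error stays within a few tenths of a percent; whether that is good enough depends on the caller, as the post above argues for the renderer.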
That's right, it's the infamous code from Quake that has had whole articles written about it. The intrinsic is the reciprocal sqrt, which you will find in the v/hrend(z)sse part...
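As a point of comparison, this is roughly what the reciprocal sqrt intrinsic looks like when used on a single float. It is only a minimal sketch: `rsqrt_sse` is a hypothetical wrapper name, and it is not taken from the v/hrend(z)sse code itself. `_mm_rsqrt_ss` is the standard SSE intrinsic for the scalar approximate reciprocal square root.

```cpp
#include <xmmintrin.h>  // SSE intrinsics

// Approximate 1/sqrt(x) using the SSE scalar reciprocal-sqrt instruction.
// Hypothetical wrapper for illustration only.
inline float rsqrt_sse(float x) {
    __m128 v = _mm_set_ss(x);      // put x in the lowest lane, zero the rest
    __m128 r = _mm_rsqrt_ss(v);    // hardware approximation of 1/sqrt in lane 0
    return _mm_cvtss_f32(r);       // extract lane 0 as a float
}
```

The hardware result is an approximation too (roughly 12 bits of precision), so it is in the same spirit as the bit trick rather than a full-precision sqrt.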
I'll tidy up a bit, and push so you can have a look. When I say memory bound, in this instance, the renderer is trying to take advantage of lookup...
As far as intrinsics go, any intrinsics that use the __m64 datatype are not supported on x86_64; that's not to say that you can't use MMX registers in assembly. It...
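To make the __m64 point concrete, here is a small sketch of one way to do a 64-bit MMX-style operation with SSE2 instead, assuming the toolchain rejects __m64 intrinsics on 64-bit targets as described above. The function `add4_u16` is a made-up example, not something from Voxlap. It only uses the low half of the 128-bit register, which is exactly the kind of wasted capacity you get unless the algorithm is widened as well.

```cpp
#include <cstdint>
#include <emmintrin.h>  // SSE2 intrinsics

// Add four 16-bit values lane by lane (wrapping on overflow).
// With MMX this would be _mm_add_pi16 on __m64; here the same 64 bits
// of data sit in the low half of an __m128i and the upper half is unused.
inline void add4_u16(const std::uint16_t* a,
                     const std::uint16_t* b,
                     std::uint16_t* out) {
    // Load 64 bits from each input into the low half; upper half is zeroed.
    __m128i va = _mm_loadl_epi64(reinterpret_cast<const __m128i*>(a));
    __m128i vb = _mm_loadl_epi64(reinterpret_cast<const __m128i*>(b));

    // Eight 16-bit adds are performed, but only the low four carry data.
    __m128i sum = _mm_add_epi16(va, vb);

    // Store only the low 64 bits back to memory.
    _mm_storel_epi64(reinterpret_cast<__m128i*>(out), sum);
}
```

A side benefit of staying in XMM registers is that no `_mm_empty()` (EMMS) call is needed before later floating-point code, since the MMX/x87 register file is never touched.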
I've started using the groupings provided in Voxlap.h to break it up into chunks. I'm going to have to do some commercial work for a week so my activity will...
I'll do these splitting tasks first. Voxlap.cpp with ASM is 8000+ lines, for instance. We are setting ourselves up for trouble using IFDEFS, but we can turn this into...