Oscar Smith

Results: 353 comments of Oscar Smith

The `BFloat16(::BigFloat)` version of this will suffer from double rounding. I suggest looking at how the `Float16` conversion works in Julia.
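Double rounding can be demonstrated with the analogous built-in chain `Float64 → Float32 → Float16` (a hypothetical illustration, not code from the PR; the value is constructed to sit just above a rounding midpoint):

```julia
# A Float64 value just above the midpoint between two Float16 neighbors
# of 1.0: the tiny 2^-30 term is lost when rounding to Float32 first.
x = 1 + 2.0^-11 + 2.0^-30

direct  = Float16(x)           # Julia's direct conversion rounds correctly:
                               # x is above the midpoint, so 1 + 2^-10
chained = Float16(Float32(x))  # Float32(x) == 1 + 2^-11 exactly, an exact
                               # tie, which then rounds-to-even down to 1.0
```

The two results differ by one ulp; the chained version loses the sticky information that `x` was strictly above the tie.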

Yes, but getting the elementary functions correctly rounded isn't a requirement (or easy); correct rounding for arithmetic and conversion, on the other hand, is relatively easy.
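For arithmetic, the standard easy route is to compute in a sufficiently wide format and round once. A sketch using `Float16` in place of `BFloat16` (since the former is built in; `add16` is a hypothetical helper name):

```julia
# Correctly rounded Float16 addition: the Float64 sum of two Float16
# values is always exact (11-bit significands over Float16's exponent
# range fit comfortably in 53 bits), so the single final rounding back
# to Float16 is, by construction, a correct rounding of the true sum.
add16(a::Float16, b::Float16) = Float16(Float64(a) + Float64(b))
```

For example, `add16(Float16(1.0), Float16(2.0^-11))` hits an exact tie and correctly rounds to even, giving `Float16(1.0)`.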

As previously mentioned, this version has double rounding, which can be avoided by using the same algorithm the `Float16` conversion uses; still, this is better than not having the capability at all.
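Julia's own `Float16` conversion uses a table-driven method, but a simpler technique that also makes a two-step conversion safe is round-to-odd on the intermediate step. A sketch with `BigFloat → Float32 → Float16` standing in for the `BigFloat → BFloat16` case (`round_to_odd_f32` is a hypothetical helper, not from the PR):

```julia
# Round a BigFloat to Float32 with "round to odd": truncate toward zero,
# and if that was inexact, force the significand's last bit to 1. An odd
# intermediate significand can never sit exactly on a tie of the final,
# coarser rounding, so Float16(round_to_odd_f32(x)) rounds correctly.
function round_to_odd_f32(x::BigFloat)
    f = Float32(x, RoundToZero)
    if f != x  # inexact: make the significand odd (OR-ing 1 cannot carry)
        f = reinterpret(Float32, reinterpret(UInt32, f) | 0x00000001)
    end
    return f
end
```

On the double-rounding example `x = 1 + 2^-11 + 2^-30`, the odd intermediate keeps the final rounding above the tie, recovering the correct `Float16` result.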

Isn't this just 16 pairs of `evalpoly`s of degrees 1 to 16?

Yeah, my suggestion was a hybrid of this looped version with your unrolled version (where you precompute `cst` but otherwise leave it looped). I think that should vectorize well and...

In terms of speed, the generated function + `@nexprs` approach will be really good, and I don't think the cache pressure/load times will be much worse than the vectorized `evalpoly`s...
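A minimal sketch of the generated function + `@nexprs` pattern (the name and signature here are illustrative, not from the PR): the coefficient loop is unrolled at compile time, once per tuple length `N`.

```julia
using Base.Cartesian: @nexprs

# Horner evaluation with the loop fully unrolled by @nexprs. The
# generator runs once per N, splicing in a literal trip count, so the
# compiled method body is a straight chain of muladds with no loop.
@generated function horner_unrolled(x, c::NTuple{N,T}) where {N,T}
    quote
        r = c[$N]
        @nexprs $(N - 1) i -> (r = muladd(x, r, c[$N - i]))
        return r
    end
end
```

For `c = (1.0, 2.0, 3.0)` this expands to two `muladd`s and matches `evalpoly(x, c)` exactly.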

Can the Implicit Function Theorem be used here?

Is it also slower in the range where the old one was accurate?

Should this be merged, or is it outdated?

One thing worth noting is that although these techniques won't work for `Float64`, for `Float32` a lot of options open up, since you can do the internals in `Float64`...
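A hedged illustration of the wider-internals idea (a generic sketch, not the implementation under discussion; `sin32` is a hypothetical name): evaluate the whole function in `Float64` and round once at the end.

```julia
# Float32 sin via Float64 internals: Base's Float64 sin is accurate to
# well under 1 ulp in the wide format, so after the single rounding back
# to Float32 the result is faithfully rounded, and in practice correctly
# rounded for all but a tiny fraction of inputs.
sin32(x::Float32) = Float32(sin(Float64(x)))
```

The same trick works for any `Float32` elementary function with an accurate `Float64` counterpart; no analogous cheap wider format exists for `Float64` itself, which is why these techniques stop there.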