StrideArrays.jl icon indicating copy to clipboard operation
StrideArrays.jl copied to clipboard

Error when broadcasting across singleton dimension

Open johnbcoughlin opened this issue 2 years ago • 4 comments

Using the latest version, I get garbage when broadcasting across a singleton dimension:

pkg> st StrideArrays
...
  [d1fa6d79] StrideArrays v0.1.25
julia> f = @StrideArray rand(3, 3);

julia> v = LinRange(-5, 0, 3) |> collect;

julia> ^C

julia> f .* reshape(v, (1, :))
3×3 StrideArraysCore.StaticStrideArray{Float64, 2, (1, 2), Tuple{StaticInt{3}, StaticInt{3}}, Tuple{Nothing, Nothing}, Tuple{StaticInt{1}, StaticInt{1}}, 9} with indices static(1):static(3)×static(1):static(3):
 -0.416778  -0.925133      0.0
 -0.864525   0.0           6.65141e-315
  0.0        1.05309e-314  6.31801e-314

julia> f .* reshape(v, (1, :)) == Array(f) .* reshape(v, (1, :))
false

Note the numerical garbage in the trailing rows, which seems to come from uninitialized memory. This also occurs when broadcasting in-place; the destination will end up with stuff from uninitialized memory.

I think I have enabled boundschecking, via

StrideArraysCore.boundscheck() = true

johnbcoughlin avatar May 26 '23 19:05 johnbcoughlin

This is a LoopVectorization broadcasting issue. When contiguous axis are dynamically broadcasted, the behavior is undefined.

That is, when an axis that would normally be contiguous in memory is of size 1 but that this is not known through the type system, while the corresponding axis in other arrays is not of size 1, this results in undefined behavior.

A PR to LV to fix this would be welcome. It should be relatively straight forward, e.g. in vmaterialize!, check if this is the case, and if so fall back to some slow method. You could also generate code for an optimized case where linear indexing works.

You could look to the source of FastBroadcast.jl for ideas.

chriselrod avatar May 30 '23 15:05 chriselrod

I've run into this as well. How do I need to restrict the versions of Loopvectorisation and Stridearrays to avoid this?

Or at least I think my problem is the same??

using StrideArrays, LinearAlgebra
_A = zeros(10,10)
A = PtrArray(_A)
A[:, :] .= randn(10,10)
b = randn(10)
d = sum( b[j] * A[1,j] for j = 1:10 )
@show        dot(A[1, :], b) ≈ d   # false
@show dot((@view A[1,:]), b) ≈ d   # false
@show      sum(A[1,:] .* b ) ≈ d   # false
a1 = collect(@view A[1,:])
@show          sum(a1 .* b ) ≈ d   # true

cortner avatar Jul 10 '23 22:07 cortner

Or at least I think my problem is the same??

No, actually. Fixed with LoopVectorization 0.12.163 + StrideArraysCore 0.4.17.

chriselrod avatar Jul 11 '23 03:07 chriselrod

fantastic, thank you!

cortner avatar Jul 13 '23 04:07 cortner