arrow-julia icon indicating copy to clipboard operation
arrow-julia copied to clipboard

ACCESS_VIOLATION reading Arrow record batch

Open kobusherbst opened this issue 4 years ago • 0 comments

I get this access violation when I read the attached Arrow file. I suspect there is an empty record batch in the Arrow file:

Please submit a bug report with steps to reproduce this fault, and any error messages that follow (in their entirety). Thanks.
Exception: EXCEPTION_ACCESS_VIOLATION at 0x5fb26056 -- uncompress at .\boot.jl:0
in expression starting at E:\Source\PIPCore.jl\src\Examples\browsearrow.jl:11
uncompress at .\boot.jl:0
buildbitmap at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:416
build at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:566
unknown function (ip: 000000005fb28cc4)
build at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:402
unknown function (ip: 000000005fb28158)
iterate at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:378
unknown function (ip: 000000005fb21d3d)
copyto! at .\abstractarray.jl:847
_collect at .\array.jl:608 [inlined]
collect at .\array.jl:602 [inlined]
macro expansion at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:288 [inlined]
#82 at .\threadingconstructs.jl:169
unknown function (ip: 000000005fb22363)
jl_apply at /cygdrive/c/buildbot/worker/package_win64/build/src\julia.h:1703 [inlined]
start_task at /cygdrive/c/buildbot/worker/package_win64/build/src\task.c:839
Allocations: 27852364 (Pool: 27841941; Big: 10423); GC: 29

This is the code:

using Arrow
using DataFrames
using TableBrowse

function browsearrow(path::String, file::String)
  df = Arrow.Table(joinpath(path,"$(file).arrow")) |> DataFrame
  #TableBrowse.browse(df)
  println("Rows $(nrow(df))")
end

browsearrow("D:\\Data\\Demography\\AHRI\\DayExtraction","IndividualResidencyDays_batched")

IndividualResidencyDays_batched.zip The Arrow is in the zip archive

kobusherbst avatar Sep 20 '21 06:09 kobusherbst