arrow-julia
ACCESS_VIOLATION reading Arrow record batch
Reading the attached Arrow file with Arrow.Table crashes Julia with the access violation shown here. I suspect the file contains an empty record batch:
Please submit a bug report with steps to reproduce this fault, and any error messages that follow (in their entirety). Thanks.
Exception: EXCEPTION_ACCESS_VIOLATION at 0x5fb26056 -- uncompress at .\boot.jl:0
in expression starting at E:\Source\PIPCore.jl\src\Examples\browsearrow.jl:11
uncompress at .\boot.jl:0
buildbitmap at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:416
build at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:566
unknown function (ip: 000000005fb28cc4)
build at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:402
unknown function (ip: 000000005fb28158)
iterate at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:378
unknown function (ip: 000000005fb21d3d)
copyto! at .\abstractarray.jl:847
_collect at .\array.jl:608 [inlined]
collect at .\array.jl:602 [inlined]
macro expansion at C:\Users\kobus.herbst\.julia\packages\Arrow\PQ5Mm\src\table.jl:288 [inlined]
#82 at .\threadingconstructs.jl:169
unknown function (ip: 000000005fb22363)
jl_apply at /cygdrive/c/buildbot/worker/package_win64/build/src\julia.h:1703 [inlined]
start_task at /cygdrive/c/buildbot/worker/package_win64/build/src\task.c:839
Allocations: 27852364 (Pool: 27841941; Big: 10423); GC: 29
This is the code:
using Arrow
using DataFrames
using TableBrowse
function browsearrow(path::String, file::String)
    df = Arrow.Table(joinpath(path,"$(file).arrow")) |> DataFrame
    # TableBrowse.browse(df)
    println("Rows $(nrow(df))")
end
browsearrow("D:\\Data\\Demography\\AHRI\\DayExtraction","IndividualResidencyDays_batched")
IndividualResidencyDays_batched.zip (the Arrow file is inside this zip archive)
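One way to check the empty-batch suspicion might be to walk the file one record batch at a time rather than loading everything through Arrow.Table. The sketch below assumes the installed Arrow.jl version provides Arrow.Stream, which yields each record batch as a table. The name batchsizes is a hypothetical helper, not part of the package. If an empty batch really is the trigger, the crash may still occur here, but the last line printed should show which batch was reached.

```julia
using Arrow
using Tables

# Hypothetical helper: return the row count of every record batch in `file`,
# printing each count as it goes so progress is visible if the process crashes.
function batchsizes(file::AbstractString)
    sizes = Int[]
    # Assumption: Arrow.Stream iterates the file batch by batch, in order.
    for (i, batch) in enumerate(Arrow.Stream(file))
        cols = Tables.columns(batch)
        names = Tables.columnnames(cols)
        # A batch with no columns is counted as having zero rows.
        n = isempty(names) ? 0 : length(Tables.getcolumn(cols, first(names)))
        push!(sizes, n)
        println("batch $i: $n rows")
        flush(stdout)
    end
    return sizes
end
```

Calling batchsizes(joinpath(path, "$(file).arrow")) with the same arguments as browsearrow would list the batches; a count of 0 just before the fault would support the empty-batch theory.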