incubator-gluten icon indicating copy to clipboard operation
incubator-gluten copied to clipboard

[VL] Result mismatch issues Tracker

Open zhouyuan opened this issue 1 year ago • 42 comments

Backend

VL (Velox)

Bug description

There are several data mismatch issues either related with operator or functions. Some of the fixes are landed in Gluten, and some are in Velox repo. We will use this issue to track the status as these are critical for production envs.

  • [x] complex datatype return wrong value, disabled in Gluten now
  • [x] parquet scan + filter pushdown wrongly return "", should return null.
  • [x] distinct hash agg + spill returned duplicated keys.
  • [x] max_by function return wrong result
  • [x] sortBeforeRepartition
  • [ ] cast(sum(decmial(20,4)), float)
  • [x] get_json_object({"dScore":0.0215434648799772}, "$.dScore") (Fixed in Velox)
  • [ ] cast(string as bigint)
  • [x] cast(double as decimal)
  • [ ] array_size(null) (issue: https://github.com/apache/incubator-gluten/issues/5248)
  • [x] round(avg(cast(col as double)), 4) #5366
  • [x] isNull and isNotNull in filter condition #5670
  • [x] from_unixtime with overflowed (Fixed by https://github.com/facebookincubator/velox/pull/9836)
  • [ ] date_format (https://github.com/apache/incubator-gluten/issues/5524)
  • [x] cast integer as binary (https://github.com/apache/incubator-gluten/issues/5073)
  • [ ] regexp_replace('a{bc', '\{', '\[') (#6224)
  • [ ] LEGACY timeParserPolicy (#6227)
  • [ ] FlushableAgg (#6630)
  • [ ] if expression (#6673)
  • [ ] weekOfYear (#6784)
  • [ ] round (#6827)
  • [ ] cast string as date (#6828)
  • [ ] nested decimal arithmetic expressions (#7082)
  • [ ] date format week year (#7069)
  • [ ] Large timestamp outside of range (#7109)
  • [ ] Aggregate window gets the wrong result (#7194)
  • [x] Diff of in_or_and (#7362)
  • [ ] hash agg output wrong result (#7494)

zhouyuan avatar Feb 05 '24 08:02 zhouyuan

#4678 issue in hashagg

FelixYBW avatar Feb 08 '24 18:02 FelixYBW

https://github.com/oap-project/gluten/issues/4587

Currently we disabled all complex data read

FelixYBW avatar Feb 26 '24 21:02 FelixYBW

https://github.com/oap-project/gluten/pull/4818

zhouyuan avatar Mar 05 '24 05:03 zhouyuan

https://github.com/oap-project/gluten/pull/4872

zhouyuan avatar Mar 07 '24 03:03 zhouyuan

https://github.com/apache/incubator-gluten/issues/4891

kecookier avatar Mar 08 '24 05:03 kecookier

https://github.com/apache/incubator-gluten/issues/4928

kecookier avatar Mar 12 '24 05:03 kecookier

https://github.com/apache/incubator-gluten/issues/4930

kecookier avatar Mar 12 '24 05:03 kecookier

https://github.com/apache/incubator-gluten/issues/4947

kecookier avatar Mar 13 '24 08:03 kecookier

3 issues we met:

  1. parquet scan + filter pushdown wrongly return "", should return null. Fixed by https://github.com/facebookincubator/velox/pull/9129
  2. distinct hash agg + spill returned duplicated keys. https://github.com/facebookincubator/velox/issues/9219
  3. max_by function return wrong result

FelixYBW avatar Mar 20 '24 02:03 FelixYBW

  1. distinct hash agg + spill returned duplicated keys.

@FelixYBW Has this issue not been fixed by https://github.com/apache/incubator-gluten/pull/4443 ?

ulysses-you avatar Mar 20 '24 02:03 ulysses-you

@FelixYBW Has this issue not been fixed by #4443 ?

No, it's tested from main branch. A new issue

FelixYBW avatar Mar 20 '24 02:03 FelixYBW

No, it's tested from main branch. A new issue

https://github.com/facebookincubator/velox/issues/9219

FelixYBW avatar Mar 22 '24 20:03 FelixYBW

  1. max_by function return wrong result

@yma11 Did you submit a fix to the issue?

FelixYBW avatar Mar 22 '24 20:03 FelixYBW

  1. max_by function return wrong result

@yma11 Did you submit a fix to the issue?

Not yet. Only have pushed to golden branch and will submit one in Velox upstream.

yma11 avatar Mar 23 '24 04:03 yma11

#5253

NEUpanning avatar Apr 02 '24 04:04 NEUpanning

#5253

Looks the issue of get_json_object. @PHILO-HE maybe we need a fully tests of json functions, like the re2.

FelixYBW avatar Apr 02 '24 16:04 FelixYBW

#5253

Looks the issue of get_json_object. @PHILO-HE maybe we need a fully tests of json functions, like the re2.

@FelixYBW, I will do that. Thanks!

PHILO-HE avatar Apr 03 '24 00:04 PHILO-HE

https://github.com/apache/incubator-gluten/issues/5248

FelixYBW avatar Apr 03 '24 12:04 FelixYBW

https://github.com/apache/incubator-gluten/issues/5366

kecookier avatar Apr 11 '24 07:04 kecookier

#5366

UPdated desc. thank you. do you know which function (cast, avg, round ) caused the issue?

FelixYBW avatar Apr 11 '24 16:04 FelixYBW

#5372

FelixYBW avatar Apr 11 '24 19:04 FelixYBW

  1. max_by function return wrong result

@yma11 Did you submit a fix to the issue?

Not yet. Only have pushed to golden branch and will submit one in Velox upstream.

@FelixYBW This fix should be done at cpp side. The formal fix is in PR. Can you help review it?

yma11 avatar Apr 28 '24 12:04 yma11

@FelixYBW This fix should be done at cpp side. The formal fix is in PR. Can you help review it?

Is it a Gluten issue? I'd think veox has some bug here.

FelixYBW avatar Apr 29 '24 17:04 FelixYBW

@FelixYBW This fix should be done at cpp side. The formal fix is in PR. Can you help review it?

Is it a Gluten issue? I'd think veox has some bug here.

Yes. I think so. It's caused by the additional projects we added before/after shuffle. The logic of partial/final handle in Velox upstream has no problem. The ideal way is to add struct support for shuffle in Gluten so that we can remove the hack.

yma11 avatar Apr 30 '24 00:04 yma11

@PHILO-HE Any update of the issues here?

FelixYBW avatar May 07 '24 17:05 FelixYBW

https://github.com/apache/incubator-gluten/issues/5682

zjuwangg avatar May 10 '24 04:05 zjuwangg

#5701

NEUpanning avatar May 11 '24 11:05 NEUpanning

@PHILO-HE Any update of the issues here?

@FelixYBW, Some were actually fixed. Just updated the list. Will fix or seek help to fix other issues.

PHILO-HE avatar May 13 '24 02:05 PHILO-HE