
WritePartitioner step count law fails sometimes

Open johnynek opened this issue 7 years ago • 1 comment

We really want O(1) steps per partition so that Cascading can plan each one quickly, but we still don't understand why the law is failing. It could be that our model of how Cascading plans is off.
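For readers who have not seen the test, here is a rough sketch of the shape of the bound, read straight off the test name ("at most 2 + hashJoin steps"). It is a toy model, not scalding's code: `Node`, `Source`, `MapOnly`, `HashJoin`, and `StepLaw` are all hypothetical names, and treating "hashJoin" as the number of hash joins in the graph is an assumption.

```scala
// Toy model only: none of these types exist in scalding.
sealed trait Node
case object Source extends Node
final case class MapOnly(in: Node) extends Node
final case class HashJoin(left: Node, right: Node) extends Node

object StepLaw {
  // Count the hash joins in the toy graph (assumed meaning of "hashJoin").
  def hashJoins(n: Node): Int = n match {
    case Source         => 0
    case MapOnly(in)    => hashJoins(in)
    case HashJoin(l, r) => 1 + hashJoins(l) + hashJoins(r)
  }

  // Upper bound from the test name: at most 2 + hashJoin steps.
  def maxSteps(n: Node): Int = 2 + hashJoins(n)

  // `plannedSteps` stands in for whatever step count the planner reports.
  def lawHolds(n: Node, plannedSteps: Int): Boolean =
    plannedSteps <= maxSteps(n)
}
```

With this model, a graph with no hash joins has a bound of 2, so a plan with 4 steps violates it: `StepLaw.lawHolds(MapOnly(Source), 4)` is `false`.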

Here is a failure:

[info] - When we break at forks we have at most 2 + hashJoin steps *** FAILED ***
[info]   TestFailedException was thrown during property evaluation.
Reporter completed abruptly with an exception after receiving event: TestFailed(Ordinal(0, 1187),TestFailedException was thrown during property evaluation.
  Message: 4 was not less than or equal to 3 optimized: WithDescriptionTypedPipe(TrappedPipe(WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(HashCoGroup(WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(Mapped(CoGroupedPipe(MapGroup(Pair(IdentityReduce(scala.math.Ordering$Int$@7ab78f27,WithDescriptionTypedPipe(FlatMapValues(Mapped(HashCoGroup(SourcePipe(com.twitter.scalding.source.FixedTypedText(source_0)),IdentityReduce(scala.math.Ordering$Int$@7ab78f27,WithDescriptionTypedPipe(MergedTypedPipe(FlatMapped(IterablePipe(List(0, 2147483647, 1544762017, 0, 624052887, 0, -287604905, 307396159, -610267734, -971337476, -875627901, 1, 2147483647, -1828436720, -2147483648, -1846923827, -1938708654, -1, 1, -1880693903, -123621097, -2147483648, 840086264, -1, 716705883, -2147483648, -1237321546, -2018664413, -471628389, -1327949610, 2147483647, 1470526490, -1650836342, 915790024, 1772020311, -615139303, -2114954200, -673076699, -601159290, 183995577, 1194220429, -1, 2056294449, -560847744, 0, 0, -2147483648, 1843138917, -2056707332, 324239878, 2038838732, 1278259345, 1120086645, 2147483647, 1904161096, 1, 0, 0, -2147483648, -1165218113, 209432443, 0, 1947467572, 1, 2147483647, 1242019707, 2147483647, 2072354530, 1709997299, -2147483648, -1908641464, 1781430029, -1, 265802944, 2147483647, 1380531264, -433867587, 0, -17033375, -2147483648, -1443528651, 1, 745727866, -845251200, 1, -1490936578, -1184499549, 0, 1672666737)),<function1>),FlatMapped(IterablePipe(List(1, -51928547, 1, -302353733, -181736864, -2147483648, -571666277, -1, -675270052, 36854174, 1847828467, -2147483648, -2147483648, 237525651, 1, 178499055, -2147483648, 338940843, -1054752338, -1, 829014631, 0, 1830694228, 1364013977, 0, -1, -1455941074, -85033653, -2147483648, -1210275416, -1937417356, 0, 867753776, 0, -1685381654, 70745480, -1783259469, -1376740730, 0, 1451049565, 1009239461, -2147483648, -1, -1, 
773067303)),<function1>)),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),None,List(),ReflexiveEquality()),<function3>),<function1>),<function1>),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),None,List(),ReflexiveEquality()),IdentityReduce(scala.math.Ordering$Int$@7ab78f27,WithDescriptionTypedPipe(SourcePipe(com.twitter.scalding.source.FixedTypedText(source_0)),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),None,List(),ReflexiveEquality()),<function3>),<function2>)),<function1>),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),<function1>),List((com.stripe.dagon.FunctionK$class.apply(FunctionK.scala:11),true))),IdentityReduce(com.twitter.scalding.typed.TypedPipe$CrossPipe$$anon$2@c0d067cd,WithDescriptionTypedPipe(WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(SourcePipe(com.twitter.scalding.source.FixedTypedText(dmscynvQ1el)),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),<function1>),List((com.stripe.dagon.FunctionK$class.apply(FunctionK.scala:11),true))),List((com.stripe.dagon.FunctionK$class.apply(FunctionK.scala:11),true))),None,List(),ReflexiveEquality()),<function3>),List((com.stripe.dagon.FunctionK$class.apply(FunctionK.scala:11),true))),<function1>),List((com.stripe.dagon.FunctionK$class.apply(FunctionK.scala:11),true))),<function1>),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true))),com.twitter.scalding.source.FixedTypedText(bvvG7hu51rucqPmfeleytvwep89DofrhhuKscdxO1rcopty5avyu7Fzrnpuq),Single(com.twitter.scalding.TupleGetter$IntGetter$@188ea6ee)),List((org.scalacheck.Gen$R$class.map(Gen.scala:237),true)))

johnynek, Feb 20 '18 18:02

Another failure:

[info] - When we break at forks we have at most 2 + hashJoin steps
Reporter completed abruptly with an exception after receiving event: TestFailed(Ordinal(0, 1185),TestFailedException was thrown during property evaluation.
  Message: 4 was not less than or equal to 2
  Location: (WritePartitionerTest.scala:102)
  Occurred when passed generated values (
    arg0 = WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(CrossPipe(WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(CrossPipe(WithDescriptionTypedPipe(ForceToDisk(EmptyTypedPipe),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),WithDescriptionTypedPipe(Filter(WithDescriptionTypedPipe(Fork(WithDescriptionTypedPipe(Filter(WithDescriptionTypedPipe(Fork(EmptyTypedPipe),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),org.scalacheck.GenArities$$Lambda$441/597763888@64c4c3bd),List((org.scalacheck.Gen$R.map(Gen.scala:237),true)))),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),org.scalacheck.GenArities$$Lambda$441/597763888@36a7692b),List((org.scalacheck.Gen$R.map(Gen.scala:237),true)))),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),<function1>),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),WithDescriptionTypedPipe(Mapped(WithDescriptionTypedPipe(FlatMapped(IterablePipe(List(2147483647, -2147483648, -1945538304, -2147483648, -2147483648, -1369151060, -1913477688)),org.scalacheck.GenArities$$Lambda$441/597763888@27dabd65),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),<function1>),List((org.scalacheck.Gen$R.map(Gen.scala:237),true)))),List((org.scalacheck.Gen$R.map(Gen.scala:237),true))),<function1>),List((org.scalacheck.Gen$R.map(Gen.scala:237),true)))

johnynek, Feb 20 '18 18:02