Jason Dai

Results 2 issues of Jason Dai

In Spark, it is common practice (and usually preferred) to launch a large number of small tasks, which unfortunately can create an even larger number of very small shuffle files...