Daniil Ivanik

Results 7 comments of Daniil Ivanik

`SELECT _file, count() FROM s3('https://s3-us-west-1.amazonaws.com/umbrella-static/top-1m-2018-01-1*.csv.zip :: *.csv', 'NOSIGN') GROUP BY _file ORDER BY _file; ` |file | count()| | --------------------------- | ------ | | top-1m-2018-01-11.csv.zip | 1000000 | | top-1m-2018-01-12.csv.zip...

Hi, @taiyang-li. I will be glad to review your PR. Could you, please, merge master and resolve conflicts to relaunch the CI system? Thanks in advance!

@taiyang-li, could you explain, please, why we can't use bzip2 library and need our own implementation of bzip2 decompression in SplittableBzip2ReadBuffer.cpp?

> In hadoop, single bzip2 compressed file is splitted into serveral splits, and each task processes one split, which means Hadoop MR could process single bzip2 file in parallel, especially...

Hi, @wudidapaopao. I would be glad to review your feature. Could you, please, merge master?

I think, that you can put the content from **Documentation entry for user-facing changes** as you can write. Could you please remove it from this PR (I think that CI...