kafka-connect-hdfs
kafka-connect-hdfs copied to clipboard
it will create many small files?
Hi, everyone , recently i used hdfs sink connector, and i am worried about that if the data is consistenctly input to kafka, will it generate some small files, which are not good for hdfs namenode ?
Depends on your output format, flush size, and partitioner. It is possible to generate several GB files, in my experience