accumulo-testing

Explore writing a specialized summarizer for bulk ingest

Open keith-turner opened this issue 6 years ago • 0 comments

To debug a recent bulk ingest test, I wrote the following summarizer. It counts the number of times each UUID is seen. I used it to count the number of entries each MapReduce job had created.

package test.ci;

import org.apache.accumulo.core.client.summary.CountingSummarizer;

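/**
 * Counts how many entries carry each UUID. Assumes every value starts with
 * the UUID followed by ':'.
 */
public class CiUuidSummarizer extends CountingSummarizer<String> {

  @Override
  protected Converter<String> converter() {
    // Emit the text before the first ':' in the value as the counter key.
    return (k, v, c) -> c.accept(v.toString().split(":")[0]);
  }

}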
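The counting that the converter drives can be tried outside Accumulo. The following is a minimal standalone sketch, not part of the original summarizer: the class `UuidCountSketch`, its helper methods, and the sample values are all hypothetical, and the only thing taken from the code above is the key extraction `split(":")[0]`.

```java
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical stand-in for the summarizer's counting logic; no Accumulo classes are used.
public class UuidCountSketch {

  // Same key extraction as the converter above: the text before the first ':'.
  static String uuidOf(String value) {
    return value.split(":")[0];
  }

  // Tallies how many values share each extracted key.
  static Map<String, Long> countByUuid(List<String> values) {
    Map<String, Long> counts = new TreeMap<>();
    for (String value : values) {
      counts.merge(uuidOf(value), 1L, Long::sum);
    }
    return counts;
  }

  public static void main(String[] args) {
    // Made-up values; real values would carry the UUID of the job that wrote them.
    List<String> values = List.of("job-a:0001", "job-a:0002", "job-b:0001");
    System.out.println(countByUuid(values)); // prints {job-a=2, job-b=1}
  }
}
```

This only exercises the key extraction and tallying. To get per-UUID counts from a real table, the summarizer itself would still need to be configured on that table.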

keith-turner Jul 18 '19 19:07