jaeger-clickhouse

Decide on sharding function for distributed table

Open pavolloffay opened this issue 2 years ago • 2 comments

The distributed table can be created with one of several sharding functions, for example rand() or cityHash64(traceID) - see https://clickhouse.tech/docs/en/sql-reference/functions/hash-functions/.

The hash functions take an argument, so we should consider passing traceID to keep all spans from a single trace on the same shard.

CREATE TABLE IF NOT EXISTS jaeger_spans AS jaeger_spans_local ENGINE = Distributed('{cluster}', default, jaeger_spans_local, cityHash64(traceID));
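
To illustrate the trade-off between the two sharding functions, here is a rough Python simulation, not ClickHouse code. It assumes a hypothetical cluster of 4 equally weighted shards, where the shard is picked as the sharding expression modulo the number of shards. It uses blake2b from the standard library purely as a stand-in for cityHash64, and the trace IDs are made up. It counts how many shards each trace ends up spread across.

```python
import hashlib
import random
from collections import defaultdict

NUM_SHARDS = 4  # hypothetical cluster size, equal shard weights assumed


def shard_by_trace_id(trace_id: str, num_shards: int = NUM_SHARDS) -> int:
    """Pick a shard from a hash of the trace ID.

    blake2b is only a stand-in for cityHash64(traceID); the point is that
    the result is deterministic for a given trace ID.
    """
    digest = hashlib.blake2b(trace_id.encode("utf-8"), digest_size=8).digest()
    return int.from_bytes(digest, "big") % num_shards


def shard_randomly(rng: random.Random, num_shards: int = NUM_SHARDS) -> int:
    """Pick a shard at random, mimicking a rand() sharding expression."""
    return rng.randrange(num_shards)


def shards_per_trace(spans, pick_shard):
    """Return {trace_id: number of distinct shards holding its spans}."""
    placement = defaultdict(set)
    for trace_id, _span_id in spans:
        placement[trace_id].add(pick_shard(trace_id))
    return {trace_id: len(shards) for trace_id, shards in placement.items()}


# Made-up workload: 5 traces with 50 spans each.
spans = [(f"trace-{t}", s) for t in range(5) for s in range(50)]

rng = random.Random(0)  # fixed seed so the run is repeatable
hashed = shards_per_trace(spans, shard_by_trace_id)
randomized = shards_per_trace(spans, lambda _trace_id: shard_randomly(rng))

print("hash(traceID):", hashed)
print("rand():       ", randomized)
```

With hashing on the trace ID, every trace maps to exactly one shard, so a lookup by traceID could in principle be answered by a single shard. With random placement the spans of a trace are scattered, which tends to even out load but means every shard holds part of each trace. Whether that matters in practice is what the tests on a real deployment would need to show.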

pavolloffay Jul 27 '21 10:07

@chhetripradeep would you like to take this on and run some tests on your deployment?

pavolloffay Jul 27 '21 10:07

Sure.

chhetripradeep Jul 27 '21 11:07