jaeger-clickhouse
jaeger-clickhouse copied to clipboard
Decide on sharding function for distribbuted table
The distributed table could be created with multiple sharding functions: rand()
, cityHash64(traceID)
- see https://clickhouse.tech/docs/en/sql-reference/functions/hash-functions/.
The hash functions take an argument, we should consider using traceID
to keep data from a single trace in the same location.
CREATE TABLE IF NOT EXISTS jaeger_spans AS jaeger_spans_local ENGINE = Distributed('{cluster}', default, jaeger_spans_local, cityHash64(traceID));
@chhetripradeep would you like to take this on and run some tests on your deployment?
Sure.