
help in understanding output file format

Open asafalina opened this issue 3 years ago • 1 comments

Hi

I was running cleora using the command below:

cleora-v1.2.3-x86_64-apple-darwin --columns transient::cluster_id StarNode --dimension 1024 -n 5 --input fb_cleora_input_star.txt -o output

I got something similar to the following output: (I added some spacing just for better readability)

39361 1024
1        1    0.029419877 ..... -0.0073362226
16260    7    0.033474464 ..... -0.00906976
.
.
.
22459    1    0.010709517 ..... 0.026430061

I can't figure out what the 1st (1, 16260, ..., 22459) and 2nd (1, 7, ..., 1) columns represent.

Thanks

asafalina avatar Oct 16 '22 20:10 asafalina

Hi @asafalina !

The first column is the entity; in your case it should be cluster_id. The second column is the occurrence count, i.e. how many times the entity occurs in the data.

https://github.com/Synerise/cleora/blob/master/src/persistence.rs#L44
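If it helps when reading the file back, here is a minimal Python sketch of a parser for that layout. It assumes fields are whitespace-separated, and that the header line ("39361 1024" above) holds the number of entities followed by the embedding dimension; the thread does not confirm the header meaning, although 1024 matches the --dimension value. The function name parse_cleora_output and the sample values are made up for illustration and are not part of cleora.

```python
from typing import Dict, List, Tuple


def parse_cleora_output(text: str) -> Tuple[int, int, Dict[str, Tuple[int, List[float]]]]:
    """Parse cleora-style text output (hypothetical helper, not part of cleora).

    Assumed layout:
      header line:  <entity_count> <dimension>
      other lines:  <entity> <occurrence> <v_1> ... <v_dimension>
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    # Assumption: the header holds the entity count and the vector dimension.
    n_entities, dim = (int(x) for x in lines[0].split())

    rows: Dict[str, Tuple[int, List[float]]] = {}
    for ln in lines[1:]:
        parts = ln.split()
        entity = parts[0]           # first column: the entity identifier
        occurrence = int(parts[1])  # second column: how often the entity occurs
        vector = [float(v) for v in parts[2:]]
        if len(vector) != dim:
            raise ValueError(f"expected {dim} values for {entity}, got {len(vector)}")
        rows[entity] = (occurrence, vector)
    return n_entities, dim, rows


# Toy input with made-up numbers (2 entities, dimension 3), not real cleora output.
sample = "2 3\n1 1 0.5 -0.25 0.125\n16260 7 0.1 0.2 0.3\n"
n_entities, dim, rows = parse_cleora_output(sample)
occurrence, vector = rows["16260"]
```

Entities are kept as strings because identifiers are not necessarily numeric.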

Hope it helps.

piobab avatar Oct 21 '22 15:10 piobab