cleora
Help understanding the output file format
Hi
I was running cleora using the command below:
cleora-v1.2.3-x86_64-apple-darwin --columns transient::cluster_id StarNode --dimension 1024 -n 5 --input fb_cleora_input_star.txt -o output
I got something similar to the following output: (I added some spacing just for better readability)
39361 1024
1 1 0.029419877 ..... -0.0073362226
16260 7 0.033474464 ..... -0.00906976
.
.
.
22459 1 0.010709517 ..... 0.026430061
I can't figure out what the 1st (1, 16260, ..., 22459) and 2nd (1, 7, ..., 1) columns represent.
Thanks
Hi @asafalina !
First column: the entity. In your case it should be cluster_id.
Second column: the occurrence count, i.e. how many times the entity occurs in the data.
https://github.com/Synerise/cleora/blob/master/src/persistence.rs#L44
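To make the layout concrete, here is a minimal Python sketch that reads a file shaped like the output above. It is not part of cleora: the function name parse_cleora_output and the sample data are made up for illustration. It assumes whitespace-separated fields, and it assumes the first line ("39361 1024" above) is a header holding the number of entities and the embedding dimension, which is inferred from the 1024 matching --dimension 1024.

```python
from typing import Dict, List, Tuple


def parse_cleora_output(text: str) -> Tuple[int, int, Dict[str, Tuple[int, List[float]]]]:
    """Parse text shaped like the output shown above (hypothetical helper).

    Assumed layout:
      line 1: <entity_count> <dimension>
      others: <entity> <occurrence> <v_1> ... <v_dimension>
    """
    lines = [ln for ln in text.splitlines() if ln.strip()]
    entity_count, dimension = (int(x) for x in lines[0].split())

    rows: Dict[str, Tuple[int, List[float]]] = {}
    for ln in lines[1:]:
        parts = ln.split()
        entity = parts[0]            # 1st column: the entity id
        occurrence = int(parts[1])   # 2nd column: how often the entity occurs
        vector = [float(x) for x in parts[2:]]  # remaining columns: the embedding
        if len(vector) != dimension:
            raise ValueError(f"expected {dimension} values for {entity}, got {len(vector)}")
        rows[entity] = (occurrence, vector)
    return entity_count, dimension, rows


# Tiny made-up sample with dimension 3 instead of 1024.
sample = "2 3\n1 1 0.1 0.2 0.3\n16260 7 -0.1 0.0 0.5\n"
entity_count, dimension, rows = parse_cleora_output(sample)
print(entity_count, dimension, rows["16260"][0])
```

Entities are kept as strings here because they are identifiers, not necessarily numbers.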
Hope it helps.