TopoPyScale icon indicating copy to clipboard operation
TopoPyScale copied to clipboard

`point_id`, `cluster_labels` and `ID_col` are all confusing and create problem.

Open ArcticSnow opened this issue 1 year ago • 0 comments

I am running into problem with the confusion around the definition and usage of the variables df_centroids.point_id, df_centroids.cluster_labels and the implementation of non numeric point_id.

I think this must be clarified: So df_centroids is the central table keeping track of the point at which downscaling occurs. These points have a name, lat, lon and many other attributes.

  • point_id: originally was loosely defined as the index of each point in the table, as well as a surrogate of a name of point. It will be split in 2:
    • point_name that is a string,
    • point_ind that will be an integer corresponding to the table index df_centroids.index
  • cluster_labels: output of the kmean algo. This is an integer.

ArcticSnow avatar Jan 23 '24 12:01 ArcticSnow