CloudForest icon indicating copy to clipboard operation
CloudForest copied to clipboard

Optimization for numerical features with few values.

Open ryanbressler opened this issue 12 years ago • 0 comments

Add a new numeric feature type (and detect such features on data load) that uses a pre stored list of all the distinct values instead of sorting on each split.

I suspect this will be faster for sparse features, or feature types like hamming scat that have mostly one value and for ordinal features with few values.

It could also support optimized mode finding for ordinal regression.

ryanbressler avatar Oct 10 '13 19:10 ryanbressler