Ben Sadeghi

40 comments by Ben Sadeghi

@cstjean Yes, changing the values field from an array of labels to, say, a Dict of counts is the right way to go. My concern is with the AdaBoost routines,...

@cstjean I started a new type which includes a Dict of counts, and updated build_tree() to take advantage of it:
```Julia
immutable LeafC
    majority::Any
    counts::Dict{Any,Int64}
end
```
Surprisingly, from my...
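
For readers on Julia 1.x, the idea of a leaf that stores label counts rather than the raw labels might be sketched roughly as follows. This is only an illustration: the `CountLeaf` type and its constructor are hypothetical names, not part of DecisionTree.jl, and the type parameter `T` is an assumption used in place of the `Any`-typed fields.

```julia
# Hypothetical sketch (not DecisionTree.jl's API): a leaf that keeps
# per-label counts instead of the full array of labels.
struct CountLeaf{T}
    majority::T            # most frequent label in the leaf
    counts::Dict{T,Int}    # label => number of samples with that label
end

# Build a leaf from the labels that reached it.
# Assumes `labels` is non-empty; ties are broken by Dict iteration order.
function CountLeaf(labels::AbstractVector{T}) where {T}
    counts = Dict{T,Int}()
    for label in labels
        counts[label] = get(counts, label, 0) + 1
    end
    majority = first(labels)
    best = 0
    for (label, n) in counts
        if n > best
            majority, best = label, n
        end
    end
    return CountLeaf{T}(majority, counts)
end

leaf = CountLeaf(["a", "b", "a"])
```

Storing counts keeps the memory per leaf proportional to the number of distinct labels rather than the number of samples, while class proportions remain recoverable from `counts`.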

@opterix There has been some progress on this front with the DecisionTree v0.8.1 release (requires Julia v0.7-v1.0). The native data types `Node` and `Leaf` are now typed, and so you'll...

@ppalmes "Feature Importance" capabilities have not been implemented. But please feel free to code up an implementation and submit a pull request. Thanks

This sounds like a great idea, and let's thank Iain for sorting out the reduction scheme. Unfortunately I don't have much free time right now, but feel free to implement...

Could you please provide steps to reproduce this fault? As is, it's difficult to troubleshoot the issue with this error trace.

I'm not sure if that is currently possible. Upon adding new processors, the package functions (e.g. build_tree) need to be sent over to them. So something like this returns a...

Unfortunately, there is currently no support for missing values. This is on the roadmap, but has yet to be implemented. One option is to remove the samples/rows which contain missing...
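
As a rough illustration of the row-removal option, a small helper in current Julia might look like the sketch below. The function name `drop_missing_rows` is hypothetical, and the sketch assumes missing entries are represented by Julia's `missing` value, which may differ from the representation in use when the comment was written.

```julia
# Hypothetical helper (not part of DecisionTree.jl): drop every sample whose
# feature row or label contains `missing`.
function drop_missing_rows(features::AbstractMatrix, labels::AbstractVector)
    # keep[i] is true when row i has no missing feature and a non-missing label
    keep = [!any(ismissing, view(features, i, :)) && !ismissing(labels[i])
            for i in axes(features, 1)]
    return features[keep, :], labels[keep]
end

X = [1.0 2.0; missing 3.0; 4.0 5.0]
y = ["a", "b", "c"]
Xc, yc = drop_missing_rows(X, y)
```

The returned arrays keep their original element types (for example `Union{Missing,Float64}`), so a conversion to a concrete element type may still be needed before passing them to a function that expects one.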

@rpoplin Here are some guidelines for dealing with missing values: http://people.eecs.ku.edu/~jerzy/b24-miss.pdf @kmsquire Yeah, perhaps it's time to have DataFrames as a dependency and handle NAs properly. I'll try to have...

@Eight1911 Great to hear about your implementations of GradientBoost and AdaBoost! And yes, totally agree with you that the current `Leaf` struct is unnecessarily bloated and wasteful when it comes...