John
John
There's something semantically unclear here---the log_multi reduction is made to make choices in time O(log(number of classes) ) but --probabilities inherently requires O(number of classes) computational time. So implementing something...
There may be something that can be done here without damaging computational complexity. However, there is a complication: --log_multi trains individual nodes of the tree in a manner which is...
Does --plt work for you instead? (This is a recent addition.) This provides sublinear prediction time for multilabel classification using a proper scoring rule, so the scores would make more...
The essential issue here is that the error stream needs to be redirected into the report stream in daemon mode. VW isn't setup to do this easily. The simplest clean...
This is definitely a "patches welcome" kind of issue.
@jmorra @deaktator your thoughts here?
I'm unsure, but several more people are becoming involved so it's possible we can engineer a more comprehensive release process.
Is there evidence that this is helpful in practice? I'm not sure about whether or not disabling things is helpful in practice. For sure, the logic behind the updates does...
@kkchaitu27 , specifying the action dependent features for continuous actions seems nonviable since the size of an input example would be effectively infinite---you need to specify feature values for _every_...
I think I understand now. You want to be able to constrain the range of actions for individual events, but not to specify the features of every action (which could...