Adam Li comments

Results 492 comments of


                                            Adam Li

Specifying dim_contiguous causes seg faults in PatchObliqueForestClassifier

It seems this test works when n_estimators is low, but not when I ramp to `50` or maybe `100`. This makes me think there is a sampling edge case error...

Specifying dim_contiguous causes seg faults in PatchObliqueForestClassifier

I think the issue is isolated to when dim_contiguous=False you can test this with a single tree iterated over 100-1000 different random seeds. When dim_contiguous=True , seems there is consistently...

Need to track constant features in multi-view decision trees

This was first noticed while testing in https://github.com/neurodata/scikit-tree/pull/172

How to import the experimental functions?

`from sktree.experimental import mutual_info_ksg` works for me. FYI this submodule is quite experimental and untested

How to import the experimental functions?

Hmm hard to say, since I cannot reproduce said issue. Can you provide all the information required in a bug report? https://github.com/neurodata/scikit-tree/issues/new?assignees=&labels=bug&projects=&template=bug_report.md&title=

Implement canonical correlation forests

CCA = SVD + QR decomposiiton. So someone just needs to find a C++/Cython robust implementation we can link to and import within our Cython code. Maybe peruse scipy/numpy compiled-code...

Implement canonical correlation forests

As a first step, one can implement just a CCA computation in Cython and then write a `cpdef` wrapper to expose it in Python for testing. The second step would...

Implement GRF criterion with refactored internals that subclass RegressionCriterion

Some thoughts here in increasing order of complexity: ### Criterion The criterion we are trying to implement is a generalization that stems from the GRF paper: ``` # the general...

Implement GRF criterion with refactored internals that subclass RegressionCriterion

## Splitter When it comes to the splitter, the code is a bit more simple, as most of the computation is passed off to the Criterion object. However, this [splitter](https://github.com/py-why/EconML/blob/main/econml/tree/_splitter.pyx)...

Adding streaming trees for all oblique split classifiers

The commit here should enable the partial_fit without really too much effort: https://github.com/neurodata/scikit-learn/pull/54/commits/368df7a381b14952f9cee477a7e81b3768dcd2d0