Trane
An open-source Python library for automated prediction engineering
- Add integration tests with Covid, Bike, and Chicago examples
- Removed the following files and folders:
  - `.idea`
  - `Example/medical_no_show.csv`
  - `trane-examples/`
  - `taxi_data/`
  - `Trane-Demos/`
  - `flight-delay/`
  - ...
- Fix all docstrings.
- Evaluator and FeatureToolsWrapper
- Give a clear explanation about
- Fix Load and Store for prediction problems
- Used to store executable code for cutoff...
Trane needs an easier way to add custom operations. Currently, external plugin operations are not allowed. The bottleneck is that we need to maintain a list of operations so that we...
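One common way to accept external operations without a hard-coded list is a registry that operation classes add themselves to. The sketch below is illustrative only: `OPERATION_REGISTRY`, `register_operation`, `build_operation`, and `GreaterThanRowOp` are hypothetical names chosen for this example and are not part of Trane's API.

```python
# Hypothetical sketch: none of these names come from Trane itself.
OPERATION_REGISTRY = {}


def register_operation(cls):
    """Class decorator that records an operation class under its class name."""
    name = cls.__name__
    if name in OPERATION_REGISTRY:
        raise ValueError(f"operation {name!r} is already registered")
    OPERATION_REGISTRY[name] = cls
    return cls


@register_operation
class GreaterThanRowOp:
    """Example plugin operation: keep only values above a threshold."""

    def __init__(self, threshold):
        self.threshold = threshold

    def execute(self, values):
        return [v for v in values if v > self.threshold]


def build_operation(name, **kwargs):
    """Look up a registered operation by name and instantiate it."""
    return OPERATION_REGISTRY[name](**kwargs)
```

With this design, a third-party module only needs to import the decorator and apply it; the core library can then enumerate `OPERATION_REGISTRY` instead of maintaining the list by hand.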
[`trane.utils.generate_nl_description`](https://github.com/HDI-Project/Trane/blob/master/trane/utils/generate_nl_description.py#L11) currently assumes exactly 4 operations in the order filter -> row -> transformation -> aggregation. It should be able to handle multiple or no operations in any category. Including...
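As a rough illustration of handling multiple or no operations per category, the sketch below groups operations by category and skips empty ones. The four category names come from the issue text; the function name `describe_operations`, the `(category, text)` input format, and the output wording are assumptions made for this example, not Trane's implementation.

```python
# Hypothetical sketch: the input format and output wording are assumptions.
CATEGORY_ORDER = ("filter", "row", "transformation", "aggregation")


def describe_operations(operations):
    """Build a description from (category, text) pairs.

    Each category may appear zero, one, or several times.
    """
    grouped = {category: [] for category in CATEGORY_ORDER}
    for category, text in operations:
        if category not in grouped:
            raise ValueError(f"unknown operation category: {category!r}")
        grouped[category].append(text)

    parts = []
    for category in CATEGORY_ORDER:
        texts = grouped[category]
        if texts:  # skip categories with no operations
            parts.append(f"{category}: " + " and ".join(texts))
    return "; ".join(parts) if parts else "no operations"
```

For example, `describe_operations([("filter", "a > 1"), ("filter", "b < 2"), ("aggregation", "sum")])` yields a description with two filters, no row or transformation step, and one aggregation.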
We need to come up with an accurate and understandable name for the values that operations use to perform execution. For instance, a greater-than row operation requires a value...
Right now, Trane's natural language system is part of `PredictionProblem.__str__`. It may be better to move it entirely outside of Trane and to use descriptions of operations.
Dependent on https://github.com/trane-dev/Trane/issues/140
```python
def find_threshold_to_maximize_uncertainty(
    self,
    df,
    label_col,
    entity_col,
    max_num_unique_values=10,
    max_number_of_rows=2000,
    random_state=None,
):
    original_threshold = self.threshold
    unique_vals = sample_unique_values(
        df[label_col],
        max_num_unique_values,
        random_state,
    )
    # if len(df) > max_number_of_rows:
    #     df = df.sample(max_number_of_rows,...
```