FedericoTrifoglio
Is there a way to change the colour of the progress bar in handler_progress? Or another handler? Possibly something like this? ` handler_progress(format = ":spin :current/:total (:message) [:bar] :percent in...
Since the Mahalanobis distances are compared to a critical value using a [Chi-Squared distribution](https://github.com/dvgodoy/handyspark/blob/0fb4c8707b347470b508eac22e76df857dae7463/handyspark/sql/dataframe.py#L397), should this method only be used if the columns are all Normally distributed?
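For reference, the comparison being asked about can be sketched in plain Python for the two-column case. This is only an illustration, not handyspark's implementation; the function names `mahalanobis_sq_2d` and `chi2_critical_2df` and the toy data are made up for the example. The chi-squared cutoff is justified when the columns are jointly (multivariate) normal, since the squared Mahalanobis distance then follows a chi-squared distribution with as many degrees of freedom as there are columns (approximately so when the mean and covariance are estimated from the data). With two degrees of freedom the quantile has the closed form `-2 * ln(1 - p)`, so no statistics library is needed.

```python
import math


def mahalanobis_sq_2d(points):
    """Squared Mahalanobis distance of each (x, y) point from the sample mean."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    # Entries of the sample covariance matrix (n - 1 denominator).
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    det = sxx * syy - sxy ** 2
    distances = []
    for x, y in points:
        dx, dy = x - mx, y - my
        # d^T S^{-1} d, using the explicit inverse of a 2x2 matrix.
        distances.append((syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det)
    return distances


def chi2_critical_2df(p):
    """Quantile of the chi-squared distribution with 2 degrees of freedom."""
    return -2.0 * math.log(1.0 - p)


# Toy data (hypothetical): a 3x3 grid of points plus one far-away point.
points = [(1, 2), (2, 1), (2, 3), (3, 2), (2, 2),
          (1, 1), (3, 3), (1, 3), (3, 1), (10, 10)]
d2 = mahalanobis_sq_2d(points)
cutoff = chi2_critical_2df(0.95)
flagged = [i for i, d in enumerate(d2) if d > cutoff]
print(cutoff, flagged)
```

On this toy data only the last point, (10, 10), exceeds the 95% cutoff of about 5.99. If the columns are far from normal, the same cutoff no longer corresponds to a 5% tail, which is the concern raised above.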
In chapter 3 (p. 96) we see how a 90% precision classifier can be created. What about a 90% recall classifier? One would be tempted to do (at least I was)...
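One way to think about the recall case, sketched in plain Python and independent of the book's code: recall can only increase as the decision threshold is lowered, so the threshold of interest is the highest one at which recall first reaches the target. The function name `threshold_for_recall` and the toy labels and scores below are assumptions made for illustration.

```python
def threshold_for_recall(y_true, scores, target_recall=0.90):
    """Return the highest threshold whose recall is >= target_recall.

    A sample is predicted positive when its score is >= the threshold.
    """
    n_pos = sum(y_true)
    if n_pos == 0:
        raise ValueError("y_true contains no positive samples")
    # Walk candidate thresholds from high to low; recall never decreases
    # along the way, so the first hit is the highest valid threshold.
    for t in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= t)
        if tp / n_pos >= target_recall:
            return t
    return None  # unreachable: the lowest score always gives recall 1.0


# Toy data (hypothetical): 14 samples, 10 of them positive.
y_true = [1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 0, 1, 0, 1]
scores = [0.95, 0.90, 0.85, 0.80, 0.75, 0.70, 0.65,
          0.60, 0.55, 0.50, 0.45, 0.40, 0.30, 0.20]

threshold = threshold_for_recall(y_true, scores)
predicted = [s >= threshold for s in scores]
tp = sum(1 for y, p in zip(y_true, predicted) if y == 1 and p)
recall = tp / sum(y_true)
precision = tp / sum(predicted)
print(threshold, recall, precision)
```

Picking the highest qualifying threshold matters because every lower threshold also satisfies the recall target, but usually at the cost of precision.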
Given this pipeline

```
num_cols = ['longitude', 'latitude', 'housing_median_age', 'total_rooms',
            'total_bedrooms', 'population', 'households', 'median_income']
cat_cols = ['ocean_proximity']

num_transformer = Pipeline(steps=[
    ('impute', SimpleImputer(strategy='median')),
    ('add_feats', CombinedAttributesAdder(add_bedrooms_per_room=True)),
    ('scaler', StandardScaler())
])

cat_transformer = Pipeline(steps=[...
```
Hi, chaid_table.R has been really helpful, but when creating node_table, only filtering complete cases may result in 0 frequencies for the outcome levels (if each case has at least one...