Aurélien Geron
Aurélien Geron
Hi @KhidirA , If I understand correctly you were confused about the arguments to the `pd.read_csv()` function in chapter 1: ```python oecd_bli = pd.read_csv(datapath + "oecd_bli_2015.csv", thousands=',') gdp_per_capita = pd.read_csv(datapath...
That's a great point, thanks @julienvos . The `ColumnTransformer` didn't exist when I wrote the book, so things were a bit ugly back then. You are right that we could...
Good question @anyuese . Because `cv=3`, the `cross_val_predict()` function will split the dataset into 3 distinct parts (called "folds"), then it will create 3 clones of the `sgd_clf`, and it...
Yes, KNN can be very slow. Try running the code on 1/10th of the dataset to see if it runs smoothly. Normally the cross val functions should be about close...
Hi @qy-yang , Great question! I haven't checked, but I suppose these are all the *distinct* scores.
Hi @lvnilesh , Thanks for this PR. However, I'm confused: Matplotlib is already part of Anaconda, so why do we need to pip install it?
Hi @cmcgrath1982 , The problem may come from the definition of the `DataFrameSelector` class, or the `CombinedAttributesAdded` class. Make sure their `fit()` and `transform()` methods both take two arguments: `X`...
You need the latest version of Scikit-Learn: 0.20.3 ``` pip3 install -U scikit-learn ```
Hi @229539687 , thanks for your feedback. You probably have an older revision of the book. You can check which release you have on the page immediately before the table...
Hi @yashGuleria , Thanks for your question. This `DataFrameSelector` class was a custom class. It has to be defined as indicated in the book, if you have one of the...