franzi

Results 13 comments of franzi

I think this is going in the right direction, but what I think would be coolest is to have the option of what you call `page` and `collapsed` kind of...

Thanks for flagging this - I'll have a look & add some extra checks!

The time that the .transform() function needs scales with the number of features that are computed, not so much with the number of data points it is applied to (since...

you could probably parallelize the for loop, i.e., apply the transformations for each feature in parallel and then concatenate all the results and add them to the dataframe, but the...

From the StackOverflow question it seems the problem was because one of your original input features had a standard deviation of 0 (i.e., all the same values)? This would also...

Yes, randomness is used in a lot of places in the code, both explicitly in places you've mentioned as well as internally (e.g. in some of the models). And it...

Can you please give some more details about after which function call this happens and what your data looks like? Specifically: is your data a pandas dataframe with column names?...

Hey @fkhan00 : the main bottleneck is the feature generation using the sympy library (which computes the different feature formulas to make sure there are no redundant features being generated...

At the top [here](https://www.nltk.org/_modules/nltk/tokenize/casual.html) are some nice regexs, incl. this one for phone numbers: ```python r""" (?: (?: # (international) \+?[01] [ *\-.\)]* )? (?: # (area code) [\(]? \d{3}...

ok, I think this might work: `r"(?:^|(?