Stuck on calculating features

Open cameron-hobbs opened this issue 1 year ago • 8 comments

cam@DESKTOP-41QNH42:/mnt/c/Users/ahobb/PycharmProjects/cammm$ poetry run python cammm/feature_eng/extract.py
 99%|████████████████████████████████████████████████████████████████████████████████ | 273/276 [00:19<00:00, 98.37it/s]

Hi, calculating features gets completely stuck (I think while acquiring a lock). It was running fine before with the exact same data and the exact same code, but I interrupted a run with a keyboard interrupt, and now it gets stuck whenever I try to run it again. Here is some of the code:

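```python
# Imports were not part of the posted excerpt; these are the usual locations.
import pandas as pd
from pycatch22 import catch22_all
from tsflex.features import FeatureCollection, MultipleFeatureDescriptors
from tsflex.features.integrations import catch22_wrapper, tsfresh_settings_wrapper
from tsfresh.feature_extraction import MinimalFCParameters

# `path`, `feature_cols`, and `last_value` are defined elsewhere in the script (not shown).

# Load the data and index it by timestamp (epoch seconds -> datetime).
df = pd.read_csv(path)
df["receipt_timestamp"] = pd.to_datetime(df["receipt_timestamp"], unit="s")
df = df.set_index("receipt_timestamp")
df = df[feature_cols]

fc = FeatureCollection(
    [
        # catch22 features on every column, for each window size
        MultipleFeatureDescriptors(
            functions=catch22_wrapper(catch22_all),
            series_names=feature_cols,
            windows=["3s", "5s", "10s", "30s", "60s"],
            strides="1s",
        ),
        # minimal tsfresh feature set on every column, for each window size
        MultipleFeatureDescriptors(
            functions=tsfresh_settings_wrapper(MinimalFCParameters()),
            series_names=feature_cols,
            windows=["3s", "5s", "10s", "30s", "60s"],
            strides="1s",
        ),
        # last value of "mid" over a 3 s window
        MultipleFeatureDescriptors(
            functions=[last_value],
            series_names=["mid"],
            windows=["3s"],
            strides="1s",
        ),
    ]
)

feature_data = fc.calculate(df, return_df=True, approve_sparsity=True, show_progress=True)
```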

cameron-hobbs, Aug 11 '24 08:08
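
One way to check the "stuck on acquiring a lock" guess is to have Python dump the stack of every thread once the call has been running for too long. The sketch below is not from the original post: it uses only the standard-library `faulthandler` module, and `run_with_hang_dump` is a hypothetical helper name. It only shows where the main process is waiting, not what any worker processes are doing.

```python
import faulthandler
import sys


def run_with_hang_dump(func, timeout_s=30.0, file=None):
    """Call func(); if it is still running after timeout_s seconds,
    write the traceback of every thread in this process to `file`.

    Hypothetical helper for illustration. `file` must be a real file
    object (it needs a file descriptor); it defaults to sys.stderr.
    """
    if file is None:
        file = sys.stderr
    # Arm a watchdog that dumps all thread stacks after the timeout.
    # exit=False means the program keeps running after the dump.
    faulthandler.dump_traceback_later(timeout_s, exit=False, file=file)
    try:
        return func()
    finally:
        # Disarm the watchdog if func() finished (or raised) in time.
        faulthandler.cancel_dump_traceback_later()


# Usage with the snippet from the issue (assumes `fc` and `df` exist):
# feature_data = run_with_hang_dump(
#     lambda: fc.calculate(df, return_df=True, approve_sparsity=True, show_progress=True),
#     timeout_s=60.0,
# )
```

If the dump shows the main thread waiting on a lock or on a multiprocessing pool, a reasonable next step is to rerun the same call with `n_jobs=1` (assuming the installed tsflex version accepts that argument on `calculate`) to see whether the hang is specific to the parallel path.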