luminaire icon indicating copy to clipboard operation
luminaire copied to clipboard

Top 5% volatility filtering location DataExploration

Open vincent1in opened this issue 1 year ago • 1 comments

Code in exploration/data_exploration.py

def _shift_intensity(self, change_points=None, df=None, metric=None):
        """
        This function computes the Kullback_Leibler divergence of the the time series around a changepoint detected by the
        pelt_change_point_detection() function. This considers Gaussian assumption on the underlying data distribution.

        :param list change_points: A list storing indices of the potential change points
        :param pandas.dataframe df: A pandas dataframe containing time series ignoring the top 5% volatility
        :param str metric: A string in the dataframe column names that contains the time series
        :return: A list containing the magnitude of changes for every corresponding change points
        :rtype: list
        """

Question After looking through the code, I was wondering where the top 5% volatility dropped? It doesn't look like it's filtered anywhere before it.

Thank you for the help!

vincent1in avatar Jul 31 '23 02:07 vincent1in

@vincentlin2 This is a documentation error. Thanks for catching!

We had a logic of detecting changepoints after removing the volatilities beyond p95 which we later removed.

sayanchk avatar Aug 02 '23 19:08 sayanchk