antropy Update Katz

Fixed Katz_FD implementation by utilizing Euclidean distances. Also modified test cases to include tests for new method.

Sep 29 '24 03:09 PiethonProgram

Thank you @PiethonProgram — I'll aim to review the PR later this week. In the meantime, please make sure that the CI tests and lint tests are all passing.

Sep 30 '24 08:09 raphaelvallat

Applied formatting changes and should pass lint cases.

Changed self.assertEqual(np.round(katz_fd(x_k), 3), VALUE) from VALUE = 5.783 to 1.503 to reflect formula change Please double check calculations ^

Sep 30 '24 15:09 PiethonProgram

Should I be concerned with the unsuccessful checks : Python tests / build (macos-latest, 3.8) (pull_request) Python tests / build (macos-latest, 3.9) (pull_request) Python tests / build (macos-latest, 3.10) (pull_request)

It seems the issue is related to GitHub versions, and not the code itself.

Oct 05 '24 20:10 PiethonProgram

Yeah don't worry about the CI failures, I need to make some upgrade to the GitHub Actions workflow. Thanks!

Oct 06 '24 11:10 raphaelvallat

Reformatted and "simplified" code.

Note : Black formatting caused line 208 in fractal.py to expand into 7 lines (likely due to nested parenthesis restrictions) <= No impact on functions, just aesthetics.

Oct 10 '24 20:10 PiethonProgram

Hey,

Thanks again for the implementation and the PR.

I was looking at the output of the proposed implementation, and it seems that, regardless of the complexity of the input time-series, the output values are always between 1 and 1.01. The original paper described that the value should be ~1.5 for random signals:

Under this formulation, fractal dimensions range from D = 1.0, for strainght lines through approximately D = 1.15 for random-walk waveforms, to D approaching 1.5 for the most convoluted waveforms.

import stochastic.processes.noise as sn
rng = np.random.default_rng(seed=42)

X = np.vstack([
    np.arange(1000),
    np.sin(2 * np.pi * 1 * np.arange(1000) / 100),
    sn.FractionalGaussianNoise(hurst=0.1, rng=rng).sample(1000),
    sn.FractionalGaussianNoise(hurst=0.9, rng=rng).sample(1000),
    rng.random(1000),
    rng.random(1000)]
)
katz_new(X)

array([1.        , 1.00014255, 1.03727155, 1.00000014, 1.01079616, 1.01072551])

I also have found this paper which uses the same definition as the current implementation in Antropy:

fractalfract-08-00009.pdf

I also took a stab at improving the current implementation based on the neurokit2 implementation:

def katz(x):
    # Define total length of curve
    dists = np.abs(np.diff(x, axis=axis))
    length = np.sum(dists, axis=axis)
    
    # Average distance between successive points
    a = np.mean(dists, axis=axis)
    
    # Compute farthest distance between starting point and any other point
    d = np.max(np.abs(x.T - x[..., 0]).T, axis=axis)
    return np.log10(length / a) / (np.log10(d / a))

Oct 26 '24 12:10 raphaelvallat

Hi @raphaelvallat , thanks for the catch.

I will look into it for possible issues
I believe the implementation in the paper refers to single dimension time-series
will build and test on this. Solid implementation

Oct 29 '24 09:10 PiethonProgram

Hi @raphaelvallat ,

I wasn’t able to replicate the exact output from your implementation, but I’ve adjusted it to ensure it functions as expected. It now produces consistent results.
It appears that the approach described in the report differs from our current implementation, though both seem to yield the same output when tested.

Please let me know your thoughts.

Nov 09 '24 13:11 PiethonProgram

Hi, the changes have been made. Please let me know if there is anything else, and thank you for your cooperation as well.

Nov 14 '24 19:11 PiethonProgram

Update Katz_FD