stumpy
stumpy copied to clipboard
STUMPY is a powerful and scalable Python library for modern time series analysis
This is a work-in-progress pull request addressing issue #248, i.e. implementation of a method that reduces cache misses while computing the Matrix Profile using `stump`. # How it works right...
This unit test causes a failure: ``` import numpy as np import numpy.testing as npt import naive from stumpy import stump, config import pandas as pd def test_stump_identical_subsequence_self_join(): seed =...
This PR addresses issue #592. In this PR, we want to extend the function `stump` and the related ones so that it returns Top-K Nearest Neighbors Matrix Profile (i.e. the...
It would be nice to add a tutorial(s) that reproduces the [Matrix Profile Top Ten](https://github.com/TDAmeritrade/stumpy/files/4446948/zhu2020.pdf) paper. The accompanying data at their Google sites page can be [found here](https://sites.google.com/site/matrixprofiletopten/). It might...
This PR resolves issue #610 by adding a new config variable to reset the already-calculated pearson value to `1.0` when it exceeds the threshold set by the config variable.
I have been reading the module `mpdist` and I noticed: * The description of the output of `_mpdist_vect` is missing from its docstring * The description of `μ_Q` and `σ_Q`...
An initial tutorial has already been created [here](https://github.com/TDAmeritrade/stumpy/blob/main/docs/Tutorial_Pan_Matrix_Profile.ipynb) We need to add a final example where there are [two different window sizes within the same dataset](https://arxiv.org/pdf/1802.04883.pdf). The data can be...
Currently, `stumpy.stump` and `stumpy.stumped` can account for constant regions. So, when two subsequences are being compared and one subsequence is constant (and the second is not constant) then the pearson...
It seems that `stumpy.stumpi.py` needs to calculate `matrix profile (mp)` in the beginning of the algorithm. https://github.com/TDAmeritrade/stumpy/blob/576de379994896063be71d264e48a08ae17476cf/stumpy/stumpi.py#L118 And, then it will be updated quickly for each new individual, scaler value...
As our test suite gets longer, the `coverage` tests, which are executed in pure Python, will continue to need more time to complete. GIven that Github Actions has a job...