rocket
rocket copied to clipboard
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels
ROCKET · MINIROCKET · HYDRA
ROCKET
ROCKET: Exceptionally fast and accurate time series classification using random convolutional kernels.
Data Mining and Knowledge Discovery / arXiv:1910.13051 (preprint)
Most methods for time series classification that attain state-of-the-art accuracy have high computational complexity, requiring significant training time even for smaller datasets, and are intractable for larger datasets. Additionally, many existing methods focus on a single type of feature such as shape or frequency. Building on the recent success of convolutional neural networks for time series classification, we show that simple linear classifiers using random convolutional kernels achieve state-of-the-art accuracy with a fraction of the computational expense of existing methods. Using this method, it is possible to train and test a classifier on all 85 ‘bake off’ datasets in the UCR archive inPlease cite as:
@article{dempster_etal_2020, author = {Dempster, Angus and Petitjean, Fran\c{c}ois and Webb, Geoffrey I}, title = {{ROCKET}: Exceptionally Fast and Accurate Time Series Classification Using Random Convolutional Kernels}, journal = {Data Mining and Knowledge Discovery}, year = {2020}, volume = {34}, number = {5}, pages = {1454--1495} }
sktimeAn implementation of ROCKET (with basic multivariate capability) is available through sktime. See the examples.
MINIROCKET *NEW*
MINIROCKET is up to 75× faster than ROCKET on larger datasets.
Results
UCR Archive
- Training / Test Split
- Resamples
Scalability
- Training Set Size
- Time Series Length
Code
rocket_functions.pyRequirements
- Python;
- Numba;
- NumPy;
- scikit-learn (or equivalent).
Example
from rocket_functions import generate_kernels, apply_kernels from sklearn.linear_model import RidgeClassifierCV [...] # load data, etc. # generate random kernels kernels = generate_kernels(X_training.shape[-1], 10_000) # transform training set and train classifier X_training_transform = apply_kernels(X_training, kernels) classifier = RidgeClassifierCV(alphas = np.logspace(-3, 3, 10), normalize = True) classifier.fit(X_training_transform, Y_training) # transform test set and predict X_test_transform = apply_kernels(X_test, kernels) predictions = classifier.predict(X_test_transform)Reproducing the Experiments
reproduce_experiments_ucr.pyArguments: -d --dataset_names : txt file of dataset names -i --input_path : parent directory for datasets -o --output_path : path for results -n --num_runs : number of runs (optional, default 10) -k --num_kernels : number of kernels (optional, default 10,000) Examples: > python reproduce_experiments_ucr.py -d bakeoff.txt -i ./Univariate_arff -o ./ > python reproduce_experiments_ucr.py -d additional.txt -i ./Univariate_arff -o ./ -n 1 -k 1000
reproduce_experiments_scalability.pyArguments: -tr --training_path : training dataset (csv) -te --test_path : test dataset (csv) -o --output_path : path for results -k --num_kernels : number of kernels Examples: > python reproduce_experiments_scalability.py -tr training.csv -te test.csv -o ./ -k 100 > python reproduce_experiments_scalability.py -tr training.csv -te test.csv -o ./ -k 1000Acknowledgements
We thank Professor Eamonn Keogh and all the people who have contributed to the UCR time series classification archive. Figures in our paper showing the ranking of different classifiers and variants of ROCKET were produced using code from Ismail Fawaz et al. (2019).
:rocket: