Sean M. Law
It may be useful to reduce `numba` compilation time or, at the least, understand where the time is going. There were some initial comments [here](https://numba.discourse.group/t/reduce-compilation-time/969/9). This is NOT to be...
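As a rough way to see where the time is going, one can compare the first call of a jitted function (which includes compilation) against a second, already-compiled call. The sketch below is only an illustration: `half_sum` and `time_first_and_warm_call` are hypothetical names, and the `ImportError` fallback exists only so the snippet still runs where `numba` is not installed (in which case both timings measure plain Python).

```python
import time

try:
    from numba import njit
except ImportError:
    # Fallback (assumption for portability): without numba, nothing is compiled
    def njit(func):
        return func


@njit
def half_sum(n):
    # Toy workload: 0.5 * (0 + 1 + ... + n-1)
    total = 0.0
    for i in range(n):
        total += i * 0.5
    return total


def time_first_and_warm_call(func, *args):
    """Return (first_call_seconds, warm_call_seconds) for func(*args)."""
    start = time.perf_counter()
    func(*args)  # with numba, this call triggers JIT compilation
    first = time.perf_counter() - start

    start = time.perf_counter()
    func(*args)  # same argument types, so the compiled version is reused
    warm = time.perf_counter() - start
    return first, warm


first, warm = time_first_and_warm_call(half_sum, 1_000)
print(f"first call: {first:.6f}s, warm call: {warm:.6f}s")
```

The difference between the two numbers is a crude upper bound on compilation overhead for that one signature; it does not break the time down by compiler stage.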
Consider using [ruff](https://docs.astral.sh/ruff/) to simplify our code linting/formatting (i.e., replace black, flake8, isort). We don't have a huge codebase, so this may be pointless, but I wanted to add to...
[Currently](https://github.com/TDAmeritrade/stumpy/discussions/1043#discussioncomment-11176471), it is very hard to retrieve the raw matrix profile values from the Pan Matrix Profile object and the `PAN_` property only returns transformed (i.e., normalized, contrasted, binarized) matrix...
The `core.py` file was meant to act as a central utility module where multiple other modules could find common functions. Over time, it has become bloated and it makes sense...
In all of our GPU calculations, we never utilize "shared memory" and, instead, rely heavily on storing data in the (fast but limited) "on-chip" and "thread local" GPU registers. When...
Currently, the semantic IDs that are generated (4 integer values) appear to only have values between `[0,255]`. However, given that these are essentially a set of 4 tokens, don't we...
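To make the concern concrete, here is a minimal sketch of one common convention for keeping per-position codes distinct in a shared vocabulary: offset each code by its position times the codebook size. This is an assumption for illustration, not a description of what the repository does; `CODEBOOK_SIZE`, `NUM_LEVELS`, `to_flat_tokens`, and `from_flat_tokens` are hypothetical names.

```python
CODEBOOK_SIZE = 256  # assumption: each of the 4 values lies in [0, 255]
NUM_LEVELS = 4       # assumption: a semantic ID is a tuple of 4 codes


def to_flat_tokens(semantic_id):
    """Map per-level codes to unique token ids via level * CODEBOOK_SIZE + code."""
    if len(semantic_id) != NUM_LEVELS:
        raise ValueError(f"expected {NUM_LEVELS} codes, got {len(semantic_id)}")
    tokens = []
    for level, code in enumerate(semantic_id):
        if not 0 <= code < CODEBOOK_SIZE:
            raise ValueError(f"code {code} at level {level} is out of range")
        tokens.append(level * CODEBOOK_SIZE + code)
    return tokens


def from_flat_tokens(tokens):
    """Invert to_flat_tokens by removing each level's offset."""
    return [token - level * CODEBOOK_SIZE for level, token in enumerate(tokens)]


# The code 5 appears at levels 0 and 1 but maps to different token ids.
print(to_flat_tokens([5, 5, 200, 255]))  # → [5, 261, 712, 1023]
```

Under this scheme the flattened vocabulary has `NUM_LEVELS * CODEBOOK_SIZE` = 1024 entries, whereas without offsets the same integer at different positions would share one embedding.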
Currently, there appears to be a bug in the following function: https://github.com/snap-research/GRID/blob/4d8387382bf8bdfa7973a8d49feaf05e6a16b312/src/utils/inference_utils.py#L226-L257 as, during the initial embedding step, it warns (and eventually fails) with `Default process group has not been...
In the original TIGER paper, it was stated that: > In addition to the semantic codewords for items, we add user-specific tokens to the vocabulary. To keep the vocabulary size...
Hello and thank you for sharing this repository! In your [README](https://github.com/snap-research/GRID?tab=readme-ov-file#1-data-preparation), it says: > Prepare your dataset in the expected format: > > data/ > ├── train/ # training sequence...