Sean M. Law issues

Results 79 issues of


                                            Sean M. Law

Reduce Numba Compilation Time

It may be useful to reduce `numba` compilation time or, at the least, understand where the time is going. There were some initial comments [here](https://numba.discourse.group/t/reduce-compilation-time/969/9). This is NOT to be...

enhancement

Switch to Ruff

Consider using [ruff](https://docs.astral.sh/ruff/) to simplify our code linting/formatting (i.e., replace black, flake8, isort) We don't have a huge codebase so this may be pointless but I wanted to add to...

enhancement

testing

refactor

use case

Add New Class Property

[Currently](https://github.com/TDAmeritrade/stumpy/discussions/1043#discussioncomment-11176471), it is very hard to retrieve the raw matrix profile values from the Pan Matrix Profile object and the `PAN_` property only returns transformed (i.e., normalized, contrasted, binarized) matrix...

enhancement

Refactor core.py

The `core.py` file was meant to act as a central utility module where multiple other modules could find common functions. Over time, it has become bloated and it makes sense...

refactor

GPU Shared Memory

In all of our GPU calculations, we never utilize "shared memory" and, instead, rely heavily on storing data in the (fast but limited) "on-chip" and "thread local" GPU register. When...

enhancement

Shifting Semantic ID Values

Currently, the semantic IDs that are generated (4 integer values) appear to only have values between `[0,255]`. However, given that these are essentially a set of 4 tokens, don't we...

Bug in `src/utils/inference_utils.py`

Currently, there appears to be a bug in the following function: https://github.com/snap-research/GRID/blob/4d8387382bf8bdfa7973a8d49feaf05e6a16b312/src/utils/inference_utils.py#L226-L257 as, during the initial embedding step, it warns (and eventually fails) with `Default process group has not been...

TIGER User ID for Personalized Item Retrieval

In the original TIGER paper, it was stated that: > In addition to the semantic codewords for items, we add user-specific tokens to the vocabulary. To keep the vocabulary size...

Data Preparation

Hello and thank you for sharing this repository! In your [README](https://github.com/snap-research/GRID?tab=readme-ov-file#1-data-preparation), it says: > Prepare your dataset in the expected format: > > data/ > ├── train/ # training sequence...