
[Discussion] Handling user IDs with continuous training

Open · xiaoyaoyang opened this issue 3 years ago · 1 comment

I want to open a discussion here. The tutorial works well for a small dataset and a model trained in a single pass, but a real deployment needs more than that.

One thing that might be useful is the hashing trick: it maps each customer ID to one of a fixed number of category bins (say, 500K). The benefit is that the parameter set has a fixed size no matter how many new IDs appear. For continuous training, we could then warm-start from the previous model and train on the new samples.
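As a rough illustration of the idea, here is a minimal sketch in plain Python. It is not taken from the tutorial or from any library API: `hash_user_id`, `train`, and the one-scalar-per-bin "model" are hypothetical stand-ins, and MD5 is only one possible choice of stable hash. The point is that a previously unseen ID falls into an existing bin, so the saved parameters can be reused as-is.

```python
import hashlib

NUM_BINS = 500_000  # bin count from the post; an assumption to tune for real data


def hash_user_id(user_id, num_bins=NUM_BINS):
    """Map an arbitrary user ID to a stable bin index in [0, num_bins)."""
    digest = hashlib.md5(str(user_id).encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_bins


def train(weights, samples, lr=0.1):
    """One SGD pass over (user_id, target) pairs, fitting one scalar per bin.

    Toy stand-in for a real model: each bin's weight moves toward the target.
    """
    for user_id, target in samples:
        idx = hash_user_id(user_id, len(weights))
        weights[idx] += lr * (target - weights[idx])
    return weights


# Initial training run on the first batch of data.
weights = [0.0] * NUM_BINS
train(weights, [("user_1", 1.0), ("user_2", 0.0)])

# Later run: warm start from the previous weights and train on new samples only.
# "user_3" was never seen before, yet the parameter count is unchanged.
warm = list(weights)
train(warm, [("user_3", 1.0), ("user_1", 1.0)])
```

A stable hash is used here because Python's built-in `hash()` is salted per process for strings, so it would assign different bins across training runs. The trade-off of hashing is collisions: distinct IDs can share a bin and therefore share parameters.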

I'd like to hear what the community thinks of this approach, and any other ideas :)

xiaoyaoyang, Jun 28 '22 22:06

This is a very viable approach to continuous training with new ids entering the data - certainly worth trying!

maciejkula, Jul 12 '22 23:07