MicroLens
[Issue] Problems Encountered Running `run_video.py`
Description
I have successfully downloaded the MicroLens dataset and am currently running the code provided by the authors. However, I've encountered several issues during the execution.
Issues
- Clarification Needed on Dataset Selection: In the file `MicroLens/Code/VideoRec/SASRec/run_video.py`, there is a line: `max_video_no = 91717 # 34321 for 10wu, 91717 for 100wu`. I am unsure what "10wu" and "100wu" refer to. Which setting should I use for the MicroLens-100k dataset?
- Error Using `x3d-s` Video Model: When running the code with the `x3d-s` video model, strictly following the parameters specified in the paper, the following error is raised: `RuntimeError: input image (T: 5 H: 7 W: 7) smaller than kernel size (kT: 13 kH: 5 kW: 5)`. The error points to the pooling operation in the video model: the input reaching that layer appears to be smaller than the pooling kernel. How should I adjust the kernel size or the input dimensions?
- Instability in Metrics When Using `video-mae-base` Model: Using `video-mae-base` as the video model, I observed that the metrics `Hit10` and `nDCG10` fluctuate significantly during training and occasionally drop close to zero. What might be causing this instability, and how can it be resolved?
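To make the second issue concrete, here is a minimal sketch of the shape arithmetic as I understand it. It assumes the X3D backbone downsamples height and width by a total factor of 32 and does not downsample time, and that the final pooling layer uses the kernel reported in the error message. `head_feature_shape` and `pool_fits` are hypothetical helper names, not functions from the MicroLens code. Under those assumptions, the reported input (T: 5, H: 7, W: 7) would correspond to a clip of 5 frames at 224x224, while the kernel (13, 5, 5) would fit a clip of 13 frames at 160x160.

```python
import math

# Assumption: X3D reduces H and W by a total stride of 32 and keeps the
# temporal length unchanged. These values are not taken from the MicroLens code.
SPATIAL_STRIDE = 32
TEMPORAL_STRIDE = 1


def head_feature_shape(num_frames, crop_size):
    """Return the (T, H, W) shape that reaches the final pooling layer."""
    t = num_frames // TEMPORAL_STRIDE
    hw = math.ceil(crop_size / SPATIAL_STRIDE)
    return (t, hw, hw)


def pool_fits(feature_shape, kernel):
    """A pooling kernel fits only if it is no larger than the input in every dimension."""
    return all(f >= k for f, k in zip(feature_shape, kernel))


kernel = (13, 5, 5)  # kernel size reported in the error message

# Compare a 5-frame 224x224 clip with a 13-frame 160x160 clip.
for frames, crop in [(5, 224), (13, 160)]:
    shape = head_feature_shape(frames, crop)
    print(frames, crop, shape, pool_fits(shape, kernel))
```

If this reading is right, either the clip length and crop size need to match what the pooling kernel expects, or the pooling layer needs to be adapted to the smaller input. I would appreciate confirmation of which one is intended.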
Request for Assistance
I would appreciate any insights or recommendations on addressing these issues, especially with the right dataset settings for MicroLens-100k, the handling of input dimensions for the `x3d-s` model, and strategies for stabilizing training metrics with the `video-mae-base` model.
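For reference when discussing the metric instability, this is how I understand `Hit10` and `nDCG10` for a single held-out target item. It is a minimal sketch rather than the repository's evaluation code, and it assumes one relevant item per user and a 1-based rank. `hit_at_k` and `ndcg_at_k` are hypothetical names.

```python
import math


def hit_at_k(rank, k=10):
    """Return 1.0 if the target item is ranked within the top k (rank is 1-based)."""
    return 1.0 if rank <= k else 0.0


def ndcg_at_k(rank, k=10):
    """nDCG with a single relevant item: 1 / log2(rank + 1) inside the top k, else 0."""
    return 1.0 / math.log2(rank + 1) if rank <= k else 0.0


# Made-up ranks of the target item for four hypothetical users.
ranks = [1, 3, 12, 250]
hit10 = sum(hit_at_k(r) for r in ranks) / len(ranks)
ndcg10 = sum(ndcg_at_k(r) for r in ranks) / len(ranks)
print(f"Hit10={hit10:.4f} nDCG10={ndcg10:.4f}")
```

Under this definition, values approaching zero would mean that almost no target item is ranked in the top 10 at those checkpoints.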
Thank you very much!