pytorch-video-recognition icon indicating copy to clipboard operation
pytorch-video-recognition copied to clipboard

PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.

Results 48 pytorch-video-recognition issues
Sort by recently updated
recently updated
newest added
trafficstars

There is a bug about `normalize(self, buffer)` function in `dataset.py`, it has not normalize data to [0, 1], which we usually do this in Deep Learning training process with PyTorch....

This project is very interesting, thank you bloggers for sharing. Is this model really trained on ucf101? Why is it labeled 487?

Hi! Thanks for the repo, I've recently implemented your model, and I was wondering if you could tell what dataset was used to obtain the pretrained weights? Thanks

Your work is rather prefect! I got the train Acc 0.9987220447284345,val acc 0.9923857868020304 and test acc 0.9851063829787234,It seems too good to be true,can you check the result whether these results...

Sorry to bother! I used your pretrained model to extract video features on HMDB51 dataset, However, I find that every video has similar features, each dimension about the value 0.7.

When I run the train.py occur this problem, anyone can solve this? Traceback (most recent call last): File "/home/common1/huangjing/MyCode/PythonCode/pytorch-video-recognition/train.py", line 201, in train_model() File "/home/common1/huangjing/MyCode/PythonCode/pytorch-video-recognition/train.py", line 138, in train_model loss.backward()...

Hello @jfzhang95 , thanks for your code firstly. I'm trying to train C3D from scratch on my own ucf101 style dataset. I changed ucf101 config from 101 to 2 &...

I run inference.py with pretrained model: Traceback (most recent call last): File "inference.py", line 78, in main() File "inference.py", line 32, in main model.load_state_dict(checkpoint['state_dict']) KeyError: 'state_dict'

Hello @jfzhang95 , thanks for your share of c3d implement. When I inference with trained c3d model, I notice that you did some process to central croped frame. listed here:...

video is very lag with pretrained model in inference.py in any av