PaddleVideo
Call for suggestions
This issue is for collecting suggestions. You can:
1. Suggest a new feature by leaving a comment.
2. Vote for a feature request with +1, or against it with -1.
3. Tell us that you would like to help implement one of the features in the list.
- Optimize some video models, e.g. PP-TSM, through pruning, quantization, knowledge distillation, or a better backbone, to achieve a better balance between effectiveness (Top-1) and efficiency (VPS).
- Release tools for some video datasets, e.g. for downloading Kinetics-400 and YouTube-8M.
- Release some applications using video models, e.g. video tagging, highlight snippet extraction, security monitoring, video retrieval, student behavior analysis in the classroom, and so on.
- Support X3D and Multi-Grid by FAIR
- Add technical analysis docs for some video models, e.g. TSN, TSM, SlowFast, X3D, Multi-Grid, and so on.
- Add more lightweight video models and optimization strategies.
- Support more backbones and some pretrained backbone models for video.
- Support the AVA dataset and spatio-temporal action detection models.
- Speed up video decoding with DALI.
- Support distributed training for video models.
- Add ECO: Efficient Convolutional Network for Online Video Understanding, ECCV 2018
- Add 3D ResNet: Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs, CVPR 2018
- Add TPN: Temporal Pyramid Network for Action Recognition, CVPR 2020
- Add EvaNet: Evolving Space-Time Neural Architectures for Videos, ICCV 2019
- Add RepFlow: Representation Flow for Action Recognition, CVPR 2019
- Add MARS: Motion-Augmented RGB Stream for Action Recognition, CVPR 2019
- Add StNet: Local and Global Spatial-Temporal Modeling for Human Action Recognition, AAAI 2019
- Add Attention Cluster: Purely Attention Based Local Feature Integration for Video Classification
- Add NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification
- Add C-TCN: action localization model by Baidu, the champion model of ActivityNet 2018
Maybe provide a readthedocs.io version of the documentation? It would be easier to read and navigate than the current version, which lives inside the GitHub repo.
We have received your suggestion. We will evaluate the demand and determine the R&D schedule as soon as possible.
Can you provide a video prompt ipynb for detection and tracking + SlowFast for the AVA dataset?