PyVideoResearch
PyVideoResearch copied to clipboard
I3D AVA input full image?
Hi, Could someone point me where the box information is used in I3D AVA. I think the whole image is being used (bounding box is not used any where)? Am I correct?
Regards, Srikanth
You mean for evaluation? The bounding boxes (that have arbitrary dimensions) are loaded under the meta
field:
https://github.com/gsig/PyVideoResearch/blob/46307b1a03ce670696297e2154ddee6f4e6b0b8a/models/criteria/maskrcnn_criterion.py#L46
This is then returned as score_targets
which is used for evaluation by the 'frcnn metric`:
https://github.com/gsig/PyVideoResearch/blob/46307b1a03ce670696297e2154ddee6f4e6b0b8a/metrics/frcnn_metric6.py#L41
The ava baseline is very much in development, so pull requests are welcome!