
I3D AVA input full image?

Open: srikanthmalla opened this issue 5 years ago • 1 comment

Hi, could someone point me to where the box information is used in I3D AVA? It looks to me like the whole image is being used and the bounding box is not used anywhere. Am I correct?

Regards, Srikanth

srikanthmalla, Jul 14 '19 06:07

Do you mean for evaluation? The bounding boxes (which can have arbitrary dimensions) are loaded under the meta field: https://github.com/gsig/PyVideoResearch/blob/46307b1a03ce670696297e2154ddee6f4e6b0b8a/models/criteria/maskrcnn_criterion.py#L46

These are then returned as score_targets, which the 'frcnn metric' uses for evaluation: https://github.com/gsig/PyVideoResearch/blob/46307b1a03ce670696297e2154ddee6f4e6b0b8a/metrics/frcnn_metric6.py#L41
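For readers unfamiliar with box-based evaluation, here is a minimal, generic sketch of how predicted boxes can be compared against ground-truth boxes by intersection-over-union (IoU). It is not the repository's code: the function names `box_iou` and `match_predictions`, the `(x1, y1, x2, y2)` box layout, and the 0.5 threshold are all assumptions made for illustration, and the linked metric may well do this differently.

```python
def box_iou(a, b):
    """IoU of two boxes given as (x1, y1, x2, y2). Hypothetical helper."""
    # Corners of the intersection rectangle.
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp to zero when the boxes do not overlap.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def match_predictions(pred_boxes, gt_boxes, threshold=0.5):
    """For each predicted box, return the index of the ground-truth box
    with the highest IoU at or above `threshold`, or None if there is none.
    Assumed threshold; this sketch does not enforce one-to-one matching."""
    matches = []
    for p in pred_boxes:
        best_idx, best_iou = None, threshold
        for i, g in enumerate(gt_boxes):
            iou = box_iou(p, g)
            if iou >= best_iou:
                best_idx, best_iou = i, iou
        matches.append(best_idx)
    return matches


if __name__ == "__main__":
    gt = [(0, 0, 2, 2)]
    preds = [(0, 0, 2, 2), (10, 10, 12, 12)]
    print(match_predictions(preds, gt))
```

A full detection metric would additionally sort predictions by score and allow each ground-truth box to be matched only once; the sketch leaves that out to keep the box comparison itself visible.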

The AVA baseline is still very much in development, so pull requests are welcome!

gsig, Jul 29 '19 19:07