TRACE
TRACE copied to clipboard
Performance very different to Action Genome baselines
Thanks for sharing the nice work!
But I find the performance presented in the paper is very different with methods in "Action Genome: Actions as Compositions of Spatio-temporal Scene Graphs" and "detecting human-object relationships in videos". What are the causes for this?