densecap icon indicating copy to clipboard operation
densecap copied to clipboard

Question about the `recall` and `recall area`

Open textcunma opened this issue 4 years ago • 2 comments

I tried the code in this repo and got the result proposal recall area 41.133031, which is much lower than the Average Recall value (52.95, from Tab. 4) shown in your paper. Meanwhile, the Recall value reported in Average across all tIoUs was 78.6760 which seems to be more reasonable. So I'm wondering which one is the Average Recall?

textcunma avatar Jul 30 '21 03:07 textcunma

As how I understand it, the 52.95 from the paper are only for the Event proposal part and not for the captioning results on learned event proposals. You also did the evaluation for this part?

I got a Recall of 87.2372 for the captioning part.

josch14 avatar Jul 30 '21 15:07 josch14

The following is the output from the trained masked transformer model:

--------------------------------------------------------------------------------
Average across all tIoUs
--------------------------------------------------------------------------------
| Bleu_1: 10.8643
| Bleu_2: 5.1233
| Bleu_3: 2.5911
| Bleu_4: 1.2892
| METEOR: 6.7267
| ROUGE_L: 10.3282
| CIDEr: 9.6086
| Recall: 78.6760
| Precision: 34.1658

[INIT] Loaded annotations from validation subset.
	Number of ground truth instances: 16654
	Number of proposals: 550392
	Fixed threshold for tiou score: [0.5  0.55 0.6  0.65 0.7  0.75 0.8  0.85 0.9  0.95]
[RESULTS] Performance on ActivityNet proposal task.
	Area Under the AR vs AN curve: 41.13303110363877%
proposal recall area: 41.133031

And the following is the output from the trained end-to-end masked transformer model:

--------------------------------------------------------------------------------
Average across all tIoUs
--------------------------------------------------------------------------------
| Bleu_1: 10.1033
| Bleu_2: 4.8944
| Bleu_3: 2.4937
| Bleu_4: 1.2244
| METEOR: 6.0311
| ROUGE_L: 8.6687
| CIDEr: 9.31356
| Recall: 91.4836
| Precision: 25.6057

[INIT] Loaded annotations from validation subset.
	Number of ground truth instances: 16654
	Number of proposals: 1064099
	Fixed threshold for tiou score: [0.5  0.55 0.6  0.65 0.7  0.75 0.8  0.85 0.9  0.95]
[RESULTS] Performance on ActivityNet proposal task.
	Area Under the AR vs AN curve: 45.02161042392217%
proposal recall area: 45.021610

As you can see, there is a Recall (in the Average across all tIoUs section) and a proposal recall area (at the last line) in the output. Which one is the value 87.2372 you talked about?

textcunma avatar Aug 06 '21 07:08 textcunma