Multimodal-End2end-Sparse can't reproduce the paper's performance

can't reproduce the paper's performance

Open JYeonKim opened this issue 1 year ago • 1 comments

Hi, Thank you very much for the code you shared.

I tried to run the code with the following arguments:

python main.py -lr=5e-5 -ep=40 -mod=tav -bs=8 --img-interval=500 --early-stop=6 --loss=bce --cuda=3 --model=mme2e_sparse --num-emotions=6 --trans-dim=64 --trans-nlayers=4 --trans-nheads=4 --text-lr-factor=10 -st=0.7 --text-model-size=base --text-max-len=100 --dataset="iemocap"

The papers syas that we get an accuracy of 84.4% but the best accuracy that I am getting is 83.0% on the IEMOCAP dataset. Could you please put the exact arguments to reproduce the results of the papers? How can I get the results of the papers?

(I think "p" equals "st" argument. So I set "st=0.7")

ACC	ang	exc	fru	hap	neu	sad	average
paper(p=0.7)	88.2	88.3	74.9	89.5	77.0	88.6	84.4
me(p=0.7)	88.8	80.6	77.9	90.4	74.5	86.0	83.0
me(p=0.9)	88.3	87.5	76.7	90.2	72.1	90.6	84.2

best regards, Juyeon Kim

Jan 12 '23 02:01 JYeonKim

Multimodal-End2end-Sparse Multimodal-End2end-Sparse copied to clipboard

can't reproduce the paper's performance

Multimodal-End2end-Sparse
Multimodal-End2end-Sparse copied to clipboard