mTAN
mTAN copied to clipboard
Experimental Setup on Activity dataset
In your results table, there is a significant difference between the per-time-point classification accuracies of the Activity dataset, for the different RNN and ODE methods, compared to the results presented in the Latent-ODE paper (https://arxiv.org/pdf/1907.03907.pdf). For instance, you mention an Acc of 88.5% of ODE-RNN while 82.9% is mentioned in the original paper. Is there any difference in the experimental setup/metrics you follow? thanks.