SaProt
SaProt copied to clipboard
More evaluation metrics
Hey, Could you provide more rigorous metrics, such as AUROC and AUPRC, rather than ACC for the classification tasks (such as HumanPPI or DeepLoc)? I think it can help to compare SaProt to other baselines better. Thank you!
Hi,
It's good to include more metrics to comprehensively evaluate the performance of different models! Since the results were recorded around 2 years ago, we didn't save all model checkpoints for that long time, and rerun the experiments requires a lot of computational resources. Nevertheless, following existing benchmarks, we think ACC is a good metric to reflect models' performance :)