opencompass
add daily test case
Add a daily test that can run more models and datasets. A check returns true if the score is not null and lies between baseline * 0.97 and baseline * 1.03. PS: the test uses the newest PyTorch version. The PR test does not create a new conda environment, to improve stability.
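
The pass condition above can be sketched as follows. This is a minimal illustration, not the code in this PR: the function name `score_within_tolerance` and the `tolerance` parameter are hypothetical, and treating the bounds as inclusive is an assumption, since the description only says "between".

```python
from typing import Optional


def score_within_tolerance(score: Optional[float], baseline: float,
                           tolerance: float = 0.03) -> bool:
    """Return True if score is not None and lies within baseline * (1 +/- tolerance).

    Hypothetical helper; inclusive bounds are an assumption.
    """
    if score is None:
        # A null score always fails the check.
        return False
    lower = baseline * (1 - tolerance)
    upper = baseline * (1 + tolerance)
    # Order the bounds so a negative baseline still gives a valid interval.
    lower, upper = min(lower, upper), max(lower, upper)
    return lower <= score <= upper
```

With a baseline of 50.0, a score of 51.0 would pass and a score of 48.0 would fail, because the accepted range is roughly 48.5 to 51.5.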