lightseq
Is it normal that A10 inference speed is lower than 2080ti?
Hello, I tested Transformer-base inference speed on different devices. Strangely, the A10 is slower than the 2080 Ti.
MODEL: Transformer-base
DATA: fp16
SPEED (number of src characters / second):
3090: 7.5k/s
2080: 4.5k/s
A10: 2.0k/s
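For anyone trying to reproduce or compare numbers like these, here is a minimal sketch of how a "src characters / second" figure could be measured. It is not the script used for the results above. `chars_per_second`, `infer`, and `sync` are hypothetical names: `infer` stands for whatever callable runs your model on a batch of source strings, and `sync` is an optional callable that waits for pending GPU work (with PyTorch, `torch.cuda.synchronize` could be passed). Warm-up batches are excluded from timing, since the first few calls often include one-time initialization cost.

```python
import time


def chars_per_second(infer, batches, warmup=3, sync=None):
    """Return throughput in source characters per second.

    infer   -- hypothetical callable that runs inference on one batch (list of str)
    batches -- list of batches, each a list of source strings
    warmup  -- number of leading batches to run untimed before measuring
    sync    -- optional callable that blocks until queued GPU work is finished
    """
    # Untimed warm-up so one-time setup does not skew the result.
    for batch in batches[:warmup]:
        infer(batch)
    if sync is not None:
        sync()

    total_chars = sum(len(s) for batch in batches for s in batch)

    start = time.perf_counter()
    for batch in batches:
        infer(batch)
    if sync is not None:
        sync()  # make sure asynchronous GPU kernels are included in the timing
    elapsed = time.perf_counter() - start

    return total_chars / elapsed


if __name__ == "__main__":
    # Stand-in for a real model call, only here to make the sketch runnable.
    def fake_infer(batch):
        time.sleep(0.005)
        return [s.upper() for s in batch]

    demo_batches = [["hello world", "how are you"]] * 20
    print(f"{chars_per_second(fake_infer, demo_batches):.0f} chars/s")
```

Running the same batches, batch size, and precision on each card, and reporting the GPU driver and CUDA versions alongside the result, makes the numbers easier to compare across devices.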
Me too. Can anyone help?