Jiun-Hao Jhan

Results 2 comments of Jiun-Hao Jhan

Thanks for your reply. How many GPUs do you use?

Sorry for not expressing clearly. How many GPUs do you use for inference in 100ms per token? I'd like to figure out the configuration, like the machine, models, and number...