chenglimin

Results 11 comments of chenglimin

For Falcon-40B, when you compare vLLM in Figure 18, it is mentioned in the paper that vLLM is compared on a single card A100(80G) GPU, but when Falcon-40B (Float16) is...