CogVLM

Published 20 hours ago


Why is it that running inference on two cards is slower than running on a single card?

Open Leeon-K opened this issue 6 months ago • 0 comments

Model: cogvlm-chat-v1.1. Machine: H800. Why is running inference on two cards slower than running on a single card?

Aug 07 '24 05:08 Leeon-K