Anima
Would adding Parallelism speed up AirLLM?
Hello, I can't help but ask whether you have ever tried implementing any parallelism strategies in this program to speed up inference, particularly how quickly it can process through the model. From looking at the code itself, I can't find a strategy that would suit AirLLM, but I'm fairly convinced it would make an impact on layer-loading or inference speed.
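To illustrate the kind of parallelism I'm imagining: since AirLLM loads layers from disk one at a time, one option might be prefetching the next layer's weights in a background thread while the current layer computes. The sketch below is purely hypothetical — `load_layer` and `run_layer` are simulated placeholders, not AirLLM's actual APIs — but it shows the overlap pattern:

```python
import threading
import time

def load_layer(i):
    # Placeholder for loading layer i's weights from disk
    # (hypothetical stand-in, not AirLLM's real loader).
    time.sleep(0.01)
    return f"weights_{i}"

def run_layer(weights, x):
    # Placeholder forward pass through one layer.
    time.sleep(0.01)
    return x + 1

def sequential(num_layers, x):
    # Baseline: load each layer, then compute, strictly in order.
    for i in range(num_layers):
        w = load_layer(i)
        x = run_layer(w, x)
    return x

def pipelined(num_layers, x):
    # Overlap: while layer i computes, a background thread
    # prefetches layer i + 1 from disk.
    loaded = {}

    def prefetch(i):
        loaded[i] = load_layer(i)

    t = threading.Thread(target=prefetch, args=(0,))
    t.start()
    for i in range(num_layers):
        t.join()                    # wait for layer i's weights
        w = loaded.pop(i)
        if i + 1 < num_layers:      # kick off the next load early
            t = threading.Thread(target=prefetch, args=(i + 1,))
            t.start()
        x = run_layer(w, x)         # compute while the load runs
    return x
```

If disk I/O and compute take comparable time, this kind of overlap could hide much of the loading latency, while producing the same result as the sequential path. I'd be curious whether something along these lines was considered.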