PowerInfer icon indicating copy to clipboard operation
PowerInfer copied to clipboard

Source for v2 (mobile inference engine)

Open peeteeman opened this issue 1 year ago • 9 comments

Hello there!

I came across the v2 paper yesterday, and saw the updates on the project readme.

I am interested in porting the v2 framework to iOS. The goal is to complete a naive port at first, and then include metal shaders.

Any plans on releasing the source and instructions for running v2 on Android?

peeteeman avatar Jun 12 '24 00:06 peeteeman

please release PowerInfer-2 so that it can be tested on low resource PCs (like llama.cpp) for a comparison.

0wwafa avatar Jun 12 '24 12:06 0wwafa

PowerInfer-2 will be open-sourced in the future. We’re refining it to untangle from our testing platform and making it accessible on PCs for the community.

jeremyyx avatar Jun 13 '24 02:06 jeremyyx

Can't wait to test your amazing work!

sqzhang-jeremy avatar Jun 13 '24 06:06 sqzhang-jeremy

Can't wait to test your amazing work!

same here! I wish to test it on low resource pc with no gpu or an old and small one.

0wwafa avatar Jun 13 '24 17:06 0wwafa

This is fantastic!, on my old smartphone with 6 Gb of memory the Meta-Llama-3-8B-Instruct-Q4_K_M.gguf model ran, I hope for v2 in the near future.

UUSR avatar Jun 14 '24 07:06 UUSR

when can i use it on anroid phone ?

Stephen888888 avatar Jun 19 '24 05:06 Stephen888888

PowerInfer-2 will be open-sourced in the future. We’re refining it to untangle from our testing platform and making it accessible on PCs for the community.

Is it possible you could release the testing platform and the code entangled with the testing platform, so that the reported results can be reproduced?

ethanc8 avatar Jun 22 '24 13:06 ethanc8

@YixinSong-e 想问一下什么时候会开源 Powerinfer-2 呢,从你们的论文看效果很好,但是如果不能复现那也是令人怀疑这个结果

dengwhao avatar Jan 10 '25 06:01 dengwhao

any update?

xiaoxiaosuaxuan avatar May 30 '25 04:05 xiaoxiaosuaxuan