PowerInfer icon indicating copy to clipboard operation
PowerInfer copied to clipboard

Feature request : Support for PHI3 mini

Open raymond-infinitecode opened this issue 1 year ago • 0 comments

Prerequisites

Before submitting your issue, please ensure the following:

  • [ ] I am running the latest version of PowerInfer. Development is rapid, and as of now, there are no tagged versions.
  • [ ] I have carefully read and followed the instructions in the README.md.
  • [ ] I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).

Feature Description

PHI3 mini is currently the most powerful SLM yet, but can we relu it to make it fast so a single Xeon server can serve hundreds of concurrent users with relu implementation ?

Motivation

Please provide a detailed written description of reasons why this feature is necessary and how it is useful to PowerInfer users.

Possible Implementation

Convert the Phi3 model to relu model

raymond-infinitecode avatar Jul 14 '24 07:07 raymond-infinitecode