AITemplate [FEATURE] Support for NVIDIA T4 (Turing Architecture)

[FEATURE] Support for NVIDIA T4 (Turing Architecture)

Open philschmid opened this issue 3 years ago • 2 comments

Hello 🙋🏻‍♂️

It is very cool to see MetaAI going into inference optimization! This will help the community and companies to much long term speaking! While reading through the announcement blog post i noticed that

AITemplate is currently enabled on NVIDIA's A100 and AMD’s MI200 GPU systems, both of which are widely used today in data centers from technology companies, research labs, and cloud computing service providers.

Which awesome but might be a big limitation for many since A100 is still not very accessible. Having support for NVIDIA T4 (Turing), which is most widely available GPU in public clouds would be very helpful.

Oct 04 '22 07:10 philschmid

AITemplate is coming from Meta production needs, we don't have T4/V100 so in our first release we didn't consider about this. We will help to pass the voice to NVIDIA to see whether they can help.

Oct 04 '22 18:10 antinucleon

Hi @philschmid. There's another open source inference acceleration library called voltaML, which gives support for T4. Please check it out

https://github.com/VoltaML/voltaML

Oct 29 '22 05:10 harishprabhala

AITemplate AITemplate copied to clipboard

[FEATURE] Support for NVIDIA T4 (Turing Architecture)

AITemplate
AITemplate copied to clipboard