AITemplate
AITemplate copied to clipboard
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
Hi, I was wondering ... is there any easy way to use xformers with the AITemplate in stable diffusion? Since it lowers memory consumption, we can infer with resolution north...
Hello 🙋🏻♂️ It is very cool to see MetaAI going into inference optimization! This will help the community and companies to much long term speaking! While reading through the [announcement...
I referred official tutorial but I can't find the automatic conversion from PyTorch module to AIT module like ONNX->TensorRT. https://facebookincubator.github.io/AITemplate/tutorial/how_to_infer_pt.html#define-a-pytorch-module Is there any parser or converter building AIT module from...
It would be nice to be able to use different heights and widths up to 1024x1024.
I built a torchserve docker image on top of AITemplate docker. The demo code works fine in my docker. However, when I pack the AITemplate SD model to torchserve archiver...
When I compile a model that had `nn.Conv2dBiasFewChannels` I get this error: ```text Profile: conv2d_bias_few_channels_1: NI == 1 && HI == 384 && WI == 384 && CI == 4...
## summary I use AITemplate to re-construct a diffusion model which is slightly different than the one in examples, but error occurs while call compile_model(). Since it says some conv2d_bias...
This is a full repro: ```python import torch import numpy as np from collections import OrderedDict from aitemplate.testing import detect_target from aitemplate.frontend import nn, Tensor from aitemplate.compiler import compile_model def...
This is the UnitTest ```python import unittest import torch from aitemplate.compiler import compile_model, ops from aitemplate.frontend import Tensor from aitemplate.testing import detect_target class ConvGroupTestCase(unittest.TestCase): def test_fp16(self): groups = 256 #...