AITemplate icon indicating copy to clipboard operation
AITemplate copied to clipboard

AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

Results 169 AITemplate issues
Sort by recently updated
recently updated
newest added

Hi, I was wondering ... is there any easy way to use xformers with the AITemplate in stable diffusion? Since it lowers memory consumption, we can infer with resolution north...

Hello 🙋🏻‍♂️ It is very cool to see MetaAI going into inference optimization! This will help the community and companies to much long term speaking! While reading through the [announcement...

I referred official tutorial but I can't find the automatic conversion from PyTorch module to AIT module like ONNX->TensorRT. https://facebookincubator.github.io/AITemplate/tutorial/how_to_infer_pt.html#define-a-pytorch-module Is there any parser or converter building AIT module from...

It would be nice to be able to use different heights and widths up to 1024x1024.

I built a torchserve docker image on top of AITemplate docker. The demo code works fine in my docker. However, when I pack the AITemplate SD model to torchserve archiver...

When I compile a model that had `nn.Conv2dBiasFewChannels` I get this error: ```text Profile: conv2d_bias_few_channels_1: NI == 1 && HI == 384 && WI == 384 && CI == 4...

## summary I use AITemplate to re-construct a diffusion model which is slightly different than the one in examples, but error occurs while call compile_model(). Since it says some conv2d_bias...

This is a full repro: ```python import torch import numpy as np from collections import OrderedDict from aitemplate.testing import detect_target from aitemplate.frontend import nn, Tensor from aitemplate.compiler import compile_model def...

This is the UnitTest ```python import unittest import torch from aitemplate.compiler import compile_model, ops from aitemplate.frontend import Tensor from aitemplate.testing import detect_target class ConvGroupTestCase(unittest.TestCase): def test_fp16(self): groups = 256 #...