AITemplate
AITemplate is a Python framework which renders neural networks into high-performance CUDA/HIP C++ code. It is specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
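For orientation, here is a minimal sketch of the typical workflow, loosely following the pattern used in the AITemplate docs and examples (the model and tensor names are illustrative): define a graph with the Python frontend, mark inputs and outputs, and call compile_model to generate and build the GPU code.

```python
from aitemplate.compiler import compile_model
from aitemplate.frontend import nn, Tensor
from aitemplate.testing import detect_target

# A toy two-layer MLP expressed with AITemplate's Python frontend.
class ToyModel(nn.Module):
    def __init__(self, hidden=512):
        super().__init__()
        self.fc1 = nn.Linear(hidden, hidden)
        self.fc2 = nn.Linear(hidden, hidden)

    def forward(self, x):
        return self.fc2(self.fc1(x))

model = ToyModel()
model.name_parameter_tensor()  # give the weights stable names for binding constants later

# fp16 input: AITemplate targets FP16 TensorCore/MatrixCore inference
x = Tensor(shape=[8, 512], dtype="float16", name="input0", is_input=True)
y = model(x)
y._attrs["is_output"] = True
y._attrs["name"] = "output0"

# Generates, builds, and loads the CUDA/HIP C++ module for the detected GPU target
module = compile_model(y, detect_target(), "./tmp", "toy_model")
```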
Run steps:
```
# build docker image
./docker/build.sh cuda
# run docker
docker run -it --gpus=all ait:latest bash
# run scripts
cd /AITemplate/examples/05_stable_diffusion
python3 scripts/download_pipeline.py
python3 scripts/compile.py
```
Error log: ...
I use AITemplate for Stable Diffusion inference and compilation succeeds, but the img2img result differs from the PyTorch diffusers img2img result. The two results are broadly similar, but there are differences...
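To pin down how large the discrepancy actually is, one option is to compare the two outputs numerically rather than visually; some difference is expected simply from fp16 TensorCore kernels versus the PyTorch implementation. A rough sketch (file names are hypothetical, and both images are assumed to come from the same prompt, seed, and init image):

```python
import numpy as np
from PIL import Image

# Hypothetical output files from the AITemplate and diffusers img2img pipelines
ait = np.asarray(Image.open("ait_img2img.png"), dtype=np.float32)
ref = np.asarray(Image.open("diffusers_img2img.png"), dtype=np.float32)

diff = np.abs(ait - ref)
print("mean abs diff (0-255 scale):", diff.mean())
print("max abs diff  (0-255 scale):", diff.max())
```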
When will these two features be finished? 1) Quantization: fp8/int8/int4. 2) Sparsity pruning for GEMM. I am very much looking forward to them :)
Any plan to support an attention mask in BERT?
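For context, the mask being asked about is the usual additive key-padding mask applied to the attention scores before softmax. In plain PyTorch (not AITemplate) it looks roughly like this sketch:

```python
import torch

def masked_attention(q, k, v, key_padding_mask):
    # q, k, v: [batch, heads, seq, head_dim]; key_padding_mask: [batch, seq], True = padded
    scores = q @ k.transpose(-2, -1) / (q.shape[-1] ** 0.5)     # [batch, heads, seq, seq]
    # Padded key positions get a large negative bias so softmax gives them ~0 weight
    bias = key_padding_mask[:, None, None, :].to(scores.dtype) * -1e4
    probs = torch.softmax(scores + bias, dim=-1)
    return probs @ v

q = k = v = torch.randn(2, 4, 8, 16)
mask = torch.zeros(2, 8, dtype=torch.bool)
mask[:, 6:] = True  # last two tokens of each sequence are padding
out = masked_attention(q, k, v, mask)
```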
When I try to compile a simple convolution network, the compilation process crashes because conv2d.attrs_['op_instance'] is empty for the convolution layer. How can I fix it? The behaviour can be reproduced by this...
Attempting to run the 04_vit benchmarking tool (benchmark_ait.py) hits an error when trying to load an op instance for one of the conv ops.
```
Traceback (most recent call last):
...
```
### Is your feature request related to a problem? Please describe.
I would like to request the implementation of a compressed tiled matrix multiply operator for use in large language...
Hi! It would be great to have an example (like the ones for SD, Detectron2, and ResNet) for the MiDaS depth estimation model.
Hi, I use the diffusion depth2img model, which needs vae.encode. So I want to convert the diffusion vae.encode into an AIT model. However, the torch.nn.functional.pad function is required in...
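One workaround sometimes used when a dedicated pad op is unavailable is to express zero padding as concatenation with zero tensors. The sketch below shows the equivalence in plain PyTorch; whether this maps cleanly onto the AIT graph depends on which ops the frontend exposes for your model:

```python
import torch
import torch.nn.functional as F

# Zero-padding the spatial dims rewritten as concatenation with zero tensors.
x = torch.randn(1, 3, 64, 64)

padded_ref = F.pad(x, (1, 1, 1, 1))  # pad the last two dims (W, then H) by 1 on each side

zeros_w = torch.zeros(1, 3, 64, 1)
y = torch.cat([zeros_w, x, zeros_w], dim=3)       # pad width
zeros_h = torch.zeros(1, 3, 1, y.shape[3])
y = torch.cat([zeros_h, y, zeros_h], dim=2)       # pad height

assert torch.equal(y, padded_ref)
```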