AITemplate issues

Fix codegen condition check issue

2

Summary: Should check whether key present in dict, not whether dict is empty. Reviewed By: muchulee8 Differential Revision: D45759517

wushirong

CLA Signed

fb-exported

changes to bias module and sha/mha pass to adapt to removing presences

2

Summary: as titled The removal details are in D45632164 Reviewed By: jiaqizhai Differential Revision: D45644413 Privacy Context Container: L1138451

frank-wei

CLA Signed

fb-exported

Slow nn.Linear on MI250

2

Hi there, I tried to benchmark the performance of `nn.Linear` in AI Template on MI250 GPU and compared with rocBLAS. I expected AI Template should achieve a much higher throughput,...

comaniac

Summary: Now we can set LowerPrecision=BF16 during ads publish pipeline. However, this setting won't change the packaged sample_input's dtype, thus AIT lower pipeline would hit this error during lowering: ```...

wushirong

CLA Signed

fb-exported

fix bf16 lowering

1

Summary: 1. when enable bf16, `torch.ops.fbgemm.generic_histogram_binning_calibration_by_feature` in submod1 does not take bf16. So we need to cast its input to fp32 2. nan_to_num could handle bf16 now Differential Revision: D45421503

frank-wei

CLA Signed

fb-exported

python scripts/compile.py error

1

ake: Leaving directory '/data/bml/tool/AITemplate/examples/05_stable_diffusion/tmp/profiler' 2023-04-24 12:55:25,551 INFO make stderr: /usr/include/c++/11/bits/std_function.h:435:145: error: parameter packs not expanded with ‘...’: 435 | function(_Functor&& __f) | ^ /usr/include/c++/11/bits/std_function.h:435:145: note: ‘_ArgTypes’ /usr/include/c++/11/bits/std_function.h:530:146: error: parameter packs...

baimingl

Update fuse_split_linear_add

5

Summary: Currently, `fuse_split_linear_add` only supports cases when the split op kwarg uses a `slice`. This diff extends the fusion to support cases when the split op kwarg uses `int`s. The...

JiaJiunn

CLA Signed

fb-exported

Correct jagged total_length's upper bound

7

Summary: If the upper bound of the `total_length` dimension is set to a larger value than B * N (N being the logical max. sequence length), this would not change...

aakhundov

CLA Signed

fb-exported

Avoid WSL requirement on Windows?

5

I realize that you probably require the make tool (https://github.com/facebookincubator/AITemplate/issues/83#issuecomment-1312794318) which is only available proper in WSL, but on AMD platforms we do not support WSL with ROCm on Windows,...

jammm

Make imports more explicit to fix issues with lazy imports

2

Differential Revision: D45186695

brittanyrey

CLA Signed

fb-exported

AITemplate
AITemplate copied to clipboard

Metadata

Fix codegen condition check issue

changes to bias module and sha/mha pass to adapt to removing presences

Slow nn.Linear on MI250

Fix gap in ads model BF16

fix bf16 lowering

python scripts/compile.py error

Update fuse_split_linear_add

Correct jagged total_length's upper bound

Avoid WSL requirement on Windows?

Make imports more explicit to fix issues with lazy imports

← Metadata

Owner

Metadata

AITemplate AITemplate copied to clipboard

Metadata

← Metadata

Owner

Metadata

AITemplate
AITemplate copied to clipboard