tvm issues

[LLVM][RUNTIME] Add optional LLVM ORCJIT runtime executor

1

This PR introduce a new LLVM ORC JIT executor for the runtime. The new ORCJIT may obsolete MCJIT due to better upstream maintenance. --- #### Changes * Old MCJIT is...

cbalint13

[TOPI][Testing] Enable conv2d NHWC fp16 topi testing for `arm_cpu`

This commit adds fp16 test cases to the conv2d NHWC TOPI schedules for `arm_cpu`. Following the example of #8529, the numpy reference conv2d output is computed in fp32 instead of...

Anndrey24

[DLight] Update Adreno GEMV Rules

When reduction axis is small, it's not necessary to use rfactor. This PR updates the gemv rule to use rfactor only when the reduction axis is large enough.

Hzfengsy

Support multinomial_from_uniform dispatch

5

This PR aims to support backend dispatching for `multinomial_from_uniform`, which includes: - Relax Op `multinomial_from_uniform` - TIR gpu kernel for `multinomial_from_uniform` - dispatching pass - TVMScript parser support for pure-python...

Hzfengsy

[TFLite][Frontend] Support quantized Reverse sequence

1

Support Reverse sequence quantization operation as part of #15148

tlopex

[BugFix][MetaSchedule] MultiLevelTilingTensorCore generates inconsistent thread-binding sketch for batched matmul

Below script can be used to reproduce the issue. You may need to run it multiple times to reproduce, because sample_perfect_tile may sometime to hide the issue with some decision....

tsu-bin

[SME][TOPI] Add conv2d NHWC SME fp32 schedule

2

This commit adds a scalable `arm_cpu` conv2d NHWC schedule for fp32 which generates SME instructions by using the tensor intrinsics introduced in #16921. Alongside the SME schedule, the logic of...

Anndrey24

[Tracking Issue] TFLite operator support

32

In https://github.com/apache/tvm/issues/9187 we implemented quantised version of operators in TFLite frontend. Recently, I just noticed a few more operators (with varying priorities) that can be taken as beginner friendly tasks,...

leandron

beginner-friendly

frontend:tflite

[TFLite][Frontend] Support quantized ARG_MIN

3

Support ARG_MIN quantization operation as part of #15148

tlopex

[SVE] Use only powers of two as possible vscale values

1

When analyzing scalable expressions, the analyzer will iterate over a series of known vscale values in the range 1-16. However, we can tighten this range to only values that are...

lhutton1

tvm
tvm copied to clipboard

Metadata

[LLVM][RUNTIME] Add optional LLVM ORCJIT runtime executor

[TOPI][Testing] Enable conv2d NHWC fp16 topi testing for `arm_cpu`

[DLight] Update Adreno GEMV Rules

Support multinomial_from_uniform dispatch

[TFLite][Frontend] Support quantized Reverse sequence

[BugFix][MetaSchedule] MultiLevelTilingTensorCore generates inconsistent thread-binding sketch for batched matmul

[SME][TOPI] Add conv2d NHWC SME fp32 schedule

[Tracking Issue] TFLite operator support

[TFLite][Frontend] Support quantized ARG_MIN

[SVE] Use only powers of two as possible vscale values

← Metadata

Owner

Metadata

tvm tvm copied to clipboard

Metadata

← Metadata

Owner

Metadata

tvm
tvm copied to clipboard