tvm issues

[TIR][CUDA] Add native FP8 support to codegen

1

Adds native FP8 type support for CUDA. The e4m3/e5m2 struct types provide explicit type conversions that target hardware native conversion ops. \* Conditionally run Storage and Compute legalization for targets...

csullivan

[FFI][RUNTIME] Introduce runtime boxed types for int/float/bool

4

Prior to this commit, `int`, `float`, and `bool` arguments from Python were converted to `IntImm`, `FloatImm`, and `Bool`. These are subtypes of `PrimExpr`, and should only be used at compile-time....

Lunderberg

runtime:c++

[Transform] Improve symbolic variable handling in FuseOps

5

Prior to this commit, `FuseOps` and `FuseOpsByPattern` exposed a symbolic variable to the fused function if it was used within the fused function, but wasn't inferable from other parameter shapes....

Lunderberg

[Unity][Transform] Raise error in FuseOpsByPattern for SSA violation

1

Internally, `FuseOpsByPattern` makes a mapping from relax variables to the fused group containing that variable. If the input module violates SSA, this map may be ill-formed. While not strictly necessary...

Lunderberg

[Bugfix][TVMScript] Handle R.match_cast as last binding in if/else

Prior to this commit, using `R.match_cast` as the last binding would produce a segfault, as `var_binding->value` was used instead of `match_cast->value`. In addition, because the last binding of each branch...

Lunderberg

[Lint] Add check to prevent usage of #include <regex>

3

Currently, the pytorch wheels available through `pip install` use the pre-C++11 ABI by setting `-DUSE_CXX11_ABI=0` [0]. If TVM were to user the pre-C++11 ABI, this would cause breakages with dynamically-linked...

Lunderberg

[ANSOR][AUTOTVM] Combine Ansor and AutoTVM to Improve Scheduling

8

## Description This pull request aims to enhance model optimization by combining parts of Ansor and AutoTVM. The proposed approach involves the following steps: 1. Execution of Ansor over an...

canesche

[TIR] Fix segfaults from ordering of Let/Assert in MakePackedAPI

2

Prior to this commit, the `MakePackedAPI` pass would output steps in the following order: 1. Check the number of arguments. 2. All `LetStmt` produced by the `ArgBinder` 3. `AssertStmt` for...

Lunderberg

[Target] Automatically detect system triple when not specified by the user

1

Currently, when a default compile target such as `llvm` is specified, it implies `llvm -keys=cpu` which tends to imply x86 related components being used during compilation e.g. the schedules registered...

lhutton1

[Bug] Building tvm with USE_DNNL=ON throws error

2

### Expected behavior I have built oneDNN following this link https://oneapi-src.github.io/oneDNN/dev_guide_build.html. And I modify the config.cmake set(USE_DNNL ON). It should be able to build TVM with DNNL json BYOC feature...

IssacXid

type: bug

needs-triage

tvm
tvm copied to clipboard

Metadata

[TIR][CUDA] Add native FP8 support to codegen

[FFI][RUNTIME] Introduce runtime boxed types for int/float/bool

[Transform] Improve symbolic variable handling in FuseOps

[Unity][Transform] Raise error in FuseOpsByPattern for SSA violation

[Bugfix][TVMScript] Handle R.match_cast as last binding in if/else

[Lint] Add check to prevent usage of #include <regex>

[ANSOR][AUTOTVM] Combine Ansor and AutoTVM to Improve Scheduling

[TIR] Fix segfaults from ordering of Let/Assert in MakePackedAPI

[Target] Automatically detect system triple when not specified by the user

[Bug] Building tvm with USE_DNNL=ON throws error

← Metadata

Owner

Metadata

tvm tvm copied to clipboard

Metadata

← Metadata

Owner

Metadata

tvm
tvm copied to clipboard