tvm icon indicating copy to clipboard operation
tvm copied to clipboard

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Results 636 tvm issues
Sort by recently updated
recently updated
newest added
trafficstars

Adds native FP8 type support for CUDA. The e4m3/e5m2 struct types provide explicit type conversions that target hardware native conversion ops. \* Conditionally run Storage and Compute legalization for targets...

Prior to this commit, `int`, `float`, and `bool` arguments from Python were converted to `IntImm`, `FloatImm`, and `Bool`. These are subtypes of `PrimExpr`, and should only be used at compile-time....

runtime:c++

Prior to this commit, `FuseOps` and `FuseOpsByPattern` exposed a symbolic variable to the fused function if it was used within the fused function, but wasn't inferable from other parameter shapes....

Internally, `FuseOpsByPattern` makes a mapping from relax variables to the fused group containing that variable. If the input module violates SSA, this map may be ill-formed. While not strictly necessary...

Prior to this commit, using `R.match_cast` as the last binding would produce a segfault, as `var_binding->value` was used instead of `match_cast->value`. In addition, because the last binding of each branch...

Currently, the pytorch wheels available through `pip install` use the pre-C++11 ABI by setting `-DUSE_CXX11_ABI=0` [0]. If TVM were to user the pre-C++11 ABI, this would cause breakages with dynamically-linked...

## Description This pull request aims to enhance model optimization by combining parts of Ansor and AutoTVM. The proposed approach involves the following steps: 1. Execution of Ansor over an...

Prior to this commit, the `MakePackedAPI` pass would output steps in the following order: 1. Check the number of arguments. 2. All `LetStmt` produced by the `ArgBinder` 3. `AssertStmt` for...

Currently, when a default compile target such as `llvm` is specified, it implies `llvm -keys=cpu` which tends to imply x86 related components being used during compilation e.g. the schedules registered...

### Expected behavior I have built oneDNN following this link https://oneapi-src.github.io/oneDNN/dev_guide_build.html. And I modify the config.cmake set(USE_DNNL ON). It should be able to build TVM with DNNL json BYOC feature...

type: bug
needs-triage