tvm
Open deep learning compiler stack for CPU, GPU and specialized accelerators
### Issue: [RISC-V RVV] avg_pool2d operator shows performance degradation

#### Description

The average pooling operator (avg_pool2d) shows performance regression with the RISC‑V Vector (RVV) extension, achieving only 0.621× the performance...

### Issue: [RISC-V RVV] round operator shows suboptimal vectorization

#### Description

The round operator performs worse with the RISC‑V Vector (RVV) extension, achieving only 0.547× the performance of the scalar...

### Issue: [RISC-V RVV] floor operator performance regression

#### Description

The floor operator shows performance degradation with the RISC‑V Vector (RVV) extension, achieving only 0.521× the performance of the scalar...

### Issue: [RISC-V RVV] sqrt operator shows poor vectorization performance

#### Description

The sqrt (square root) operator performs poorly with the RISC‑V Vector (RVV) extension, achieving only 0.385× the performance...

### Issue: [RISC-V RVV] Performance Issue: bias_add operator slower with vectorization

#### Description

The bias_add operator shows significant performance degradation when using the RISC‑V Vector (RVV) extension. With an acceleration...

### Issue: [RISC-V RVV] Performance Degradation: ReLU activation slower with vector extension

#### Description

The ReLU (rectified linear unit) operator shows significant performance degradation with the RISC‑V Vector (RVV) extension....

### Issue: [RISC-V RVV] Performance Issue: log operator slower on RVV

#### Description

The log operator exhibits performance regression with the RISC‑V Vector (RVV) extension enabled. The acceleration ratio is...

### Issue: [RISC-V RVV] Performance Regression: sum operator slower on RVV than RV

#### Description

The sum operator shows significant performance degradation when using the RISC‑V Vector (RVV) extension compared...
Hi Committers,

This PR attempts to fix issue https://github.com/apache/tvm/issues/17876. Any suggestions would be appreciated.

### Root Cause

`VMShapeLower` crashed when processing `ShapeExpr` containing composite `PrimExpr`...
This patch fixes issue #18423, where `meta_schedule.tune_tir` crashes during initial population sampling when the `RewriteParallelVectorizeUnroll` postprocessor encounters blocks that violate compact dataflow requirements.

The crash occurred when:

- A block reads...