Dayuxiaoshui

Results 5 issues of Dayuxiaoshui

This patch fixes issue #18423, where meta_schedule.tune_tir crashes during initial population sampling when the RewriteParallelVectorizeUnroll postprocessor encounters blocks that violate compact dataflow requirements. The crash occurred when:
- A block reads...

Fix Issue #18407: from_exported_program segfault with exported MHA using eq(0)/expand mask + in-place masked_fill_. Problem: When importing torch.export models with lifted tensors (e.g., from masked_fill_ operations), the conversion fails because...

## Summary
This patch adds comprehensive RISC-V 64-bit architecture optimizations for the LZ4 compression library, achieving **exceptional performance improvements** in decompression speed:
- ✅ **Level 1 decompression**: **4.81x faster** (1215 MB/s...

## Description:
### Background:
LZ4 is a widely used, high-speed compression algorithm, which makes it suitable for applications that require fast compression and decompression....

announce

Hi Density team,
First of all, thank you for creating this fantastic high-performance compression library. I'm writing to propose the addition of RISC-V Vector (RVV) optimizations to further enhance the...