iree issues

Migrate GCS files to new (ideally public) locations

4

We depend on a few files hosted in a GCP project using various buckets. Most uses can be discovered in this repo with a regex search of `https://storage\.googleapis\.com.*iree`: ``` 21...

ScottTodd

infrastructure

cleanup 🧹
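The regex search mentioned in the GCS migration issue above can be sketched in Python as follows. This is only an illustration, assuming it runs from the root of a local checkout: the pattern is the one quoted in the issue, while the helper name `find_gcs_references` and the choice to skip `.git` and unreadable files are assumptions made here, not part of the issue.

```python
import re
from pathlib import Path

# Pattern quoted in the issue text.
GCS_URL_PATTERN = re.compile(r"https://storage\.googleapis\.com.*iree")


def find_gcs_references(root):
    """Yield (path, line_number, line) for each line matching the pattern.

    Hypothetical helper for illustration; not part of the IREE repository.
    """
    for path in sorted(Path(root).rglob("*")):
        # Skip directories and anything inside the .git directory.
        if not path.is_file() or ".git" in path.parts:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # Skip binary or unreadable files.
        for number, line in enumerate(text.splitlines(), start=1):
            if GCS_URL_PATTERN.search(line):
                yield path, number, line.strip()


if __name__ == "__main__":
    for path, number, line in find_gcs_references("."):
        print(f"{path}:{number}: {line}")
```

Each hit is printed as `path:line: text`, which makes it straightforward to group references by bucket before deciding where each file should move.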

Reshape fusion with multiple results

3

The current reshape propagation patterns upstream bail on multi-result operations, but the implementation seems to support such cases as far as I can tell: https://github.com/llvm/llvm-project/blob/3733528e521b7ee6af3950c65c3ff421c8fd0af6/mlir/lib/Dialect/Linalg/Transforms/ElementwiseOpFusion.cpp#L1253-L1258 This gist is an example...

Max191

Kernel Performance Benchmarking/Reporting

**Initial V0 Goal** Main Requirements: - Must be easy for a developer to run locally (no unnecessary dependencies such as rocBLAS, hipBLASLt, etc.)...

saienduri

infrastructure

infrastructure/benchmark

[LinalgExt][Fusion] Fusion of attention + reshape on reduction dim causes lowering error

6

### What happened? During the decode phase in the sharktank paged LLM, there is a case where we expand shapes on one of attention's reduction dims (K2). When fused, this causes attention pipeline...

raikonenfnu

bug 🐞

[VectorDistribution] Plumb the VectorDistribute pipeline to support reduction operations (3/4)

5

Splitting https://github.com/iree-org/iree/pull/18519 into four patches. Depends on https://github.com/iree-org/iree/pull/18784 and #18800. This is the third patch, including changes along the VectorDistribute pipeline to support reduction operations. Additionally, a relevant test has been...

bangtianliu

Document new external ONNX model and linalg operator test suites.

Progress on https://github.com/iree-org/iree-test-suites/issues/2 and https://github.com/iree-org/iree-test-suites/issues/6 .

ScottTodd

documentation

infrastructure

Add generalize matmul pass to sdxl fp16 benchmarks

Dispatch count is very sensitive to the placement of `tensor.expand_shape` and `tensor.collapse_shape` ops, which often shows up as regressions in changes that shouldn't have a negative impact on dispatch count....

IanWood1

Add Apple GPU runners and run Metal tests again

Similar to https://github.com/iree-org/iree/issues/18814. We used to have a Mac mini running macOS builds/tests, including of the Metal HAL. Right now we use [standard GitHub-hosted runners](https://docs.github.com/en/actions/using-github-hosted-runners/using-github-hosted-runners/about-github-hosted-runners#standard-github-hosted-runners-for-public-repositories) for macOS, which includes arm...

ScottTodd

infrastructure

platform/macos 🍎

hal/metal

Run Windows build/test workflows more regularly

1

We currently run the [`.github/workflows/ci_windows_x64_msvc.yml`](https://github.com/iree-org/iree/blob/main/.github/workflows/ci_windows_x64_msvc.yml) workflow on a nightly schedule using [standard GitHub-hosted runners](https://docs.github.com/en/actions/using-github-hosted-runners/using-github-hosted-runners/about-github-hosted-runners#standard-github-hosted-runners-for-public-repositories) (currently `windows-2022` with 4 CPU cores, 16 GB of RAM, and 14 GB of SSD). Looking at...

ScottTodd

infrastructure

platform/windows 🚪

Add NVIDIA GPU runners and run CUDA tests again

We used to have some Linux runners with NVIDIA T4 and A100 GPUs on the GCP runner cluster. Our new Azure runner cluster currently only has Linux CPU runners. We...

ScottTodd

infrastructure

hal/vulkan

codegen/spirv

hal/cuda

codegen/nvvm
