Port Orion to HEIR
The Orion paper describes a PyTorch -> Lattigo lowering with some ML-specific CKKS packing strategies. We should port their work to HEIR. It would be a good start on arithmetic-FHE support for ML in HEIR, give us another front end (via torch-mlir), and provide a starting point for lowering meaningful programs to a CKKS dialect.
This would require at least:
- Having a recipe to go from PyTorch to TOSA (we probably don't need to integrate torch-mlir proper into this project); see the sketch after this list
- A Lattigo exit dialect with the relevant ops
- A lowering from TOSA to Lattigo
- Somewhere to put the optimizations from the paper
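For the first item, here is a minimal sketch of what the PyTorch -> TOSA step could look like, assuming torch-mlir's Python `compile` entry point with a `"tosa"` output type. That API has moved around across torch-mlir releases, so treat this as illustrative rather than exact:

```python
# Sketch: exporting a small PyTorch model to TOSA MLIR via torch-mlir.
# The torch_mlir.compile entry point and the "tosa" output type match older
# torch-mlir releases; the exact API has shifted across versions, so this is
# only an illustration of the recipe, not a pinned-down invocation.
import torch
import torch_mlir


class TinyCNN(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.conv = torch.nn.Conv2d(1, 4, kernel_size=3, padding=1)
        self.act = torch.nn.ReLU()  # would become a polynomial approximation under CKKS
        self.fc = torch.nn.Linear(4 * 8 * 8, 10)

    def forward(self, x):
        x = self.act(self.conv(x))
        return self.fc(torch.flatten(x, 1))


model = TinyCNN().eval()
example = torch.randn(1, 1, 8, 8)

# Lower torch -> TOSA; the resulting MLIR is what a HEIR pipeline
# (tosa -> ... -> ckks -> lattigo exit dialect) would take as input.
module = torch_mlir.compile(model, example, output_type="tosa")
print(module)
```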
After the talk they gave at Google, I'm reinvigorated to support their convolution method, which is a variant of SISO using a baby-step giant-step approach that seems very efficient.
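For intuition, here is a plaintext sketch of the generic baby-step/giant-step diagonal matrix-vector product that this family of kernels builds on, with `np.roll` standing in for CKKS slot rotations. This is the standard rotation-saving structure, not Orion's exact convolution kernel:

```python
# Plaintext sketch of the baby-step/giant-step (BSGS) diagonal method for a
# homomorphic matrix-vector product, with np.roll standing in for CKKS slot
# rotations. This is only the generic BSGS building block; Orion's SISO-based
# convolution kernel is a more specialized packing on top of this idea.
import numpy as np

def rot(v, k):
    """Cyclic left-rotation by k slots: rot(v, k)[i] == v[(i + k) % n]."""
    return np.roll(v, -k)

def bsgs_matvec(M, x, n1):
    """Compute M @ x using n1 baby-step rotations of x and n2 giant steps.

    Naive diagonal method: y = sum_k diag_k(M) * rot(x, k)  -> n rotations.
    BSGS (k = n1*g + b):   reuse rot(x, b) across giant steps -> ~n1 + n2
    rotations, which is where the efficiency win comes from.
    """
    n = len(x)
    assert n % n1 == 0
    n2 = n // n1
    # k-th generalized diagonal: diag_k[i] = M[i, (i + k) % n]
    diags = {k: np.array([M[i, (i + k) % n] for i in range(n)]) for k in range(n)}
    # Baby steps: rotate the (encrypted) input only n1 times.
    baby = [rot(x, b) for b in range(n1)]
    y = np.zeros(n)
    for g in range(n2):
        # The inner sum uses pre-rotated plaintext diagonals, so it costs no
        # ciphertext rotations; only one rotation per giant step is needed.
        inner = sum(rot(diags[n1 * g + b], -n1 * g) * baby[b] for b in range(n1))
        y = y + rot(inner, n1 * g)
    return y

M = np.random.randn(16, 16)
x = np.random.randn(16)
assert np.allclose(bsgs_matvec(M, x, n1=4), M @ x)
```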
The bootstrap placement algorithm they use also seems nice, but it is a bit more complicated. In short, it seems to require simulating per-layer latency of the neural network to build a graph, and then running a series of shortest-path algorithms to determine where to insert bootstrap ops. The main obstacle here is the simulation, which I believe is dataset-specific. While we still don't have a sense of how we might incorporate such things into HEIR, we have had other cases where knowing about the dataset helps the compiler make better decisions (e.g., when picking a polynomial approximation).
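A toy sketch of the "bootstrap placement as shortest path" idea, where the state space, cost model, and latency constants are all assumptions standing in for whatever the per-layer simulation would produce; this is not Orion's formulation, just the shape of the problem:

```python
# Toy sketch: bootstrap placement over a linear chain of ops as a shortest-path
# problem. Nodes are (op index, remaining multiplicative level); an edge either
# runs the op at the current level or bootstraps first. The latency numbers are
# made-up placeholders; Orion's actual algorithm is more involved.
import heapq

MAX_LEVEL = 4          # levels restored by a bootstrap (assumed)
BOOTSTRAP_COST = 50.0  # assumed bootstrap latency

# (op name, levels consumed, assumed latency at full level)
ops = [("conv1", 2, 3.0), ("act1", 1, 1.0), ("conv2", 2, 3.0), ("fc", 1, 2.0)]

def op_cost(base, level):
    # Placeholder cost model: ops get cheaper at lower remaining level
    # (smaller moduli), which is what makes placement non-trivial.
    return base * (1 + level) / (1 + MAX_LEVEL)

def place_bootstraps():
    # Dijkstra over states (op index, remaining level); `plan` records which
    # ops the best path chose to bootstrap before.
    pq = [(0.0, (0, MAX_LEVEL), ())]
    seen = set()
    while pq:
        cost, (i, lvl), plan = heapq.heappop(pq)
        if (i, lvl) in seen:
            continue
        seen.add((i, lvl))
        if i == len(ops):
            return cost, plan
        name, consumed, base = ops[i]
        if lvl >= consumed:  # run the op without bootstrapping
            heapq.heappush(pq, (cost + op_cost(base, lvl), (i + 1, lvl - consumed), plan))
        # bootstrap before the op, then run it at full level
        heapq.heappush(
            pq,
            (cost + BOOTSTRAP_COST + op_cost(base, MAX_LEVEL),
             (i + 1, MAX_LEVEL - consumed),
             plan + (name,)),
        )

total, bootstrap_before = place_bootstraps()
print(f"total latency {total:.1f}, bootstrap before: {bootstrap_before}")
```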
Looks like the code was open-sourced last month! https://github.com/baahl-nyu/orion
Just a quick peek: looks like they also do tracing to get per-node min/max stats for poly eval: https://github.com/baahl-nyu/orion/blob/f0581052b28d02a00299cce742949930b3260aa8/orion/core/tracer.py#L149. This may help inform https://github.com/google/heir/issues/1700
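For reference, the generic way to collect these stats in PyTorch is with forward hooks over a calibration set. A sketch of that approach (not their tracer; the set of hooked module types is an arbitrary choice here), with the resulting ranges being exactly the kind of input a polynomial-approximation pass would want:

```python
# Sketch of per-module min/max range tracing with PyTorch forward hooks, the
# same kind of statistics Orion's tracer collects to pick ranges for polynomial
# approximations of non-linearities. Generic hook approach, not their code.
import torch

def collect_ranges(model, calibration_batches):
    stats = {}  # module name -> (min, max) of the module's input over all batches

    def make_hook(name):
        def hook(module, inputs, output):
            x = inputs[0].detach()
            lo, hi = x.min().item(), x.max().item()
            old = stats.get(name, (float("inf"), float("-inf")))
            stats[name] = (min(old[0], lo), max(old[1], hi))
        return hook

    handles = [
        m.register_forward_hook(make_hook(name))
        for name, m in model.named_modules()
        if isinstance(m, (torch.nn.ReLU, torch.nn.SiLU, torch.nn.GELU))
    ]
    with torch.no_grad():
        for batch in calibration_batches:
            model(batch)
    for h in handles:
        h.remove()
    return stats
```

The ranges are dataset-specific by construction, which is the same tension noted above about incorporating dataset knowledge into HEIR.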
Currently we are working on implementing the convolution kernel from Orion.