ocannl issues

Implement "parameter punning" for the code notation `%cd`

1

This is mostly for show, but it is too tempting not to use in the upcoming OCaml Workshop paper, and it adds consistency with the `%op` notation.

lukstafi

Superoptimizers for tensor programs

Study: [A Multi-Level Superoptimizer for Tensor Programs](https://arxiv.org/abs/2405.05751). See also: [Equality Saturation for Tensor Graph Superoptimization](https://arxiv.org/abs/2101.01332). (Found via the Luminal Discord chat.)

lukstafi

explore

Implement an xLSTM LLM

1

https://arxiv.org/abs/2405.04517

lukstafi

explore

Consider reducing the dependencies on Jane Street libraries; especially, break up `ppx_jane`

They put a load on the CI.

- Highest priority: `ppx_jane` pulls in a lot of packages.
- Medium priority: we don't use much from `core`.
- Low priority: get...

lukstafi

Move the `%cd` syntax to the arrayjit library / package

1

This would go a long way in making arrayjit useful independently of OCANNL. Some complications might come from interactions with shape inference. But the biggest obstacle is that `%cd` also...

lukstafi

Audit and/or more extensively test dimension label checking and inference

In my recent work I wasn't paying attention to dimension labels, only to the dimensions (axis sizes) themselves. I suspect that the labels are neither completely checked nor completely inferred.

lukstafi

Study and incorporate Andrej Karpathy's `llm.c` lessons

3

["A few new CUDA hacker friends joined the effort and now llm.c is only 2X slower than PyTorch"](https://twitter.com/karpathy/status/1778988957713477778) https://github.com/karpathy/llm.c

lukstafi

enhancement

Introduce a division operator that raises a shape error when the division leaves a non-zero remainder

Audit all places where that would be needed.

lukstafi

bug
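
The checked division proposed in the issue above could look roughly like the following. This is only a minimal sketch: the names `Shape_error` and `exact_div`, and the `~what` label used for error messages, are hypothetical and are not taken from OCANNL's code base. It assumes dimensions are plain positive integers.

```ocaml
(* Hypothetical names for illustration; not OCANNL's actual API. *)
exception Shape_error of string

(* [exact_div ~what num denom] returns [num / denom]. It raises
   [Shape_error] when [denom] is not positive or when [denom] does not
   divide [num] evenly, instead of silently truncating the result. *)
let exact_div ~what num denom =
  if denom <= 0 then
    raise (Shape_error (Printf.sprintf "%s: non-positive divisor %d" what denom))
  else if num mod denom <> 0 then
    raise
      (Shape_error
         (Printf.sprintf "%s: %d is not divisible by %d (remainder %d)" what num
            denom (num mod denom)))
  else num / denom

let () =
  (* An axis of size 12 split into 4 groups gives 3 elements per group. *)
  Printf.printf "%d\n" (exact_div ~what:"split axis" 12 4);
  (* 10 is not a multiple of 4, so this is reported as a shape error. *)
  match exact_div ~what:"split axis" 10 4 with
  | n -> Printf.printf "unexpected: %d\n" n
  | exception Shape_error msg -> Printf.printf "Shape_error: %s\n" msg
```

With a helper of this kind, the audit mentioned in the issue would amount to finding each plain `/` applied to axis sizes and deciding whether truncation there is intended or a latent shape bug.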

Add an LLVM / clang backend

1

There is no longer any need to wait for LLVM 17: [bindings to LLVM 15 for OCaml 5 on opam](https://discuss.ocaml.org/t/ann-llvm-15-is-out/13019).

lukstafi

enhancement

Consider re-introducing dynamic indexing

In the current giant refactor, I'm removing dynamic indexing. Also perhaps simplifying the "local / global on device / on host" behavior.

lukstafi

explore
