BitNet (b1.58) support

Open EwoutH opened this issue 1 year ago • 2 comments

First of all, thanks. We need more ramps.

I was curious what you think of BitNet, and whether llm.c could be a place to facilitate experimenting with it. The papers were extremely promising and got a lot of traction, but while there have been a few (small-scale) reproductions, there isn't yet an easy ramp to start experimenting with it.

Papers

EwoutH avatar May 28 '24 21:05 EwoutH

I don't think we have it on the current roadmap, Andrej can chime in. We have a lot of stuff on the backlog before we get here, including potentially supporting fp8, ZeRO stage 2, etc.

gordicaleksa avatar Jun 07 '24 13:06 gordicaleksa

The problem with BitNet (b1.58) training is that it still uses FP16/BF16 for training, so memory consumption does not decrease. Anyway, getting support for it would be great! If used with FP8 training, it could bring an improvement.

kozuch avatar Jun 29 '24 14:06 kozuch