BitNet-Transformers icon indicating copy to clipboard operation
BitNet-Transformers copied to clipboard

How long does inference on CPU cost?

Open Ywandung-Lyou opened this issue 10 months ago • 0 comments

Training may be on CPU, but deployment has to be on CPU for high scalability.

Ywandung-Lyou avatar Apr 05 '24 05:04 Ywandung-Lyou