wasi-nn
Add a `prompt` interface
Recent conversations (especially in the machine learning working group) suggest a need for a more specialized version of wasi-nn targeting LLMs. This change proposes such an interface, allowing users to access an ML graph directly using prompt strings instead of through the original tensor-based interface (i.e., inference).
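A minimal sketch of what such a prompt-oriented interface might look like in WIT, alongside the existing tensor-based one. All names and signatures below are illustrative assumptions for discussion, not the proposal's actual definitions:

```wit
// Hypothetical sketch — interface, resource, and function names here
// are assumptions, not the wording of the actual proposal.
interface prompt {
    // An opaque session bound to a loaded graph; implementations could
    // keep stateful items (e.g., a kv-cache) behind this handle.
    resource session {
        // Send a prompt string and receive the model's completion,
        // bypassing the tensor-based inference API.
        compute: func(prompt: string) -> result<string, error>;
    }
}
```

The key design question such a sketch surfaces is exactly the one raised below: whether tokenization and session state live behind the opaque handle or remain the caller's responsibility.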
Are you assuming that all LLMs have intrinsic tokenization? Not all foundation models are string-to-string transforms.
Is the implication that the kv-cache and other stateful items will be kept opaquely by this context rather than maintained by the caller?