aici
aici copied to clipboard
implement separate stream for memory copy
While at it, also measure mem transfer speed and see how many KV entries can be transferred in a single inference round
streams/events are now implemented but not tested