mistral.rs
mistral.rs copied to clipboard
Initial KV RingAttention code
This is the start of the RingAttention code. The changes so far have been to create multiple KV caches (if multiple num_devices) and to try to create separate chunks.