executorch
executorch copied to clipboard
how to realize the sliding window of kv cache?
hello, now I want to realize the sliding window of kv cache, so dynamic allocation and reclamation of memory needs to be realized. could you please teach me how to realize the dynamic allocation and reclamation of memory in the transformer? Thank you in advanced.