FastChat
GPU memory keeps growing as conversations get longer, but it is not released when the conversation is closed.
The local model I used took about 40GB of GPU memory at first. Usage grew to 50GB as the conversation went on, and the GPU memory was not freed after I exited the conversations.