llama.cpp
common: free ctx_gguf when exiting llama_control_vector_load_one
ctx_gguf is initialized by gguf_init_from_file and appears to be otherwise unused throughout the function. Someone more familiar with the code might want to consider whether the gguf_init_from_file call is necessary at all. But in the meantime, free the context when exiting so we don't leak memory.
There are more leaks in this function. #6289 has the fixes.
Do we use valgrind here, and should we?
This particular one was found by static analysis. The others I sent recently were found by ASAN, which is a lot faster than valgrind.
@stevegrubb looks like it might take a while for https://github.com/ggerganov/llama.cpp/pull/6289 to be merged in.
You may want to check whether any of the leak fixes from there are directly related to this PR and move them over; then it would make sense to merge this in.
@mofosyne looks like the referenced patch contains the fix. My personal taste is not to mix features and bug fixes, so that bug fixes can be cherry-picked if needed. I'll close this.