server
server copied to clipboard
after calling unload_model capi, the memory is not completely released
Description
A clear and concise description of what the bug is.
before calling unloadmodel,memory isbelow:
and after calling unloadmodel,memory isbelow:
Triton Information What version of Triton are you using? 2.40.0dev
Are you using the Triton container or did you build it yourself? build it by myself
To Reproduce
Steps to reproduce the behavior.
Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).
Expected behavior A clear and concise description of what you expected to happen. release memory completely
CC @kthui
@indrajit96 can you give me some suggestions?
@kthui Any workarounds here?