llama-gpt
Add mechanism to detect incomplete model download
Something like downloading the model as xxx.tmp and then renaming it to xxx once the download completes.
Downloading the model takes some time. If the download is terminated partway, the next run skips the download and the server fails to start because the model file is corrupted.
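A minimal sketch of the idea in Python, to illustrate the proposal only. It is not llama-gpt's actual download code; the function name `download_model` and the `.tmp` suffix are hypothetical. The file is written under a temporary name and only receives its final name after the download finishes, so an interrupted run never leaves a file that looks complete.

```python
import os
import shutil
import urllib.request


def download_model(url: str, dest: str) -> bool:
    """Download url to dest via a temporary file.

    Returns True if a download happened, False if dest already existed.
    (Hypothetical helper for illustration.)
    """
    if os.path.exists(dest):
        # Only a fully downloaded file is ever given the final name,
        # so an existing dest can be trusted to be complete.
        return False

    tmp = dest + ".tmp"
    # Opening with "wb" truncates any partial file left by an interrupted run.
    with urllib.request.urlopen(url) as response, open(tmp, "wb") as out:
        shutil.copyfileobj(response, out)

    # Rename only after the whole body has been written. On POSIX this is
    # atomic when tmp and dest are on the same filesystem.
    os.replace(tmp, dest)
    return True
```

If the process is killed mid-download, only xxx.tmp exists, so the next run downloads again instead of skipping. This sketch restarts from scratch and does not check file size or a checksum; either could be added before the rename.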