llama.cpp
llama.cpp copied to clipboard
convert.py: Fix loading safetensors and ggml format on Windows
Calling mmap.mmap
on Windows apparently resets the file offset of the raw file object (and makes the BufferedReader return a negative file offset). For safetensors, avoid using the file offset after calling mmap. For GGML format, explicitly save and restore the offset.
Fixes #966.