llama.cpp
llama.cpp copied to clipboard
Refactoring: Use STL in input processing and fix off-by-one bug
There was an off-by-one bug in the line
if (embd.size() > params.n_batch) {
break;
}
it would copy n_batch+1
elements rather than n_batch
. This PR fixes that and also switches to using STL instead of a while loop