llama2.c
llama2.c copied to clipboard
tok512.model adding a token at start.
To reproduce: sp_mode.Encode('R') -> [410, 461] sp_model.Encode(sp_model.Decode(461)) -> [410, 461]
I am not able to understand this behaviour, not related to bos, eos, unk