truss
truss copied to clipboard
b10_lookahead speculation mode
Adds speculation mode BASETEN_LOOKAHEAD as an alias to:
speculator:
speculative_decoding_mode: LOOKAHEAD_DECODING
enable_b10_lookahead: true
lookahead_windows_size: 1
lookahead_ngram_size: 32
lookahead_verification_set_size: 1