axlearn
axlearn copied to clipboard
expose max_decode_len and eos_token_id in decoding
Expose
- max_decode_len for beam_search to causal_lm module
- eos_token_id for sample_decode to causal_lm module
It looks like both have been added unit test in decoding_test.py. This change is to expose the parameter to causal_lm and decoder module.
Hi Guoli, what's the use case? Should we first discuss in an internal PR?
sg. let's discuss in an internal PR firstly.