torchchat
[FEATURE REQUEST] Auto-detect llama2/llama3 from tokenizer.model in runner-aoti/runner-et
Remove the -l 2 and -l 3 flags and auto-detect the model architecture from the tokenizer class.
Issue a warning if the user-supplied -v value does not match the vocab size inferred from tokenizer.model.
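A rough sketch of the detection idea, in Python for brevity rather than in the runner's own language. It assumes that a Llama 3 tokenizer.model is a tiktoken-style text file (one base64 token plus an integer rank per line) and that anything else is a Llama 2 SentencePiece binary. The function names detect_tokenizer and check_vocab_size are hypothetical and not part of torchchat. Note that the line count covers only the base tokens in the file, so special tokens would have to be added before comparing against -v.

```python
import base64
import binascii
import warnings
from typing import Optional, Tuple


def detect_tokenizer(path: str) -> Tuple[str, Optional[int]]:
    """Guess the model family from the tokenizer file format (heuristic).

    Returns ("llama3", n_base_tokens) for a tiktoken-style text file,
    otherwise ("llama2", None); parsing SentencePiece is out of scope here.
    """
    with open(path, "rb") as f:
        data = f.read()
    if not data:
        raise ValueError(f"{path} is empty")

    try:
        text = data.decode("utf-8")
    except UnicodeDecodeError:
        # Not valid UTF-8 text: assume a SentencePiece protobuf (Llama 2).
        return "llama2", None

    lines = [ln for ln in text.splitlines() if ln.strip()]
    for ln in lines:
        parts = ln.split()
        # Each tiktoken line is "<base64 token> <rank>".
        if len(parts) != 2 or not parts[1].isdigit():
            return "llama2", None
        try:
            base64.b64decode(parts[0], validate=True)
        except (binascii.Error, ValueError):
            return "llama2", None
    return "llama3", len(lines)


def check_vocab_size(user_vocab: int, inferred_vocab: Optional[int]) -> bool:
    """Warn and return False if the sizes are known to disagree."""
    if inferred_vocab is not None and user_vocab != inferred_vocab:
        warnings.warn(
            f"-v {user_vocab} does not match inferred vocab size {inferred_vocab}"
        )
        return False
    return True
```

A caller would run detect_tokenizer on the tokenizer.model path first, pick the tokenizer class from the returned family, and only then pass any user-supplied -v value to check_vocab_size.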