Stack from ghstack (oldest at bottom):
As titled. This PR moves the token generation loop in llama2 runner into a new class so it can be reused.
Differential Revision: D61047601