torchtitan icon indicating copy to clipboard operation
torchtitan copied to clipboard

Remove unnecessary .to() inside model forward

Open wconstab opened this issue 1 year ago • 0 comments

Stack from ghstack (oldest at bottom):

  • #161
  • -> #298

This appears to be a holdover from a previous way the initialization worked.

freqs_cis should already be on gpu device after initialization.

See this conversation for reference.

wconstab avatar May 02 '24 23:05 wconstab