ipex-llm
ipex-llm copied to clipboard
Fix low memory generation example issue in transformers 4.36
Fix low memory generation example issue in transformers 4.36. Related issue: https://github.com/analytics-zoo/nano/issues/1157 Can support all transformers versions of 4.31+. Test it under transformers 4.31 and 4.36.