AutoCompressors icon indicating copy to clipboard operation
AutoCompressors copied to clipboard

The usage of past_key_values in AutoCompressorMixin

Open RewindL opened this issue 4 months ago • 1 comments

Hello authors. There is no doubt that AutoCompressors is an excellent work.

I have carefully read and learned the mainly designed codes in auto_compressor.py and noticed that the class AutoCompressorMixin contains quite a bit of codes for processing past_key_values (including softprompt in it). But i checked the intermediate values of ​​past_key_values in forward() during training and found it seems always be None.

That confuses me and I have question that whether the processing of past_key_values ​​is redundant in your code for aligning standard interfaces for CasualLM , or for other possible purposes?If not, I'm curious about what the situation is past_key_values!=None during model forwarding ?

RewindL avatar Oct 10 '24 03:10 RewindL