ChatRWKV icon indicating copy to clipboard operation
ChatRWKV copied to clipboard

How to write the RWKV in autogressive style like RNN

Open HaiFengZeng opened this issue 1 year ago • 2 comments

instead of running in a teacher-forcing in training stage, can we run the model in free running style?:

for i in range(T):
    X_{t}=RNN(X_{t-1},hidden_state)

since teacher-forcing may have an exposure bias problem.

HaiFengZeng avatar Oct 11 '23 08:10 HaiFengZeng

yes you can

BlinkDL avatar Oct 13 '23 14:10 BlinkDL

see https://github.com/RWKV/RWKV-infctx-trainer

BlinkDL avatar Oct 29 '23 19:10 BlinkDL