xlnet [Question] How to use cached memory during finetuning ?

[Question] How to use cached memory during finetuning ?

Open astariul opened this issue 5 years ago • 1 comments

As mentioned here, cached memory is not used at finetuning time.

It is also mentioned that one can increase the maximum length at finetuning time, since relative position embeddings are used. However, increasing the size make the model slower (right ?).

How can one changes the existing examples (run_squad) to make the model use cached memory at finetuning time ?

Jul 02 '19 06:07 astariul

@zihangdai Any update on this issue ? Do you plan to release an example for such a use, or is it still not planned ?

Aug 27 '19 01:08 astariul

xlnet xlnet copied to clipboard

[Question] How to use cached memory during finetuning ?

xlnet
xlnet copied to clipboard