simpletransformers icon indicating copy to clipboard operation
simpletransformers copied to clipboard

Using sliding window with Classification Model (BERT) and am now getting an error

Open superqd opened this issue 2 years ago • 3 comments

Describe the bug I'm re-running some training, though I've reinstalled simpletransfomers on a new machine, so maybe something has changed, but I'm using the sliding_window = True param for the ClassficationArgs, with a BERT model, and it now throws warnings/errors saying "Token indices sequence length is longer than the specified maximum sequence length for this model (708 > 512)" and so on.

Before moving to the new machine (which is Linux), I never saw this issue before, but now on the new machine, I am getting it all the time.

Expected behavior I would expect that using the sliding_window parameter would avoid this sort of warning / error.

Screenshots If applicable, add screenshots to help explain your problem.

Desktop (please complete the following information):

  • OS : Linux (Ubuntu 20.4)

superqd avatar Jul 30 '22 05:07 superqd

Is it a warning or error? It's likely a check that got added to the Huggingface library in a recent update. I'll take a look later.

ThilinaRajapakse avatar Aug 15 '22 11:08 ThilinaRajapakse

I believe it came back as an error.

superqd avatar Aug 15 '22 14:08 superqd

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale[bot] avatar Nov 02 '22 01:11 stale[bot]