bros icon indicating copy to clipboard operation
bros copied to clipboard

How to convert BIO-tagged sequence to SPADE

Open MrinalJain17 opened this issue 7 months ago • 0 comments

Hi,

BIO is the dominant tagging strategy for token classification tasks. Could you provide an explanation of how to convert a BIO-tagged sequence to SPADE? This would be useful to fine-tine the SPADE-based EE model on custom datasets.

I know the same can be reverse-engineered from the codebase, but it'll be helpful if we have a concrete description of -

  • initial_tokens, subsequent_tokens
  • How to obtain them from a BIO-tagged sequence
  • How to "combine" the predicted initial_logits and subsequent_logits to determine the final class prediction for each token.

Thanks.

MrinalJain17 avatar Nov 27 '23 16:11 MrinalJain17