bros
bros copied to clipboard
How to convert BIO-tagged sequence to SPADE
Hi,
BIO is the dominant tagging strategy for token classification tasks. Could you provide an explanation of how to convert a BIO-tagged sequence to SPADE? This would be useful to fine-tine the SPADE-based EE model on custom datasets.
I know the same can be reverse-engineered from the codebase, but it'll be helpful if we have a concrete description of -
-
initial_tokens
,subsequent_tokens
- How to obtain them from a BIO-tagged sequence
- How to "combine" the predicted
initial_logits
andsubsequent_logits
to determine the final class prediction for each token.
Thanks.