bigbird
How are prior approaches that can only accept short text input, such as Attn-Seq2Seq, evaluated on long-text datasets?