Dhanachandra Ningthoujam

3 comments by Dhanachandra Ningthoujam

@bradfox2, @peregilk You can use a modified version of the Tensor2Tensor/text_encoder_build_subword.py code to generate a BERT-compatible vocab: https://github.com/kwonmha/bert-vocab-builder

> @irhallac it is the [unusedXXX]-tokens that can be replaced with any word you like. I am running some experiments on how effective this really is, but from my understanding...
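
For illustration, the [unusedXXX] replacement described in the quoted comment could look roughly like the sketch below. It assumes the vocab is a plain list of tokens (one per line in vocab.txt) and that placeholder slots are named like `[unused0]`, `[unused1]`, and so on. The function name `replace_unused_tokens` and the sample words are hypothetical, not part of BERT or any library, and the sketch says nothing about how effective the replacement is.

```python
import re

# Assumption: placeholder slots look like "[unused0]", "[unused1]", ...
UNUSED = re.compile(r"^\[unused\d+\]$")


def replace_unused_tokens(vocab, new_words):
    """Return a copy of vocab with [unusedN] slots overwritten by new_words.

    The vocab length and the position of every other token stay the same,
    so existing token ids remain valid. Hypothetical helper for illustration.
    """
    existing = set(vocab)
    # Drop duplicates and words that are already in the vocab.
    pending = [w for w in dict.fromkeys(new_words) if w not in existing]
    result = list(vocab)
    slots = [i for i, tok in enumerate(result) if UNUSED.match(tok)]
    if len(pending) > len(slots):
        raise ValueError(
            f"{len(pending)} new words but only {len(slots)} unused slots"
        )
    # Fill slots in order; leftover slots keep their placeholder token.
    for i, word in zip(slots, pending):
        result[i] = word
    return result


# Toy vocab standing in for the lines of a real vocab.txt.
vocab = ["[PAD]", "[unused0]", "[unused1]", "[UNK]", "[CLS]", "the"]
updated = replace_unused_tokens(vocab, ["genomics", "the", "proteome"])
print(updated)
```

Because only the placeholder slots are overwritten, the embedding matrix keeps its shape; the new words simply start from whatever vectors those slots had.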

> Hi everyone,
>
> I am lead author on this paper. Apologies for the radio silence on this request. We are currently working on a revision to the paper/approach...