tokenizers
tokenizers copied to clipboard
Added ability to inspect a 'Sequence' decoder and the `AddedVocabulary`.
This PR is in similar spirit to #1341 and adds a couple more functions that allow one to construct a modified version of an existing Tokenizer
. I've followed the existing style and conventions for newly introduced functions.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@ArthurZucker @Narsil gentle ping on this one and on #1444.
@ArthurZucker @Narsil gentle ping about this PR. This PR should not be controversial and is in similar spirit to #1341 (and has the same motivation).
really sorry about all the delays, lot happening on transformers, I'll free some time
Also can you add tests for set and get? 🤗
Sorry this fell through the cracks a bit over the past couple of weeks and I just saw the last couple of comments. Thanks for approving and merging this!
Thanks for your contribution 🤗