Ikram Ali
Urduhack supports Python 3.6 and above.
Ok. I will look into this.
@mirfan899 will look into this issue.
Any progress on this?
> BTW @akkefa before we review the PR, could you make sure you approve circle CI? @ArthurZucker How can I approve the CircleCI flow?
@ArthurZucker @Ki-Seki Has it been fixed yet? I want to start working on it.
@ArthurZucker What should the behavior be when both tokenizer types encounter an OOV token? Should it simply raise an index error exception, or do you have something else in mind?
@ArthurZucker We can start with a warning about this new change. After a certain release, we can introduce a custom exception for OOV words in the tokenizer.
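A rough sketch of that two-phase idea, in plain Python rather than the actual tokenizer code. The names `OOVTokenError`, `convert_tokens_to_ids`, and the `strict` flag are hypothetical and only illustrate the warn-first, raise-later behavior:

```python
import warnings


class OOVTokenError(KeyError):
    """Hypothetical custom exception for out-of-vocabulary tokens."""


def convert_tokens_to_ids(tokens, vocab, strict=False):
    """Map tokens to ids using a plain dict vocabulary (illustrative only).

    strict=False: warn about each OOV token and skip it (transition phase).
    strict=True:  raise OOVTokenError on the first OOV token (later phase).
    """
    ids = []
    for token in tokens:
        if token in vocab:
            ids.append(vocab[token])
        elif strict:
            raise OOVTokenError(token)
        else:
            warnings.warn(
                f"Token {token!r} is out of vocabulary and will be ignored; "
                "a future release may raise OOVTokenError instead.",
                FutureWarning,
                stacklevel=2,
            )
    return ids
```

With `strict=False`, callers keep the current "ignore" behavior but see a `FutureWarning`; flipping the default to `strict=True` in a later release would complete the change.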
@ArthurZucker My plan is to start raising a warning in the fast tokenizer implementation so that users know when out-of-vocabulary (OOV) tokens are being ignored. In the normal tokenizer, OOV...