
Support for a multilingual tokenizer with hooks for Malayalam, plus number-range support

Open dsplog opened this issue 8 months ago • 3 comments

Fix for https://github.com/yl4579/PL-BERT/issues/29, plus support for Malayalam. Numeric ranges are now spelled out:

    text = 'hello (1200 - 1230)'
    out = normalize_text(text)
    print(out)
    # Output:
    # hello (one thousand two hundred to one thousand two hundred thirty)
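
For readers who want to see the idea behind the range handling, here is a minimal, self-contained sketch. It is not the code from this change: `number_to_words` and `expand_ranges` are hypothetical helpers written for illustration, and the regex and the "no hyphen, no 'and'" spelling are assumptions chosen to match the output shown above.

```python
import re

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out 0 <= n < 1_000_000 in English (hypothetical helper)."""
    if not 0 <= n < 1_000_000:
        raise ValueError("only 0..999999 is handled in this sketch")
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, rest = divmod(n, 10)
        return TENS[tens] + ("" if rest == 0 else " " + ONES[rest])
    if n < 1000:
        hundreds, rest = divmod(n, 100)
        tail = "" if rest == 0 else " " + number_to_words(rest)
        return ONES[hundreds] + " hundred" + tail
    thousands, rest = divmod(n, 1000)
    tail = "" if rest == 0 else " " + number_to_words(rest)
    return number_to_words(thousands) + " thousand" + tail


# Assumption: a range is two integers separated by a hyphen, with optional spaces.
RANGE_RE = re.compile(r"(\d+)\s*-\s*(\d+)")


def expand_ranges(text: str) -> str:
    """Replace 'A - B' with '<A in words> to <B in words>'."""
    def repl(match: re.Match) -> str:
        start, end = int(match.group(1)), int(match.group(2))
        return f"{number_to_words(start)} to {number_to_words(end)}"
    return RANGE_RE.sub(repl, text)


print(expand_ranges("hello (1200 - 1230)"))
```

A pattern this simple would also rewrite hyphenated digit groups that are not ranges (phone numbers, dates, IDs), so a real normalizer likely needs extra context checks before treating a hyphen as "to".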

dsplog, Nov 30 '23 03:11