PL-BERT
PL-BERT copied to clipboard
support for multiliungal tokenizer with hooks for malayalam, adding range support
fix for https://github.com/yl4579/PL-BERT/issues/29 and support for malaylam
text = 'hello (1200 - 1230)'
out = normalize_text(text)
print(out)
hello (one thousand two hundred to one thousand two hundred thirty)