Snehal Patel

Results 24 comments of Snehal Patel

Has any progress been made on `build_vocab()` in the 4.0 release? I just started working with a 105B-token file and am planning to train a few versions of embedding models. Seems like `build_vocab()` is...

I agree with @tobegit3hub. I'm dealing with a similar issue with `tables_initializer()` in TF2.x code. How do I export a model that uses StaticHashTables for serving? I can't find any...

They have only processed and uploaded `en` text from the WET files processed by the `get_text_dataset_from_wet_downloads.py` script. That script uses the fastText language identification model to identify the language of the text and will save...

Yes, there are no filters. Files for the other languages (120+) are generated by that script in their respective directories. Not sure why they were not made available. Maybe because...