PL-BERT icon indicating copy to clipboard operation
PL-BERT copied to clipboard

No shards being saved

Open martinambrus opened this issue 5 months ago • 3 comments

I'm having trouble running the preprocess jupyter notebook you provided. I was trying to create PL-BERT for Slovak language but even when I try to run the code you provided, it would only work until shards processing. Once there, all I can see in code is "Processing shard XY ..." and then a lot of progress bars like this: Map: 0% ::::::::::::::::::::0/64587 [00:02<?, ? examples/s]

There is never an error raised and all (I tried 100 for a test) shards "complete". But there is nothing in the ./wiki_phoneme folder at all. I've no idea what is wrong. The WIKI dataset loads without any problems.

martinambrus avatar Mar 17 '24 17:03 martinambrus