tydiqa
Scripts for parsing the Wikipedia articles?
I'm looking at how to prepare the passages from the raw Wikipedia dump (downloaded from the links in the repo), but I'm unsure how the passage boundaries are determined. Is the script for this available anywhere, in case I missed it?
And thanks for this great work!