LasUIE icon indicating copy to clipboard operation
LasUIE copied to clipboard

语料库问题

Open mch0115 opened this issue 2 years ago • 1 comments

在构建语料库的时候我根据论文中给出的链接想要下载wikipedia,但是进入了如下页面中,在点击下载后转到了github中,下载完文件中只有一个txt文件,在运行的时候会提示我FileNotFoundError: Unable to find '/home/machenghao/LasUIE-master/data/post-training/wikipedia-en/dev.txt',应该是语料库中的东西不全,请问这个是我下载的问题吗,我该怎么获得完整的语料库。希望能得到您的解答,万分感谢!

image

mch0115 avatar Dec 17 '23 01:12 mch0115

You got the whole dataset from Wikipedia, but the model needs train. Json, dev. Json and test.Json to run the model successfully. I suggest you divide the whole file into several parts as I mentioned above.

Whitefish-by avatar Mar 18 '24 14:03 Whitefish-by