wikipedia-biography-dataset
wikipedia-biography-dataset copied to clipboard
How to analyze data in python?
My question is, when I extract the files, it gives me outputs with extensions of ".box", ".contributors", ".id", ".nb", ".sent" & ".title".
I would like to learn how can i use these files in my python codes?
Hello, @DavidGrangier @kaghan I try to decompress the .z files follow the instructions in README but the system always raises the error unzip: cannot find zipfile directory in one of tmp.zip or tmp.zip.zip, and cannot find tmp.zip.ZIP, period. Have you ever encountered this problem?
Hi,
Yes, I encountered the same problem as well but I need to check how I handled it because there has been a while since I finished that project.
I will check and let you know as soon as I figured out.
lins [email protected], 5 Mar 2019 Sal, 15:01 tarihinde şunu yazdı:
Hello, @DavidGrangier https://github.com/DavidGrangier @kaghan https://github.com/kaghan I try to decompress the .z files follow the instructions in README but the system always raises the error unzip: cannot find zipfile directory in one of tmp.zip or tmp.zip.zip, and cannot find tmp.zip.ZIP, period. Have you ever encountered this problem?
— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/DavidGrangier/wikipedia-biography-dataset/issues/2#issuecomment-469690229, or mute the thread https://github.com/notifications/unsubscribe-auth/AfkZew6AeJTDHESmDQKPzTmHcq1OATuHks5vTnjKgaJpZM4VXerT .
-- Kaan Karabal
您好,我想知道.box”,“.contributors”,“.id”,“.nb”,“.sent这些文件怎样才能输出呢?
您好,我想知道.box”,“.contributors”,“.id”,“.nb”,“.sent这些文件怎样才能输出呢?
这些大部分就是文本文件的,你可以下载个 vscode 直接查看,或者使用 python 文件读入。