wikipedia-biography-dataset icon indicating copy to clipboard operation
wikipedia-biography-dataset copied to clipboard

How to analyze data in python?

Open kaghan opened this issue 6 years ago • 5 comments

My question is, when I extract the files, it gives me outputs with extensions of ".box", ".contributors", ".id", ".nb", ".sent" & ".title".

I would like to learn how can i use these files in my python codes?

kaghan avatar Jul 20 '18 05:07 kaghan

Hello, @DavidGrangier @kaghan I try to decompress the .z files follow the instructions in README but the system always raises the error unzip: cannot find zipfile directory in one of tmp.zip or tmp.zip.zip, and cannot find tmp.zip.ZIP, period. Have you ever encountered this problem?

ha-lins avatar Mar 05 '19 14:03 ha-lins

Hi,

Yes, I encountered the same problem as well but I need to check how I handled it because there has been a while since I finished that project.

I will check and let you know as soon as I figured out.

lins [email protected], 5 Mar 2019 Sal, 15:01 tarihinde şunu yazdı:

Hello, @DavidGrangier https://github.com/DavidGrangier @kaghan https://github.com/kaghan I try to decompress the .z files follow the instructions in README but the system always raises the error unzip: cannot find zipfile directory in one of tmp.zip or tmp.zip.zip, and cannot find tmp.zip.ZIP, period. Have you ever encountered this problem?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/DavidGrangier/wikipedia-biography-dataset/issues/2#issuecomment-469690229, or mute the thread https://github.com/notifications/unsubscribe-auth/AfkZew6AeJTDHESmDQKPzTmHcq1OATuHks5vTnjKgaJpZM4VXerT .

-- Kaan Karabal

kaghan avatar Mar 05 '19 14:03 kaghan

您好,我想知道.box”,“.contributors”,“.id”,“.nb”,“.sent这些文件怎样才能输出呢?

zhangzhang827 avatar Nov 22 '22 11:11 zhangzhang827

您好,我想知道.box”,“.contributors”,“.id”,“.nb”,“.sent这些文件怎样才能输出呢?

zhangzhang827 avatar Nov 22 '22 11:11 zhangzhang827

这些大部分就是文本文件的,你可以下载个 vscode 直接查看,或者使用 python 文件读入。

YingZhuY avatar Apr 13 '23 07:04 YingZhuY