taxogen: Could you please tell me how I can get the data?

Open GuangyuanWang opened this issue 6 years ago • 10 comments

I can't find it anywhere.

Feb 27 '18 06:02 GuangyuanWang

Sorry for the late reply. You can get the DBLP data used in the experiments here:

https://drive.google.com/file/d/1GbxKrxrmFrKt5vgDHP1xe1Qr_rfvR1jh/view?usp=sharing

Mar 12 '18 16:03 franticnerd

That's a great help to me, thanks.

Mar 13 '18 02:03 GuangyuanWang

Hi, I was wondering whether seed_keywords.txt and doc_ids.txt are supposed to be created before running the code, and what their contents should be?

Nov 06 '18 12:11 nstfk

I've run the code. seed_keywords.txt and doc_ids.txt are probably created by the preprocessing script "cluster-preprocess.py", but its error message is hidden by the "time" command in "run.sh". So I recommend removing "time" from "run.sh" and then checking the error message.
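To see quickly whether preprocessing produced its outputs, a small check like the one below may help. It is only a sketch: the two file names are the ones reported in this thread, while the directory to inspect is an assumption, since the thread does not say where run.sh expects the files.

```python
from pathlib import Path

# File names as reported in this thread; the directory they belong in is NOT
# stated here, so data_dir is an assumption you must adapt to your setup.
EXPECTED_FILES = ("seed_keywords.txt", "doc_ids.txt")


def missing_files(data_dir, expected=EXPECTED_FILES):
    """Return the names from `expected` that are not regular files in data_dir."""
    data_dir = Path(data_dir)
    return [name for name in expected if not (data_dir / name).is_file()]


if __name__ == "__main__":
    # "data/dblp" is a hypothetical path used only for illustration.
    missing = missing_files("data/dblp")
    if missing:
        print("Missing after preprocessing:", ", ".join(missing))
    else:
        print("All expected files are present.")
```

If files are reported missing, running the preprocessing step on its own, without "time" in front of it, should show the underlying error.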

Sep 09 '19 05:09 fracivilization

Does anyone know how I can get the raw dataset?

Sep 29 '19 17:09 jsw-zorro

I downloaded that data and fixed the path, but 'run.sh' fails with an error saying that /seed_keywords.txt and doc_ids.txt are missing. They actually don't exist in the Google Drive link you shared.

Oct 06 '19 14:10 mortezamsp

Regarding "Does anyone know how I can get the raw dataset?":

Isn't it this one?
https://drive.google.com/file/d/1GbxKrxrmFrKt5vgDHP1xe1Qr_rfvR1jh/view?usp=sharing

Oct 06 '19 14:10 mortezamsp

In the 'dblp' folder, duplicate the 'input' folder and rename it to 'raw'.
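The copy described above can also be scripted. This is a minimal sketch that assumes you pass in the path to your local 'dblp' folder; the function name is hypothetical and not part of the taxogen code.

```python
import shutil
from pathlib import Path


def duplicate_input_as_raw(dblp_dir):
    """Copy <dblp_dir>/input to <dblp_dir>/raw and return the path to 'raw'.

    Hypothetical helper: dblp_dir is wherever your 'dblp' folder lives.
    """
    dblp_dir = Path(dblp_dir)
    src = dblp_dir / "input"
    dst = dblp_dir / "raw"
    if not src.is_dir():
        raise FileNotFoundError(f"No 'input' folder found in {dblp_dir}")
    if dst.exists():
        # Leave an existing 'raw' folder untouched rather than overwrite it.
        return dst
    shutil.copytree(src, dst)
    return dst
```

For example, `duplicate_input_as_raw("path/to/dblp")` leaves 'input' in place and creates 'raw' next to it with the same contents.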

Dec 14 '21 08:12 SasCezar

How do I create the 'embeddings.txt' file for a custom dataset?

Apr 13 '22 02:04 Aravindkumar-Rajendran

Hello. I don't remember that project, but your question seems to have a general answer: change the code to accept a string instead of a text file.

Apr 13 '22 04:04 mortezamsp
