detecting-scientific-claim icon indicating copy to clipboard operation
detecting-scientific-claim copied to clipboard

404 error - "https://s3-us-west-2.amazonaws.com/pubmed-rct/model_crf.tar.gz"

Open itsmemala opened this issue 4 years ago • 10 comments

The model link results in a 404 error. Has it been moved to a different bucket or not hosted any longer?

itsmemala avatar Jun 29 '20 19:06 itsmemala

Hi @itsmemala, yes, it seems like the host who I host the data removed it. I will try to put it up quite soon early next month!

titipata avatar Jun 29 '20 19:06 titipata

Thanks!

itsmemala avatar Jun 30 '20 12:06 itsmemala

I'm also looking for this to be solved

laviniaflorentina avatar Jul 10 '20 03:07 laviniaflorentina

Links that need fixing:

https://s3-us-west-2.amazonaws.com/pubmed-rct/train.json
https://s3-us-west-2.amazonaws.com/pubmed-rct/dev.json
https://s3-us-west-2.amazonaws.com/pubmed-rct/test.txt
https://s3-us-west-2.amazonaws.com/pubmed-rct/model.tar.gz

Thank you 🙏

laviniaflorentina avatar Jul 10 '20 03:07 laviniaflorentina

@laviniaflorentina thanks so much for the notice! @daniel-acuna Can I poke here if you store the deleted S3 somewhere?

titipata avatar Jul 10 '20 06:07 titipata

@laviniaflorentina @itsmemala I put on temporary model paths here: https://github.com/titipata/detecting-scientific-claim/blob/master/main.py#L37-L38. You can now run it. For train.json, dev.json and test.txt, I will update it later. In gist, it's a post-process file of dataset folder.

titipata avatar Jul 10 '20 07:07 titipata

Hi, I am facing this problem as well. I would like to ask if there is some way to solve it? Thank you very much if somebody could help me to solve it!

Shiyun-W avatar Jul 13 '23 21:07 Shiyun-W

Hi @Shiyun-W, unfortunately the model checkpoint was deleted. I might have to check if it's somewhere on my computer. In addition, the code is outdated with the AllenNLP old version.

titipata avatar Jul 14 '23 00:07 titipata

Hi @titipata, using the provided S3 bucket links, I am also not able to access the model and the annotated dataset. Can you please share the dataset of the annotated labels (claims and non-claims) for PubMedRCT dataset? Assuming sharing this small dataset is easier than the model weights? Thank you!

vibhor98 avatar Jul 27 '23 14:07 vibhor98

@vibhor98 yes, the dataset is available here https://github.com/titipata/detecting-scientific-claim/tree/master/dataset. I couldn't find the trained model since the bucket was deleted. I hope the provided notebook is sufficient for training the model. This codebase is kinda outdated.

titipata avatar Jul 27 '23 14:07 titipata