deep-review icon indicating copy to clipboard operation
deep-review copied to clipboard

Add a Systematic Text-Mining Component to the Review?

Open swamidass opened this issue 3 years ago • 3 comments

So, what do you think about doing some quantitative analysis publications? E.g. look at the number of abstracts that use deep learning in particular domains? Count the number of authors? Look at key themes?

It seems that the field has just exploded, and adding a systematic component would be a very strong addition, and could be a good way to direct the paper. This would guide how we should update the text. Enough of us do informatics that I expect someone here already has PubMed abstracts downloaded and has requisite experience in text-mining. Give how large the review is (and well cited), it might be valuable to also look who who has cited the review and papers cited in the review. Here, we may need to reach out to someone who has citation data, but it could really be worth it.

What do you think @cgreene and @agitter?

swamidass avatar Nov 02 '20 04:11 swamidass

I like the idea. It would help us objectively assess what has changed in the domain since v1, adding to our own subjective takes.

The @greenelab has tools and data for https://greenelab.github.io/preprint-similarity-search/ that could contribute to this.

If someone does want to work on this, I'd like to think about how we could automate it so we could discuss a snapshot of the results but continue generating current versions. We have some good examples of this type of automated analysis in a Manubot manuscript that we could follow.

agitter avatar Nov 02 '20 16:11 agitter

Great. I like the idea and would like to participate in the design of the experiments and the write up.

How do we get citation data? Is there any good preprocessed versions of PubMed we can work off of so as to avoid reinventing the wheel?

swamidass avatar Nov 02 '20 23:11 swamidass

This might be a winner...

https://opencitations.net/index/coci

swamidass avatar Nov 02 '20 23:11 swamidass