scrape-up icon indicating copy to clipboard operation
scrape-up copied to clipboard

Feat: Scrapping Pubmed Data using BioPython and Beautiful Soup

Open VarsaGupta opened this issue 9 months ago • 1 comments

Describe the feature

Problem: PubMed offers crucial insights in life sciences, but extracting data conventionally can be slow, particularly with extensive research papers using the library like Beautiful Soup and Selenium. Solution: BioPython emerges as a game-changer in this scenario, offering a faster and more efficient solution. With specialized functionalities tailored for biological data analysis, BioPython streamlines the process of scraping PubMed data, enabling researchers to access valuable insights in few seconds.

My approach: I will provide the python script for both Beautiful Soup and BioPython for scrapping extensive research papers of Pubmed, showcasing the efficiency of BioPython.

Add ScreenShots

Uploading Screenshot 2024-05-03 205138.png…

Record

  • [X] I agree to follow this project's Code of Conduct
  • [X] I'm a GSSoC'24 contributor
  • [X] I want to work on this issue

VarsaGupta avatar May 13 '24 10:05 VarsaGupta

i would like to work on this issue. please assign it to me.

kashishrajputt avatar May 14 '24 18:05 kashishrajputt

Hey @VarsaGupta, the reason why you use Beautiful Soup is that it is lightweight. Packages like BioPython are much heavier. If you want to give it a try without using BioPython you can continue with the issue else can close this.

nikhil25803 avatar May 17 '24 07:05 nikhil25803

@nikhil25803 , okk i will give a try using Beautiful Soup. Could you please assign it a label ?

VarsaGupta avatar May 17 '24 19:05 VarsaGupta

Hey @nikhil25803, can you please look into my pull request.

VarsaGupta avatar May 31 '24 03:05 VarsaGupta