TuringDataStories icon indicating copy to clipboard operation
TuringDataStories copied to clipboard

[Review] Machine Learning for the 20th century: Can a United Kingdom database from 1987 classify a glass cosmetics jar from the 1930s that might have belonged to Amelia Earhart?

Open crangelsmith opened this issue 2 years ago • 5 comments

Story Review: Issue 174

Story Name: Machine Learning for the 20th century: Can a United Kingdom database from 1987 classify a glass cosmetics jar from the 1930s that might have belonged to Amelia Earhart?

Submitting Author: joecerniglia

Pull Request: #176

Reviewers: @lukehare , Eirini Zormpa and Jennifed Ding

Reviewer instructions & questions

@lukehare, Eirini Zormpa and Jennifed Ding , please carry out your review in this issue by updating the checklist below, and writing new comments in case you have any questions. Specific comments about text can be done using the ReviewNB app linked to the PR. If you cannot edit the checklist please:

  • Make sure you're logged in to your GitHub account
  • Be sure to accept the invite at this URL: https://github.com/alan-turing-institute/TuringDataStories/settings/access

Any questions, concerns or suggestions regarding the review process please let @crangelsmith, @DavidBeavan or @samvanstroud know.

✨ Please start on your review when you are able, and be sure to complete your review in the next six weeks, at the very latest ✨

Review Checklist

Code of conduct

  • [ ] I confirm that I read and will adhere to the Turing Data Stories code of conduct.

General checks

  • [ ] Notebook: Is the source code for this data story available as a notebook in the linked pull request?
  • [ ] Contribution and authorship: Are the authors clearly listed? Does the author list seem appropriate and complete?
  • [ ] Scope and eligibility: Does the submission contain an original and complete analysis of open data? Is the story aligned with the Turing Data Stories vision statement?

Reproducibility

  • [ ] Does the notebook run in a local environment?
  • [ ] Does the notebook build and run in binder?
  • [ ] Are all data sources openly accessible and properly cited with a link?
  • [ ] Are the data open, and do they have an explicit licence, provenance and attribution?

Pedagogy

  • [ ] Does the story demonstrate some specific data analysis or visualisation techniques?
  • [ ] Are these techniques well motivated?
  • [ ] Are these techniques well implemented?
  • [ ] Is the notebook well documented, using both markdown cells and comments in code cells?
  • [ ] Does the notebook has a introduction section motivating the story?
  • [ ] Does the notebook has a conclusion section discussing the main insight from the stories?
  • [ ] Is the paper well written (it does not require editing for structure, language, or writing quality)?

Context

  • [ ] Does the story give an insight into some societal issue?
  • [ ] Is the context around this issue well referenced (newspaper articles, scientific papers, etc.)?

Ethical

  • [ ] Is any linkage of datasets in the story unlikely to lead to an increased risk of the personal identification of individuals?
  • [ ] Is the Story truthful and clear about any limitations of the analysis (and potential biases in data)?
  • [ ] Is the Story unlikely to lead to negative social outcomes, such as (but not limited to) increasing discrimination or injustice?

AOB

Please use the ReviewNB app to submit your reviews on the notebook, the story to be reviewed is under the name stories/2022-03-27_Glass_ML/ML_20th_Century.ipynb. Other files are complementary and do not need to be reviewed.

Agree between yourselves on how to divide the review work. For example, one could take the more technical review and others focus on the storytelling, etc.

crangelsmith avatar Jul 06 '22 12:07 crangelsmith

thank you for the explanation 😊 this link: https://github.com/alan-turing-institute/TuringDataStories/settings/access doesn't seem to work?

eirini-zormpa avatar Jul 13 '22 17:07 eirini-zormpa

thank you for the explanation 😊 this link: https://github.com/alan-turing-institute/TuringDataStories/settings/access doesn't seem to work?

Oh sorry about that! I have now added you as a collaborator to the project, so you should have access now.

crangelsmith avatar Jul 14 '22 10:07 crangelsmith

@crangelsmith could you add me as well? link currently doesn't work for me either!

dingaaling avatar Jul 14 '22 13:07 dingaaling

Review Checklist

Code of conduct

  • [X] I confirm that I read and will adhere to the Turing Data Stories code of conduct.

General checks

  • [X] Notebook: Is the source code for this data story available as a notebook in the linked pull request?
  • [X] Contribution and authorship: Are the authors clearly listed? Does the author list seem appropriate and complete?
  • [X] Scope and eligibility: Does the submission contain an original and complete analysis of open data? Is the story aligned with the Turing Data Stories vision statement?

Reproducibility

  • [ ] ~Does the notebook run in a local environment?~ Outstanding question for TDS team to decide on how to handle story-specific packages (See #205)
  • [ ] ~Does the notebook build and run in binder?~ Binder builds, but as above, dataset is missing.
  • [X] Are all data sources openly accessible and properly cited with a link?
  • [X] Are the data open, and do they have an explicit licence, provenance and attribution?

Pedagogy

  • [X] Does the story demonstrate some specific data analysis or visualisation techniques?
  • [X] Are these techniques well motivated?
  • [X] Are these techniques well implemented?
  • [X] Is the notebook well documented, using both markdown cells and comments in code cells?
  • [X] Does the notebook has a introduction section motivating the story?
  • [X] Does the notebook has a conclusion section discussing the main insight from the stories?
  • [X] Is the paper well written (it does not require editing for structure, language, or writing quality)?

Context

  • [X] Does the story give an insight into some societal issue?
  • [X] Is the context around this issue well referenced (newspaper articles, scientific papers, etc.)?

Ethical

  • [ ] Is any linkage of datasets in the story unlikely to lead to an increased risk of the personal identification of individuals?
  • [ ] Is the Story truthful and clear about any limitations of the analysis (and potential biases in data)?
  • [ ] Is the Story unlikely to lead to negative social outcomes, such as (but not limited to) increasing discrimination or injustice?

lukehare avatar Jul 20 '22 09:07 lukehare

Done 😊

From: Jennifer Ding @.> Date: Thursday, 14 July 2022 at 14:01 To: alan-turing-institute/TuringDataStories @.> Cc: Camila Rangel Smith @.>, Mention @.> Subject: Re: [alan-turing-institute/TuringDataStories] [Review] Machine Learning for the 20th century: Can a United Kingdom database from 1987 classify a glass cosmetics jar from the 1930s that might have belonged to Amelia Earhart? (Issue #187)

@crangelsmithhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fcrangelsmith&data=05%7C01%7Ccrangelsmith%40turing.ac.uk%7C55644f395d9b4f56f1d008da6598f29a%7C4395f4a7e4554f958a9f1fbaef6384f9%7C0%7C0%7C637934004820564906%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=gTQdpdODqnD4ULhfnTSqIPFe3u3su1hVWWvk6lDPMjU%3D&reserved=0 could you add me as well? link currently doesn't work for me either!

— Reply to this email directly, view it on GitHubhttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Falan-turing-institute%2FTuringDataStories%2Fissues%2F187%23issuecomment-1184420106&data=05%7C01%7Ccrangelsmith%40turing.ac.uk%7C55644f395d9b4f56f1d008da6598f29a%7C4395f4a7e4554f958a9f1fbaef6384f9%7C0%7C0%7C637934004820564906%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=2sUFSHqxAE2v2Pz%2FJ2EyaG8X8p8V3yQ%2BukztR8afUks%3D&reserved=0, or unsubscribehttps://eur01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fnotifications%2Funsubscribe-auth%2FACVFDWVRLSR2HJYYJDV552DVUAFR7ANCNFSM52ZUGJQA&data=05%7C01%7Ccrangelsmith%40turing.ac.uk%7C55644f395d9b4f56f1d008da6598f29a%7C4395f4a7e4554f958a9f1fbaef6384f9%7C0%7C0%7C637934004820564906%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=AgL0AXMdC%2BdJFF3%2FRazwyJD6F4JHBucW%2BFpEIs9R%2FOo%3D&reserved=0. You are receiving this because you were mentioned.Message ID: @.***>

crangelsmith avatar Oct 11 '22 06:10 crangelsmith