Transform-to-Open-Science icon indicating copy to clipboard operation
Transform-to-Open-Science copied to clipboard

Clarification of organizational structure for Year of Open Science training dataset documentation

Open erinleeryan opened this issue 2 years ago • 4 comments

The AI/ML Working Group is producing a number of datasets to be used for machine learning challenges. Along with these datasets we will all likely need to provide documentation on the datasets: either on their creation or methods to access/read/use the data.

It was suggested this would be the appropriate repo for such resources, but I would like to confirm that would be in line with the intended use before doing a PR, and would also like to know where such documents might be suggested to live in the current document structure.

erinleeryan avatar Feb 09 '23 21:02 erinleeryan

I think the most appropriate solution would be a new repository to storage those datasets, anyway we have to wait for someone from the NASA TOPS staff to answer about this...

marcelo-earth avatar Feb 09 '23 22:02 marcelo-earth

Ah, I guess that's one thing I should clarify: we are NOT talking about putting the datasets here, those will be on AWS. What was suggested to go here was documentation on dataset creation in Markdown and some Jupyter notebooks on how to read/operate/visualize data (and maybe some baseline model code as well).

erinleeryan avatar Feb 09 '23 22:02 erinleeryan

Ohh, sorry, I misinterpreted the first message!

marcelo-earth avatar Feb 09 '23 23:02 marcelo-earth

@pmbremner would this be appropriate for the sciencecore section, once developed?

cgentemann avatar Jun 08 '23 15:06 cgentemann