joss-reviews icon indicating copy to clipboard operation
joss-reviews copied to clipboard

[REVIEW]: TimeseriesSurrogates.jl: a Julia package for generating surrogate data

Open editorialbot opened this issue 2 years ago • 38 comments

Submitting author: @kahaaga (Kristian Agasøster Haaga) Repository: https://github.com/JuliaDynamics/TimeseriesSurrogates.jl Branch with paper.md (empty if default branch): main Version: v2.0.0 Editor: @jarvist Reviewers: @lucaferranti, @dawbarton Archive: Pending

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/8145914a409945b642e6fd6bd72c8d6a"><img src="https://joss.theoj.org/papers/8145914a409945b642e6fd6bd72c8d6a/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/8145914a409945b642e6fd6bd72c8d6a/status.svg)](https://joss.theoj.org/papers/8145914a409945b642e6fd6bd72c8d6a)

Reviewers and authors:

Please avoid lengthy details of difficulties in the review thread. Instead, please create a new issue in the target repository and link to those issues (especially acceptance-blockers) by leaving comments in the review thread below. (For completists: if the target issue tracker is also on GitHub, linking the review thread in the issue or vice versa will create corresponding breadcrumb trails in the link target.)

Reviewer instructions & questions

@lucaferranti & @dawbarton, your review will be checklist based. Each of you will have a separate checklist that you should update when carrying out your review. First of all you need to run this command in a separate comment to create the checklist:

@editorialbot generate my checklist

The reviewer guidelines are available here: https://joss.readthedocs.io/en/latest/reviewer_guidelines.html. Any questions/concerns please let @jarvist know.

Please start on your review when you are able, and be sure to complete your review in the next six weeks, at the very latest

Checklists

📝 Checklist for @lucaferranti

📝 Checklist for @dawbarton

editorialbot avatar May 23 '22 15:05 editorialbot

Hello humans, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

editorialbot avatar May 23 '22 15:05 editorialbot

Software report:

github.com/AlDanial/cloc v 1.88  T=0.04 s (1331.7 files/s, 114874.4 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Julia                           27            552            221           2124
Markdown                        18            204              0            684
MATLAB                           3            128            200            480
YAML                             6              8             11            177
TeX                              1             11              0            144
TOML                             3              5              0             54
-------------------------------------------------------------------------------
SUM:                            58            908            432           3663
-------------------------------------------------------------------------------


gitinspector failed to run statistical information for the repository

editorialbot avatar May 23 '22 15:05 editorialbot

Wordcount for paper.md is 1331

editorialbot avatar May 23 '22 15:05 editorialbot

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

OK DOIs

- 10.1016/j.physrep.2018.06.001 is OK
- 10.1137/141000671 is OK
- 10.1016/0167-2789(92)90102-s is OK
- 10.1016/0375-9601(94)91096-0 is OK
- 10.1103/physrevlett.77.635 is OK
- 10.1121/1.4929694 is OK
- 10.1103/physreve.74.026205 is OK
- 10.1103/physreve.73.036707 is OK
- 10.1103/physrevlett.87.188101 is OK
- 10.1103/physreve.59.4044 is OK
- 10.1103/physrevlett.101.134101 is OK
- 10.1103/physrevlett.81.4345 is OK
- 10.1103/physreve.85.056202 is OK

MISSING DOIs

- None

INVALID DOIs

- None

editorialbot avatar May 23 '22 15:05 editorialbot

:point_right::page_facing_up: Download article proof :page_facing_up: View article proof on GitHub :page_facing_up: :point_left:

editorialbot avatar May 23 '22 15:05 editorialbot

@dawbarton @lucaferranti - just confirming that you both got invited to this issue for the review!

jarvist avatar May 25 '22 16:05 jarvist

Review checklist for @lucaferranti

Conflict of interest

  • [x] I confirm that I have read the JOSS conflict of interest (COI) policy and that: I have no COIs with reviewing this work or that any perceived COIs have been waived by JOSS for the purpose of this review.

Code of Conduct

General checks

  • [x] Repository: Is the source code for this software available at the https://github.com/JuliaDynamics/TimeseriesSurrogates.jl?
  • [x] License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
  • [x] Contribution and authorship: Has the submitting author (@kahaaga) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?
  • [x] Substantial scholarly effort: Does this submission meet the scope eligibility described in the JOSS guidelines

Functionality

  • [x] Installation: Does installation proceed as outlined in the documentation?
  • [x] Functionality: Have the functional claims of the software been confirmed?
  • [x] Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

  • [x] A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
  • [x] Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
  • [x] Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
  • [x] Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
  • [x] Automated tests: Are there automated tests or manual steps described so that the functionality of the software can be verified?
  • [x] Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

  • [x] Summary: Has a clear description of the high-level functionality and purpose of the software for a diverse, non-specialist audience been provided?
  • [x] A statement of need: Does the paper have a section titled 'Statement of Need' that clearly states what problems the software is designed to solve and who the target audience is?
  • [x] State of the field: Do the authors describe how this software compares to other commonly-used packages?
  • [x] Quality of writing: Is the paper well written (i.e., it does not require editing for structure, language, or writing quality)?
  • [x] References: Is the list of references complete, and is everything cited appropriately that should be cited (e.g., papers, datasets, software)? Do references in the text use the proper citation syntax?

lucaferranti avatar May 25 '22 16:05 lucaferranti

@jarvist I didn't know reviewers have to generate their checklist, is it new? Anyway I generated my checklist and looks like I can edit, so I guess everything is setup? :)

lucaferranti avatar May 25 '22 16:05 lucaferranti

@kahaaga @jarvist I'll come back to this within a couple of weeks

lucaferranti avatar May 25 '22 16:05 lucaferranti

Hi @kahaaga (cc @jarvist ) overall a good submission! Here is the first batch of comments related to the code

Note: This PR is based on the code and docs in the main branch. I noticed you have a PR "JOSS paper" that with a quick look seems to address some of the comments I have below, could you clarify the status of that PR? Maybe it can be merged if it's ready?

Functionality

  • [ ] At the moment installation instructions are missing, it would be good to have those both in the README and in the docs homepage (see below)
  • [ ] The paper has some benchmarks, it would be good to include the code to reproduce those benchmarks. I guess there are several way to approach this, one idea could be to have a benchmarks article also in the documentation, where code snippets and results are included and explained and/or having a benchmark folder with e.g. 1) a julia file for the julia experiments 2) a matlab file for the matlab experiments 3) a postprocessing file for generating the results and graphs (this could be together with 1).

Documentation

  • [ ] Several requirements are currently, missing e.g. installation instructions, contributing guidelines etc., please see the checkpoints in the list above.
  • [ ] Regarding the structure, currently the homepage goes straight into the API docs, this could be a little overwhelming for a newcomer, I would suggest the homepage of the docs to include the followings (but feel free to add remove items if you prefer), it would also be good to have a more or less similar structure in the readme too.
    1. A general overview of the package
    2. installation instructions
    3. A small quickstart example, something small and easy to get the taste of the package
  • [ ] Currently the readme has only the link to the docs of the dev version but not to the latest stable release.

Other

a few minor comments

  • [ ] the CI badge in the readme points to travis, but looking at the PRs you seem to use github actions (indeed the latest action on travis seems to be one year old). I guess the travis badge is a leftover? In that case the .travis.yml could be removed too probably?
  • [ ] (very minor, feel free to ignore if you like) I noticed you are already using codecov to compute the coverage. Since you have it already setup, have you considered adding a badge and link to the coverage report in the readme? The coverage is 88%, so it would also look good in the repo :)

This was a first batch of comments, I'll next read the paper more carefully and check I can reproduce the examples in the docs.

lucaferranti avatar Jun 08 '22 05:06 lucaferranti

Hey @lucaferranti! Thanks for the initial comments.

This PR is based on the code and docs in the main branch. I noticed you have a PR "JOSS paper" that with a quick look seems to address some of the comments I have below, could you clarify the status of that PR? Maybe it can be merged if it's ready?

When submitting the paper, I had the option to select a branch to be considered for the publication. I chose the JOSSPaper branch, so that's the latest version of the paper/docs/etc. However, I see that it's more convenient if everything is on the master branch, so I merged it now. The documentation badge in the repo readme now points to the updated documentation.

I think that should address most of your initial comments. In particular:

The paper has some benchmarks, it would be good to include the code to reproduce those benchmarks. I guess there are several way to approach this, one idea could be to have a benchmarks article also in the documentation, where code snippets and results are included and explained and/or having a benchmark folder with e.g. 1) a julia file for the julia experiments 2) a matlab file for the matlab experiments 3) a postprocessing file for generating the results and graphs (this could be together with 1).

The benchmarks are now in the paper/timing/code folder of the repo.

Regarding the structure, currently the homepage goes straight into the API docs, this could be a little overwhelming for a newcomer, I would suggest the homepage of the docs to include the followings (but feel free to add remove items if you prefer), it would also be good to have a more or less similar structure in the readme too. 1) A general overview of the package 2) installation instructions 3) A small quickstart example, something small and easy to get the taste of the package

Please have a look at the updated documentation. It now includes installation instructions, a brief overview of the package (with a link to the dedicated "what is a surrogate" page).

As for the quickstart example, there are already plentyful of examples in the documentation - one for most of the methods. We'd prefer to keep the documentation as minimal as possible, without unnecessary repetition. Maybe it'd be enough to include a sentence in the opening paragraphs of the docs that examples are found under the individual method pages?

the CI badge in the readme points to travis

The CI badge now points to Github Actions, which are currently used.

(very minor, feel free to ignore if you like) I noticed you are already using codecov to compute the coverage. Since you have it already setup, have you considered adding a badge and link to the coverage report in the readme? The coverage is 88%, so it would also look good in the repo :)

A codecov badge is now included in the readme for the repo!

kahaaga avatar Jun 08 '22 08:06 kahaaga

Dear @lucaferranti , thank you very much for your work so far!

@dawbarton , have you had a chance to look at this yet?

jarvist avatar Jun 28 '22 11:06 jarvist

Apologies for the delay. I'll try to make sure that it gets done this week.

dawbarton avatar Jul 04 '22 14:07 dawbarton

Review checklist for @dawbarton

Conflict of interest

  • [x] I confirm that I have read the JOSS conflict of interest (COI) policy and that: I have no COIs with reviewing this work or that any perceived COIs have been waived by JOSS for the purpose of this review.

Code of Conduct

General checks

  • [x] Repository: Is the source code for this software available at the https://github.com/JuliaDynamics/TimeseriesSurrogates.jl?
  • [x] License: Does the repository contain a plain-text LICENSE file with the contents of an OSI approved software license?
  • [x] Contribution and authorship: Has the submitting author (@kahaaga) made major contributions to the software? Does the full list of paper authors seem appropriate and complete?
  • [x] Substantial scholarly effort: Does this submission meet the scope eligibility described in the JOSS guidelines

Functionality

  • [x] Installation: Does installation proceed as outlined in the documentation?
  • [x] Functionality: Have the functional claims of the software been confirmed?
  • [x] Performance: If there are any performance claims of the software, have they been confirmed? (If there are no claims, please check off this item.)

Documentation

  • [x] A statement of need: Do the authors clearly state what problems the software is designed to solve and who the target audience is?
  • [x] Installation instructions: Is there a clearly-stated list of dependencies? Ideally these should be handled with an automated package management solution.
  • [x] Example usage: Do the authors include examples of how to use the software (ideally to solve real-world analysis problems).
  • [x] Functionality documentation: Is the core functionality of the software documented to a satisfactory level (e.g., API method documentation)?
  • [x] Automated tests: Are there automated tests or manual steps described so that the functionality of the software can be verified?
  • [x] Community guidelines: Are there clear guidelines for third parties wishing to 1) Contribute to the software 2) Report issues or problems with the software 3) Seek support

Software paper

  • [x] Summary: Has a clear description of the high-level functionality and purpose of the software for a diverse, non-specialist audience been provided?
  • [x] A statement of need: Does the paper have a section titled 'Statement of need' that clearly states what problems the software is designed to solve, who the target audience is, and its relation to other work?
  • [x] State of the field: Do the authors describe how this software compares to other commonly-used packages?
  • [x] Quality of writing: Is the paper well written (i.e., it does not require editing for structure, language, or writing quality)?
  • [x] References: Is the list of references complete, and is everything cited appropriately that should be cited (e.g., papers, datasets, software)? Do references in the text use the proper citation syntax?

dawbarton avatar Jul 04 '22 14:07 dawbarton

I've now checked through the documentation and the paper. Overall, it looks good. The points previously raised by the other reviewer have been addressed and there isn't anything else substantive to address as far as I can see.

Two very minor things that I did pick up on:

  1. I tried the timing code - it was a bit annoying as it can't just be run without modification. Personally, I would separate out the data generation or the plotting code (or both) because at the moment you have to copy-and-paste the code in piecemeal because there is a circular dependency. (Matlab needs to use the same data that is generated in the Julia script but the Julia script also plots pre-saved Matlab results that can only be regenerated once you've got the data.) Also, the hardcoded paths are a bit annoying - you could use @__FILE__ or @__DIR__ instead.
  2. It might be worth putting a little more disambiguation of the term "surrogates" in the package readme / docs. It's a bit of an overloaded term with surrogate models being a significant area of interest at the moment (but completely different).

dawbarton avatar Jul 13 '22 13:07 dawbarton

How are things going @kahaaga ?

jarvist avatar Aug 01 '22 15:08 jarvist

@jarvist I'll have these comments addressed within a day or two.

kahaaga avatar Aug 01 '22 19:08 kahaaga

@jarvist Sorry for the delay!

We have now addressed @dawbarton's comments. The changes have been merged to the main branch.

I tried the timing code - it was a bit annoying as it can't just be run without modification. Personally, I would separate out the data generation or the plotting code (or both) because at the moment you have to copy-and-paste the code in piecemeal because there is a circular dependency. (Matlab needs to use the same data that is generated in the Julia script but the Julia script also plots pre-saved Matlab results that can only be regenerated once you've got the data.) Also, the hardcoded paths are a bit annoying - you could use @FILE or @DIR instead.

Good point. I now use @__DIR__ for file paths, and reorganised the timing scripts so that tasks are delegated a bit differently between MATLAB and Julia. Now, first run the MATLAB script (which will generate example data), then the Julia script (which will do the comparison and generate the plot for the paper).

I also increased the number of surrogates generated for the comparison for a bit more reliable timings (@jarvist note: the paper pdf must be re-generated to include the update timing plot).

It might be worth putting a little more disambiguation of the term "surrogates" in the package readme / docs. It's a bit of an overloaded term with surrogate models being a significant area of interest at the moment (but completely different).

The wording in the documentation was slightly adjusted to highlight that we are dealing with time series surrogates.

kahaaga avatar Aug 09 '22 10:08 kahaaga

@kahaaga apologies for the delay for finishing my review, I will get it done today

lucaferranti avatar Aug 09 '22 10:08 lucaferranti

@editorialbot generate pdf

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

@editorialbot help

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

Hello @lucaferranti, here are the things you can ask me to do:


# List all available commands
@editorialbot commands

# Get a list of all editors's GitHub handles
@editorialbot list editors

# Check the references of the paper for missing DOIs
@editorialbot check references

# Perform checks on the repository
@editorialbot check repository

# Adds a checklist for the reviewer using this command
@editorialbot generate my checklist

# Set a value for branch
@editorialbot set joss-paper as branch

# Generates the pdf paper
@editorialbot generate pdf

# Get a link to the complete list of reviewers
@editorialbot list reviewers

editorialbot avatar Aug 18 '22 06:08 editorialbot

@editorialbot generate pdf

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

@kahaaga @jarvist there is an error when generating the pdf, not sure if it's a bot issue or paper issue, can you have a look at it?

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

@lucaferranti I originally submitted the paper referring to the JOSSPaper branch, which no longer exists, because it was merged to main and deleted (see discussion above).

The error seems to be related to the fact that this branch no longer exists.

Screenshot 2022-08-18 at 08 43 05

Maybe running @editorialbot set main as branch, then re-generating the pdf will work?

kahaaga avatar Aug 18 '22 06:08 kahaaga

meanwhile @kahaaga, overview software and docs look good to me. As a small nitpick, currently the installation instructions in the docs homepage are last. This seems a little strange to me, I would suggest to move that section before the API section.

I will check the paper once I can generate the pdf.

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

Maybe running @editorialbot set main as branch, then re-generating the pdf will work?

I don't think a reviewer should be allowed to change the branch where the paper is generated from, that sounds like something the author and/or editor should do, can you try it?

lucaferranti avatar Aug 18 '22 06:08 lucaferranti

@editorialbot set main as branch

kahaaga avatar Aug 18 '22 06:08 kahaaga