Contributions icon indicating copy to clipboard operation
Contributions copied to clipboard

EDIRquery

Open lauravongoc opened this issue 3 years ago • 6 comments
trafficstars

Update the following URL to point to the GitHub repository of the package you wish to submit to Bioconductor

  • Repository: https://github.com/lauravongoc/EDIRquery

Confirm the following by editing each check box to '[x]'

  • [x] I understand that by submitting my package to Bioconductor, the package source and all review commentary are visible to the general public.

  • [x] I have read the Bioconductor Package Submission instructions. My package is consistent with the Bioconductor Package Guidelines.

  • [x] I understand Bioconductor Package Naming Policy and acknowledge Bioconductor may retain use of package name.

  • [x] I understand that a minimum requirement for package acceptance is to pass R CMD check and R CMD BiocCheck with no ERROR or WARNINGS. Passing these checks does not result in automatic acceptance. The package will then undergo a formal review and recommendations for acceptance regarding other Bioconductor standards will be addressed.

  • [x] My package addresses statistical or bioinformatic issues related to the analysis and comprehension of high throughput genomic data.

  • [x] I am committed to the long-term maintenance of my package. This includes monitoring the support site for issues that users may have, subscribing to the bioc-devel mailing list to stay aware of developments in the Bioconductor community, responding promptly to requests for updates from the Core team in response to changes in R or underlying software.

  • [x] I am familiar with the Bioconductor code of conduct and agree to abide by it.

I am familiar with the essential aspects of Bioconductor software management, including:

  • [x] The 'devel' branch for new packages and features.
  • [x] The stable 'release' branch, made available every six months, for bug fixes.
  • [x] Bioconductor version control using Git (optionally via GitHub).

For questions/help about the submission process, including questions about the output of the automatic reports generated by the SPB (Single Package Builder), please use the #package-submission channel of our Community Slack. Follow the link on the home page of the Bioconductor website to sign up.

lauravongoc avatar Aug 30 '22 19:08 lauravongoc

Hi @lauravongoc

Thanks for submitting your package. We are taking a quick look at it and you will hear back from us soon.

The DESCRIPTION file for this package is:

Package: EDIRquery
Title: Query the EDIR Database For Specific Gene
Version: 0.99.0
Authors@R: 
    person("Laura D.T.", "Vo Ngoc", , "[email protected]", 
  role = c("aut", "cre"), comment = c(ORCID = "0000-0002-1597-900X"))
Description: EDIRquery provides a tool to search for genes of interest within the Exome Database of Interspersed Repeats (EDIR). A gene name is a required input, and users can additionally specify repeat sequence lengths, minimum and maximum distance between sequences, and whether to allow a 1-bp mismatch. Outputs include a summary of results by repeat length, as well as a dataframe of query results. Example data provided includes a subset of the data for the gene GAA (ENSG00000171298). To query the full database requires providing a path to the downloaded database files as a parameter.
License: GPL (>= 3)
Encoding: UTF-8
biocViews: Genetics, SequenceMatching
Roxygen: list(markdown = TRUE)
RoxygenNote: 7.1.2
Depends: 
    R (>= 4.1.0)
Imports:
    tibble (>= 3.1.6),
    tictoc (>= 1.0.1),
    utils (>= 4.1.3),
    stats (>= 4.1.3),
    readr (>= 2.1.2)
Suggests: 
    knitr, 
    rmarkdown
VignetteBuilder: knitr

bioc-issue-bot avatar Aug 30 '22 19:08 bioc-issue-bot

Thanks for this submission. The package does not use any Bioconductor data structures. A GRanges instance would probably be useful for

head(results)
#> chromosome repeat_length repeat_seq start end repeat_seq2 start2
#> 3930 17 7 CCGCGGG 80101595 80101602 CCGCGGG 80101734
#> 3931 17 7 CCGAGGC 80105602 80105609 CCGAGGA 80105843
#> 3932 17 7 CGGAGGG 80110005 80110012 GCGAGGG 80110061
#> 3933 17 7 CCAAGGG 80118254 80118261 CCGAGGG 80118270
#> 3934 17 7 CCGAGGG 80118270 80118277 GCGAGGG 80118318
#> 3935 17 7 CCGAGGG 80118270 80118277 CAGAGGG 80118533
#> end2 distance ensembl_gene_id hgnc_symbol gene_range
#> 3930 80101741 132 ENSG00000171298 GAA 80101556-80119881
#> 3931 80105850 234 ENSG00000171298 GAA 80101556-80119881
#> 3932 80110068 49 ENSG00000171298 GAA 80101556-80119881
#> 3933 80118277 9 ENSG00000171298 GAA 80101556-80119881
#> 3934 80118325 41 ENSG00000171298 GAA 80101556-80119881
#> 3935 80118540 256 ENSG00000171298 GAA 80101556-80119881
#> ensembl_transcript_id transcript_range intron_exon intron_exon2
#> 3930 ENST00000302262 80101581-80101890 E1 E1
#> 3931 ENST00000302262 80105133-80105748 I2 E3
#> 3932 ENST00000302262 80109945-80110055 E9 I9
#> 3933 ENST00000302262 80118193-80118357 E18 E18
#> 3934 ENST00000302262 80118193-80118357 E18 E18
#> 3935 ENST00000302262 80118193-80118357 E18 I18
#> feature mismatch
#> 3930 same exon 0
#> 3931 spanning intron-exon 1
#> 3932 spanning intron-exon 1
#> 3933 same exon 1
#> 3934 same exon 1

Please consider.

vjcitn avatar Sep 11 '22 14:09 vjcitn

Thank you for your comment.

GRanges is not adapted for our data, as it only highlights a single range of positions, whereas each row in our data includes 2 sequences with their respective start and end positions, the distance between the two, as well as additional annotation.

lauravongoc avatar Sep 22 '22 10:09 lauravongoc

How about the GenomicInteractions class? How about using DNAString to represent your sequences? Do you use Bioconductor with your data?

vjcitn avatar Sep 22 '22 11:09 vjcitn

Hi, do you plan to work on interoperation of this package with Bioconductor? Please let us know.

vjcitn avatar Oct 11 '22 14:10 vjcitn

Hi, apologies for the delay.

The package files in the repo have been updated with the added implementation of GInteractions class.

lauravongoc avatar Oct 16 '22 23:10 lauravongoc

A reviewer has been assigned to your package. Learn what to expect during the review process.

IMPORTANT: Please read this documentation for setting up remotes to push to git.bioconductor.org. It is required to push a version bump to git.bioconductor.org to trigger a new build.

Bioconductor utilized your github ssh-keys for git.bioconductor.org access. To manage keys and future access you may want to active your Bioconductor Git Credentials Account

bioc-issue-bot avatar Nov 09 '22 13:11 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

On one or more platforms, the build results were: "WARNINGS". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Nov 09 '22 13:11 bioc-issue-bot

Received a valid push on git.bioconductor.org; starting a build for commit id: e952da1ddb5afa92aa47000e19691b020392f2ec

bioc-issue-bot avatar Nov 21 '22 02:11 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

Congratulations! The package built without errors or warnings on all platforms.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Nov 21 '22 11:11 bioc-issue-bot

Hi @lauravongoc

I hope to look at this week, thank you for removing the warnings.

Best wishes, Olly

ococrook avatar Nov 21 '22 11:11 ococrook

HI @lauravongoc

My apologies for the delay - it's be a busy month. Please see the following review before we can accept the package:

  • [x] Bioconductor install instruction in the .readme and vignette
  • [x] What does sysdata.rda in the R folder do? I imagine this needs to be remove or moved
  • [x] package version should match between Descriptions and News
  • [x] R to version >= 4.2
  • [x] FIx a license, variable lincenses are not allowed
  • [x] Do you need to import all of the packages and package functions in the NAMESPACE - export and import selectively if possible
  • [x] Please add unit tests
  • [ ] Avoid use of cat in R functions

Otherwise looks good - thank you!

ococrook avatar Dec 16 '22 15:12 ococrook

Received a valid push on git.bioconductor.org; starting a build for commit id: fddb518e85ed44397557232632463b0a0be49e9f

bioc-issue-bot avatar Jan 16 '23 03:01 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

On one or more platforms, the build results were: "ERROR, skipped". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Jan 16 '23 03:01 bioc-issue-bot

Received a valid push on git.bioconductor.org; starting a build for commit id: 5808626e484667e201eeb8028b6217456f4e0757

bioc-issue-bot avatar Jan 16 '23 04:01 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

On one or more platforms, the build results were: "ERROR, skipped". This may mean there is a problem with the package that you need to fix. Or it may mean that there is a problem with the build system itself.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Jan 16 '23 04:01 bioc-issue-bot

Received a valid push on git.bioconductor.org; starting a build for commit id: fe27a2f3faf71de0acba79b458d04250036e924e

bioc-issue-bot avatar Jan 24 '23 12:01 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

Congratulations! The package built without errors or warnings on all platforms.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Jan 24 '23 12:01 bioc-issue-bot

Dear @ococrook

Thank you so much for the review, and my apologies as well for the delay.

The above points have been addressed. Concerning the 2nd point, the file sysdata.rda in the R directory contains internal data that is parsed by the gene_lookup() function if no path parameter with additional data is not specified.

lauravongoc avatar Jan 24 '23 15:01 lauravongoc

Great, thank you! Does the sysdata.rda have to be in the R folder, could it be moved to the data or inst folders?

ococrook avatar Jan 30 '23 14:01 ococrook

Dear @ococrook,

Putting internal data in sysdata.rda in the R folder was done following the R Packages guidelines I referenced from Hadley Wickham & Jenny Bryan, specifically the Data section. I cannot seem to find a way to direct the calls to this data in the function to another source folder.

lauravongoc avatar Mar 06 '23 02:03 lauravongoc

Yes I think this is okay as long as the sysdata.rda is documented and proper indication of source/licensing referenced (if not in a man page than in a file in inst/scripts)

lshep avatar Mar 06 '23 12:03 lshep

Hi @lauravongoc

Sorry, it looks like I got confused which data was being referred where. As suggested, could you make it clear in the package what the data is and where it came from? If I understand correctly the documentation currently there is for the data in the /data folder?

ococrook avatar Mar 07 '23 11:03 ococrook

Received a valid push on git.bioconductor.org; starting a build for commit id: 28c411219af96bba6eeed8036029b8e2e4ba7d2e

bioc-issue-bot avatar Mar 30 '23 23:03 bioc-issue-bot

Dear Package contributor,

This is the automated single package builder at bioconductor.org.

Your package has been built on Linux, Mac, and Windows.

Congratulations! The package built without errors or warnings on all platforms.

Please see the build report for more details. This link will be active for 21 days.

Remember: if you submitted your package after July 7th, 2020, when making changes to your repository push to [email protected]:packages/EDIRquery to trigger a new build. A quick tutorial for setting up remotes and pushing to upstream can be found here.

bioc-issue-bot avatar Mar 30 '23 23:03 bioc-issue-bot

Dear @ococrook

Indeed the documentation that was already there was for the data in /data. I have now added documentation for the sysdata.R in the /inst/scripts/ folder as suggested.

lauravongoc avatar Mar 30 '23 23:03 lauravongoc

Your package has been accepted. It will be added to the Bioconductor nightly builds.

Thank you for contributing to Bioconductor!

Reviewers for Bioconductor packages are volunteers from the Bioconductor community. If you are interested in becoming a Bioconductor package reviewer, please see Reviewers Expectations.

bioc-issue-bot avatar Apr 11 '23 11:04 bioc-issue-bot

The default branch of your GitHub repository has been added to Bioconductor's git repository as branch devel.

To use the git.bioconductor.org repository, we need an 'ssh' key to associate with your github user name. If your GitHub account already has ssh public keys (https://github.com/lauravongoc.keys is not empty), then no further steps are required. Otherwise, do the following:

  1. Add an SSH key to your github account
  2. Submit your SSH key to Bioconductor

See further instructions at

https://bioconductor.org/developers/how-to/git/

for working with this repository. See especially

https://bioconductor.org/developers/how-to/git/new-package-workflow/ https://bioconductor.org/developers/how-to/git/sync-existing-repositories/

to keep your GitHub and Bioconductor repositories in sync.

Your package will be included in the next nigthly 'devel' build (check-out from git at about 6 pm Eastern; build completion around 2pm Eastern the next day) at

https://bioconductor.org/checkResults/

(Builds sometimes fail, so ensure that the date stamps on the main landing page are consistent with the addition of your package). Once the package builds successfully, you package will be available for download in the 'Devel' version of Bioconductor using BiocManager::install("EDIRquery"). The package 'landing page' will be created at

https://bioconductor.org/packages/EDIRquery

If you have any questions, please contact the bioc-devel mailing list (https://stat.ethz.ch/mailman/listinfo/bioc-devel); this issue will not be monitored further.

lshep avatar Apr 13 '23 13:04 lshep