Hi, this PR closes #2365. The __init__.py file in the sdss package as well as some of the related documentation (in sdss.rst) has been slightly modified to reflect and use the latest SDSS data release (DR17). Some package tests have been run. Please let me know if I need to do anything further. Thank you.

Aug 02 '22 20:08 jsobeck

Hello @jsobeck! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

There are currently no PEP 8 issues detected in this Pull Request. Cheers! :beers:

Comment last updated at 2022-09-11 04:20:09 UTC

Aug 02 '22 20:08 pep8speaks

Codecov Report

Merging #2478 (5f60c09) into main (5c1eefc) will not change coverage. The diff coverage is n/a.

@@           Coverage Diff           @@
##             main    #2478   +/-   ##
=======================================
  Coverage   63.08%   63.08%           
=======================================
  Files         133      133           
  Lines       17313    17313           
=======================================
  Hits        10922    10922           
  Misses       6391     6391

Impacted Files	Coverage Δ
astroquery/sdss/core.py	`92.06% <ø> (ø)`

:mega: We’re building smart automated test selection to slash your CI/CD build times. Learn more

Aug 02 '22 20:08 codecov[bot]

Updating the default data release needs an entry in the change log

I'll give you a few suggestions for any future pull requests you might open.

Write the pull associated issue number in the description of the pull request using one of these phrases so that the issue can be automatically linked to the pull request. Writing the issue number in the title is not helpful.

It would be much better if the pull request title would describe what its purpose is, not how it is implemented. In this case using the title of your first commit as the pull request title would have been much more useful.

Markdown interprets double underscores as bold font specifiers, which is why your pull request description contains init.py instead of __init__.py. You should use backticks to format code or filenames.

Aug 02 '22 21:08 eerovaher

Got it, @eerovaher. Thanks for information. I was moving too quickly and should have been more careful/deliberate. Though I looked at the change log and knew it had to be modified, it was not clear to me that I should be the one to change it (as opposed to a "reviewer"; even after reading through, e.g., CONTRIBUTING.rst).

Aug 02 '22 21:08 jsobeck

The related issue must be mentioned in the pull request description, i.e. the opening post, not the title. You can see that GitHub has not linked the issue because you have used the correct expression in the wrong place.

Though I looked at the change log and knew it had to be modified, it was not clear to me that I should be the one to change it (as opposed to a "reviewer"; even after reading through, e.g., CONTRIBUTING.rst).

Make sure to mention this over at #2462

Aug 02 '22 21:08 eerovaher

There is one remote data test that is failing and has to be fixed. Also, please squash the commits down to one. And I agree with @eerovaher that the docs should contain code blocks to be useful, mentioning what the default is doesn't really bring much to the table without showing how to change it after all the default is not hidden, but listed in the signature and is set by a standard astropy config system. I agree that it can be confusing to users, but then the confusion should be fixed by showing of how to use a different release, with an example.

Aug 02 '22 22:08 bsipocz

I suspect that the broken remote data test may be fixed in #2477.

I also agree that an example with the data release set explicitly would be useful, and we can use such an example to highlight real changes that have happened to the SDSS data sets over the years. Two examples:

Compare redshifts of a small set of objects over data releases.
The photometric calibration was redone in DR13. This is not necessarily widely known, so one could compare fluxes/magnitudes for a small number of photometric objects between say, DR12 (before recalibration) and DR17.

Aug 02 '22 22:08 weaverba137

The test is broken due to the change in the default dr, so definitely belongs to this PR. OTOH, I'm OK with you cherry-picking and squashing down the commits from this PR to yours, so Jennifer gets a commit credit but we're not juggling two PRs at the same time for the same module.

Aug 02 '22 22:08 bsipocz

What about merging #2477 first, then rebasing this PR? I think that may be easier in the long run.

Aug 02 '22 23:08 weaverba137

Agree to the suggestion from @weaverba137 to merge #2477 first (though my input is not the most crucial here).

Aug 02 '22 23:08 jsobeck

@bsipocz, could you say which specific remote data test is failing? I'd like to verify that it is in fact fixed or at least not also broken in #2477.

Aug 02 '22 23:08 weaverba137

__________________________________________ TestSDSSRemote.test_large_crossid ___________________________________________

self = <astroquery.sdss.tests.test_sdss_remote.TestSDSSRemote object at 0x14eb85490>
large_results = <SkyCoord (ICRS): (ra, dec) in deg
    [(334.26483  ,  3.0572122 ), (334.35421  ,  2.9292896 ),
     (334.35896  ,  3....  ( 22.726164 , 20.667794  ), ( 22.655787 , 20.861832  ),
     ( 22.714435 , 20.469003  ), ( 22.713696 , 20.939534  )]>

    def test_large_crossid(self, large_results):
        # Regression test for #589
    
        results = sdss.SDSS.query_crossid(large_results)
>       assert len(results) == 894
E       assert 845 == 894
E        +  where 845 = len(<Table length=845>\n obj_id        objID               ra        ...       obj_id1        type \n bytes7        int64   ...557872910172 ... 1237679541360984660 GALAXY\nobj_999 1237679476942701227 22.7136962885291 ... 1237679476942701227 GALAXY)

astroquery/sdss/tests/test_sdss_remote.py:205: AssertionError

Aug 03 '22 18:08 bsipocz

Thank you, I will take a look at the details there.

Aug 03 '22 18:08 weaverba137

I've been able to reproduce this. The issue is that the input data large_results is not necessarily deterministic even within a given data release, and certainly not deterministic from one data release to another.

Ultimately, what is test_large_crossid testing? If it is simply to ensure that the queries can handle a ~ 1000-item data set, there should be a better way to generate the input data deterministically.

Aug 03 '22 18:08 weaverba137

Thank you @jsobeck!

Sep 11 '22 04:09 bsipocz

@bsipocz, I guess the resolution to the failed test was just to mark DR17 as default?

I'd still like to know the original purpose of test_large_crossid, if that information still exists?

Sep 13 '22 17:09 weaverba137

Yes, the "fix" was to update to the result of DR17.

As for your second question, it was a regression reported in https://github.com/astropy/astroquery/pull/589

Sep 13 '22 18:09 bsipocz

(so I suppose we could change the test to not assert on the actual number, but that it runs and returns results, but I don't think it's a big deal to leave it as is)

Sep 13 '22 18:09 bsipocz

OK, thanx. I'll go back and read that.

Sep 13 '22 18:09 weaverba137

astroquery
astroquery copied to clipboard

Modifying astroquery/sdss package init file to use most current SDSS DR (as well as associated documentation)

Comment last updated at 2022-09-11 04:20:09 UTC

Codecov Report

astroquery astroquery copied to clipboard

Modifying astroquery/sdss package init file to use most current SDSS DR (as well as associated documentation)

Comment last updated at 2022-09-11 04:20:09 UTC

Codecov Report

astroquery
astroquery copied to clipboard