[BUG] Two data links returned when accessing AU_SI12_NRT_R04

Open asteiker opened this issue 1 year ago • 6 comments

Is this issue already tracked somewhere, or is this a new report?

  • [X] I've reviewed existing issues and couldn't find a duplicate for this problem.

Current Behavior

Reported by @trey-stafford in a separate issue under #307: for AU_SI12_NRT_R04, each granule has two data links that point to the same data. One link downloads quickly, but the other is slow.

results[-1].data_links()
[
    'https://lance.nsstc.nasa.gov/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5', 
    'https://lance.itsc.uah.edu/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5'
]

This looks like an issue with the CMR metadata itself, but I don't recall how earthaccess handles multiple "GET DATA" URLs for a granule, or whether it should use only one of the links when they point to the same file.

Expected Behavior

Only a single link per granule should be used when opening or downloading granules with earthaccess, instead of duplicating the outputs.
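As a possible user-side workaround until this is sorted out, here is a minimal sketch of collapsing duplicate links by file name. It is not how earthaccess behaves internally; `unique_links_by_filename` and its `preferred_hosts` parameter are hypothetical names made up for illustration, and it assumes that two URLs ending in the same file name refer to the same file. The report does not say which host is the fast one, so the host passed in the example is only a placeholder choice.

```python
from posixpath import basename
from urllib.parse import urlparse


def unique_links_by_filename(links, preferred_hosts=()):
    """Keep one URL per file name (hypothetical helper, not part of earthaccess).

    The first URL seen for a file name wins, unless a later URL is served
    from a host listed in preferred_hosts and the current choice is not.
    """
    chosen = {}
    for url in links:
        parsed = urlparse(url)
        name = basename(parsed.path)
        current = chosen.get(name)
        if current is None:
            chosen[name] = url
        elif (
            urlparse(current).hostname not in preferred_hosts
            and parsed.hostname in preferred_hosts
        ):
            chosen[name] = url
    return list(chosen.values())


# The two links from the report above.
links = [
    "https://lance.nsstc.nasa.gov/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5",
    "https://lance.itsc.uah.edu/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5",
]

# Placeholder preference: swap in whichever host turns out to be faster.
deduped = unique_links_by_filename(links, preferred_hosts=("lance.itsc.uah.edu",))
print(deduped)
```

With no `preferred_hosts`, the first link listed for each file name is kept, so the result depends on the order in the metadata.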

Steps To Reproduce

Copying original sample code from #307:

import earthaccess

results = earthaccess.search_data(short_name='AU_SI12_NRT_R04')
results = sorted(results, key=lambda x: x['meta']['revision-date'], reverse=True)
earthaccess.login()
files = earthaccess.download(results, "/tmp/test")

Environment

Using earthaccess v0.11.0

Additional Context

No response

asteiker, Oct 29 '24 21:10