earthaccess
earthaccess copied to clipboard
[BUG] Two data links returned when accessing AU_SI12_NRT_R04
Is this issue already tracked somewhere, or is this a new report?
- [X] I've reviewed existing issues and couldn't find a duplicate for this problem.
Current Behavior
Reported by @trey-stafford in separate issue under #307: For AU_SI12_NRT_R04, the granules have two data links which have the same data. One is pretty fast but the other link is slow to download.
results[-1].data_links()
[
'https://lance.nsstc.nasa.gov/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5',
'https://lance.itsc.uah.edu/amsr2-science/data/level3/seaice12/R04/hdfeos5/AMSR_U2_L3_SeaIce12km_R04_20241016.he5'
]
This seems like an issue with the CMR metadata itself, but I don't recall how earthaccess is handling multiple get data URLs and whether there should be better behavior to only utilize one of the links.
Expected Behavior
Only a single link per granule should be used for opening or downloading granules from earthaccess instead of duplicating the outputs.
Steps To Reproduce
Copying original sample code from #307:
import earthaccess
results = earthaccess.search_data(short_name='AU_SI12_NRT_R04')
results = sorted(results, key=lambda x: x['meta']['revision-date'], reverse=True)
earthaccess.login()
files = earthaccess.download(results, "/tmp/test")
Environment
Using earthaccess v0.11.0
Additional Context
No response