conda-forge.github.io

source code caching?

Open beckermr opened this issue 5 years ago • 18 comments

I am hitting cases where source code from non-GitHub/PyPI sources is either updated under the same filename or gone completely. Have we considered caching source code tarballs, or enabling CF to do this if desired?
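For illustration, the "updated but with the same filename" case can be detected by comparing a fresh download against the `sha256` pinned in the recipe. This is a minimal sketch using only the Python standard library; the function names are hypothetical and not part of any conda-forge tooling.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large tarballs are never held fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def source_is_unchanged(path, pinned_sha256):
    """True when the downloaded tarball still matches the hash pinned in the recipe."""
    return sha256_of_file(path) == pinned_sha256.strip().lower()
```

A mismatch here means upstream replaced the file in place, which is the situation a cached copy would protect against.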

beckermr avatar Aug 18 '19 12:08 beckermr

We have in some cases reuploaded these artifacts on the GitHub release of the feedstock.

jakirkham avatar Aug 19 '19 09:08 jakirkham

Interesting. I was thinking of something more automated: if the conda-forge.yml has the right setting and the PR is a version bump, it downloads the artifact, uploads it somewhere (could be the GitHub release), and then changes the recipe and opens a new PR.
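As a rough sketch of the last step only (changing the recipe), something like the following could rewrite the `url:` line to point at the cached copy. `MIRROR_BASE`, the URL layout, and both function names are assumptions made up for illustration; the thread does not settle where sources would live, and real recipes with Jinja-templated or list-valued URLs would need more care.

```python
import re

# Hypothetical mirror location, used here only as a placeholder.
MIRROR_BASE = "https://example.invalid/conda-forge-sources"


def mirrored_url(feedstock, filename):
    """Build the (assumed) URL of the cached tarball for a feedstock."""
    return f"{MIRROR_BASE}/{feedstock}/{filename}"


def point_recipe_at_mirror(meta_yaml_text, feedstock, filename):
    """Replace the first single-line `url:` value in the recipe text."""
    new_url = mirrored_url(feedstock, filename)
    # Match an indented "url: <value>" on one line; list-style URLs are left alone.
    pattern = re.compile(r"^([ \t]*url:[ \t]*)\S.*$", flags=re.MULTILINE)
    return pattern.sub(lambda m: m.group(1) + new_url, meta_yaml_text, count=1)
```

Because the cached tarball is a byte-for-byte copy, the `sha256` entry in the recipe would stay as it is.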

beckermr avatar Aug 19 '19 11:08 beckermr

There was a separate discussion about using split packages to handle this. So one artifact users would produce would be the source.

jakirkham avatar Aug 19 '19 12:08 jakirkham

@scopatz, we have access to a Google Drive through NumFOCUS, right? Is there a limit on the space?

isuruf avatar Aug 28 '19 01:08 isuruf

We do, with no limit as far as I know.

scopatz avatar Aug 28 '19 02:08 scopatz

@scopatz, can I get access?

isuruf avatar Aug 28 '19 02:08 isuruf

Bioconda is using https://depot.galaxyproject.org/software/ (https://github.com/galaxyproject/cargo-port). Every tarball is mirrored. Are you considering doing this for every package, as we do in Bioconda?
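To illustrate how a mirror helps on the consumer side, here is a minimal sketch of trying the upstream URL first and falling back to a mirrored copy. It is not how cargo-port or conda-build is implemented; the function name and the injectable `opener` parameter are assumptions for the example.

```python
import urllib.error
import urllib.request


def fetch_first_available(urls, opener=urllib.request.urlopen, timeout=30):
    """Return (url, data) for the first URL that downloads successfully."""
    errors = []
    for url in urls:
        try:
            with opener(url, timeout=timeout) as response:
                return url, response.read()
        except (urllib.error.URLError, OSError) as exc:
            # Remember why this source failed, then try the next one.
            errors.append(f"{url}: {exc}")
    raise RuntimeError("no source URL could be fetched:\n" + "\n".join(errors))
```

Whichever URL succeeds, the downloaded bytes should still be checked against the recipe's pinned hash before use.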

bgruening avatar Aug 28 '19 06:08 bgruening

> Are you considering doing this for every package, as we do in Bioconda?

Yes.

How is this server hosted?

isuruf avatar Aug 28 '19 06:08 isuruf

University-hosted infrastructure at PSU.

bgruening avatar Aug 28 '19 06:08 bgruening

I am away for a week, but feel free to ask around

scopatz avatar Aug 28 '19 16:08 scopatz

@isuruf, I think it would be awesome if we could store the tarballs directly on anaconda.org under a different channel, e.g. conda-forge-source, so we don't explode the repodata.json.

At the end of the day, we are looking for an object store, and I think we have a great one in anaconda.org, which has an API we are all familiar with.

sodre avatar Sep 04 '19 02:09 sodre

@sodre, you can only upload conda and pypi packages to anaconda.org.

isuruf avatar Sep 04 '19 02:09 isuruf

We could wrap the tar.gz or source files in a conda package rather easily. Wouldn't that satisfy the requirement?

I raised a similar issue with the conda-build team about packaging models for data scientists. This is not that different.

cf. https://github.com/conda/conda-build/pull/3529

sodre avatar Sep 04 '19 03:09 sodre

True. Do you have some script to do that? We can hook it up to conda-forge-ci-setup.

isuruf avatar Sep 04 '19 03:09 isuruf

That wouldn't work, IMO. It'll be harder to replace the file in the recipe.

isuruf avatar Sep 04 '19 03:09 isuruf

Let me run a couple of tests...

sodre avatar Sep 04 '19 03:09 sodre

Where did we land on where the sources should be stored?

jakirkham avatar Feb 10 '20 07:02 jakirkham

@wolfv has mentioned that it would be possible to store the source(s) as part of the OCI mirroring strategy. I don't know how tricky that is, or which part of the pipeline is responsible for that, but leaving this info here in case it's needed!

jaimergp avatar Sep 22 '22 11:09 jaimergp