Organize baseline images by method and/or cache baseline images during GitHub workflows
Description of the desired feature
I've noticed that our test workflows often fail during the `dvc pull` step because one or two of the baseline images fail to download. For example, 8 of the last 10 runs in https://github.com/GenericMappingTools/pygmt/actions/workflows/ci_tests.yaml report at least one job failing for this reason. The underlying problem is that relying on >160 separate HTTPS calls is unstable (and slow).
This proposal is to restructure the baseline images according to the suggestion in https://github.com/GenericMappingTools/pygmt/issues/1490#issuecomment-917348424, which is to organize the baseline images in directories by method with one .dvc file for each directory rather than a 1:1 match between baseline images and .dvc files. This should reduce the number of HTTPS calls from >160 to ~25, which would increase stability and speed. The DAGsHub team previously mentioned that these problems could eventually be fixed by using a different connection protocol, but I don't think we should wait on that.
Another option is to cache the DVC files (both .dvc/cache and pygmt/tests/baseline/*.png), similar to how we cache the .gmt files, so that the `dvc pull` step only updates outdated files. But this only works around the core issue, so I suggest doing it in addition to the restructuring proposed above.
Are you willing to help implement and maintain this feature? Yes
> Organize the baseline images in directories by method with one .dvc file for each directory rather than a 1:1 match between baseline images and .dvc files. This should reduce the number of HTTPS calls from >160 to ~25, which would increase stability and speed.
Sounds good.
Another possible option: running `dvc pull` twice. Not sure if it works.
Here are the numbers of test images grouped by module name at commit 561eb41edc9abfcb27839c2f8d9c6b4fd77f4616. Maybe we can start opening individual PRs for the modules with >10 baseline images like plot, text, and plot3d, and work our way down the list.
- [ ] plot (21)
- [ ] text (18)
- [ ] plot3d (18)
- [ ] grdview (14)
- [ ] meca (11)
- [ ] config (10)
- [ ] basemap (10)
- [ ] makecpt ( 9)
- [ ] rose ( 7)
- [ ] colorbar ( 6)
- [ ] grdimage ( 5)
- [ ] legend ( 4)
- [ ] geopandas ( 4)
- [ ] coast ( 4)
- [ ] subplot ( 4)
- [ ] grdcontour ( 4)
- [ ] contour ( 3)
- [ ] logo ( 2)
- [ ] inset ( 2)
- [ ] velo ( 2)
- [ ] solar ( 2)
- [ ] histogram ( 1)
- [ ] grd2cpt ( 1)
- [ ] figure ( 1)
- [ ] image ( 1)
- [ ] ternary ( 1)
- [ ] wiggle ( 1)
Not sure if we could just skip the modules with only 1 baseline image, or whether we should do all of them for consistency? We might find that the number of HTTP calls drops low enough to stop the failures once the top modules are all completed.
P.S., Here's the script to generate the statistics:
```python
import os
import glob

import pandas as pd

# Collect the baseline image filenames (without directory or extension)
png_images = glob.glob("pygmt/tests/baseline/test_*.png")
df = pd.DataFrame(
    data=[os.path.splitext(os.path.basename(filepath))[0] for filepath in png_images],
    columns=["filename"],
)
# The module name is the second "_"-separated token in "test_<module>_<case>"
df_modulename = df.filename.str.split("_", expand=True)[1]
print(len(df_modulename))
print(df_modulename.value_counts())
```
Thanks @weiji14 for compiling that information! The plan to start with the methods with many baseline images sounds good to me.
I don't think we'll need to do the methods with just one image if the tests stop failing frequently for this reason.
> Another possible option: running `dvc pull` twice. Not sure if it works.
Or if you want to give @seisman's suggestion a quick try, you could modify https://github.com/GenericMappingTools/pygmt/blob/561eb41edc9abfcb27839c2f8d9c6b4fd77f4616/.github/workflows/ci_tests.yaml#L120-L124 to use https://github.com/nick-fields/retry/tree/v2.8.1#only-retry-after-error. But again, it's probably not nice to keep pinging DAGsHub with so many HTTP requests :slightly_smiling_face:
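Just to make the retry-on-error idea concrete outside of that Action, a minimal sketch of a wrapper around `dvc pull` could look like the following (the attempt count and delay are arbitrary assumptions, not anything we currently use):

```python
import subprocess
import time

# Minimal sketch: retry `dvc pull` a few times, since most failures are
# transient download errors rather than persistent problems.
for attempt in range(1, 4):
    if subprocess.run(["dvc", "pull"]).returncode == 0:
        break
    print(f"dvc pull failed (attempt {attempt}), retrying in 30 s...")
    time.sleep(30)
else:
    raise RuntimeError("dvc pull failed after 3 attempts")
```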
I think it's best to try to fix the root of the problem, which is that we are not using dvc in an optimal way.
I'm on board with the plan to make one .dvc file per module. Is the process as simple as moving all of the baseline images from a specific module (e.g. plot3d) into pygmt/tests/baseline/plot3d, deleting the corresponding .dvc files, and then using the command `dvc add pygmt/tests/baseline/plot3d/`?
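To make that concrete, here's a rough sketch of just the file-moving part (illustrative only, not a tested migration script; the module name is parsed from the filename the same way as in the statistics script above, and the `.dvc` cleanup and `dvc add` steps would still be done separately):

```python
import shutil
from pathlib import Path

baseline = Path("pygmt/tests/baseline")
# Move e.g. the test_plot3d_*.png files into pygmt/tests/baseline/plot3d/
for png in sorted(baseline.glob("test_*.png")):
    module = png.stem.split("_")[1]  # second "_"-separated token is the module name
    target_dir = baseline / module
    target_dir.mkdir(exist_ok=True)
    shutil.move(str(png), str(target_dir / png.name))
```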
> Is the process as simple as moving all of the baseline images from a specific module (e.g. plot3d) into pygmt/tests/baseline/plot3d, ...
I don't think it will work. The tests will fail to find the baseline images.
I don't think we will organize baseline images by method (i.e., tracking directories rather than tracking individual files) as proposed in the OP. The main reasons are:
- We haven't seen DVC failures recently. I think this means the connection to the DVC server is quite stable and our ~180 .dvc files are not too many.
- Tracking directories would cause more trouble, based on our experience with the GMT repository (see my comment https://github.com/GenericMappingTools/gmt/issues/5724#issuecomment-1783738234).
So, I'm inclined to close the issue. Feel free to re-open it if you don't agree.