PySDM first try at adding a Seeding dynamic

Jul 31 '24 17:07 slayoo

Codecov Report

All modified and coverable lines are covered by tests :white_check_mark:

Project coverage is 84.61%. Comparing base (723e62c) to head (794243c). Report is 4 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1367      +/-   ##
==========================================
+ Coverage   84.30%   84.61%   +0.30%     
==========================================
  Files         373      375       +2     
  Lines        9160     9247      +87     
==========================================
+ Hits         7722     7824     +102     
+ Misses       1438     1423      -15

:umbrella: View full report in Codecov by Sentry.
:loudspeaker: Have feedback on the report? Share it here.

Aug 01 '24 21:08 codecov[bot]

Some sanity check tests to add in might be 1) checking that turning on Seeding dynamic but having time_window=0 yields the same results as no seeding, 2) checking that all particles have multiplicity > 0 by end of seeding window, and then a more physical check would be to check that the droplet number concentrations are larger and the droplet sizes are smaller when you have seeding.

Hopefully these will address the warnings from Codecov :)

Aug 05 '24 15:08 claresinger

In https://github.com/open-atmos/PySDM/pull/1367/commits/292d4fb4e388029bda17e8563d4ce2709343523d, there is a seeding/no-seeding plot layout depicting what happens also without seeding

Aug 08 '24 12:08 slayoo

@slayoo I tried to write a unit test for the Seeding dynamic. It's here. https://github.com/claresinger/PySDM/blob/sd_injection/tests/unit_tests/dynamics/seeding/test_parcel_seeding_sanity_checks.py I'm not sure I have permissions to commit changes to this PR... or at least I didn't figure out how to do that, sorry!

Aug 08 '24 15:08 claresinger

@claresinger, thanks! I'd vote for placing this type of test into smoke_tests rather than unit_tests - it checks much more than just the seeding dynamic "unit" (and it takes much more time than a unit test). Also, we could avoid copy-pasting code and run the hello_world notebook via the notebook_vars fixture and assert on its variables (like we did here: https://github.com/open-atmos/PySDM/blob/main/tests/smoke_tests/parcel_a/lowe_et_al_2019/test_fig_s2.py)?

To push to this PR's branch, you should be able to do:

git remote add slayoo [email protected]:slayoo/PySDM.git
git push slayoo sd_injection:sd_injection

Aug 08 '24 15:08 slayoo

great that the push worked, thanks! I'll try to reduce code duplication in the smoke test now

Aug 08 '24 15:08 slayoo

BTW, adding a new folder in smoke tests requires also adding an entry here: https://github.com/open-atmos/PySDM/blob/55e0cce99bd582df5aeb5136289bd94289b687c6/.github/workflows/tests%2Bartifacts%2Bpypi.yml#L128

Aug 08 '24 15:08 slayoo

Also, we could avoid copy-pasting code and run the hello_world notebook via the notebook_vars fixture and assert on its variables...

Looking close, I'd say we can better simplify the test to just use a Box env and a few timesteps, without invoking all the notebook machinery. But let me look at it tomorrow

Aug 08 '24 16:08 slayoo

Also, we could avoid copy-pasting code and run the hello_world notebook via the notebook_vars fixture and assert on its variables...

Looking close, I'd say we can better simplify the test to just use a Box env and a few timesteps, without invoking all the notebook machinery. But let me look at it tomorrow

here it is: https://github.com/open-atmos/PySDM/pull/1367/commits/d3318e68c6c405df145e6fe036c23ffada20a50d

Aug 17 '24 21:08 slayoo

smoke test with asserts on the values in the hello-world notebook (sd_count & rain mixing ratio) added ✓

Aug 20 '24 14:08 slayoo

So, at this point we are missing:

[x] unit tests for sampling logic
[ ] saner handling of pool of candidates for sampling (currently all the vector is reshuffled regardless of how many particles we inject)
[x] more "physics tests", like Clare proposed: checking for reduced concentration (without collisions?), checking for change in sizes (without collisions?), would be great to come up with more (incl. collisions)

Nice to have, but likely OK to be done in another PR:

[ ] multidimensional example (with position attribute sampled)
[ ] example based on literature
[ ] parallelisation of the sampling logic
[ ] GPU support

Aug 21 '24 17:08 slayoo

Hi @slayoo - thanks so much for all the work here! Sorry for the delay - I've been away on a hiking holiday :) Things are looking great. I agree with your assessment of what we need to include in this PR vs. what should wait for future PRs.

For the unit tests of sampling logic, what do we mean by that? Do you mean something like checking if the seeded particle sizes/other attributes do well reproduce the specified distribution of sizes/other attributes? Or do you mean something even simpler, more "unit"-like?

Aug 29 '24 20:08 claresinger

@claresinger

For the unit tests of sampling logic, what do we mean by that? Do you mean something like checking if the seeded particle sizes/other attributes do well reproduce the specified distribution of sizes/other attributes? Or do you mean something even simpler, more "unit"-like?

The sampling logic is external to this PR (i.e. no files changed in PySDM/initialisation/sampling). The minimum we would need here is to assert on the actions performed by newly introduced injection logic, so essentially playing with various settings of the seeding method arguments:

multiplicity,
extensive_attributes,
seeded_particle_index,
seeded_particle_multiplicity,
seeded_particle_extensive_attributes,
number_of_super_particles_to_inject

and asserting if the outcome is consistent with the argument setting.

Aug 31 '24 13:08 slayoo

Remaining items, to be handled in another PR, are documented in issue #1387

Sep 12 '24 17:09 claresinger

Pylint is passing. ✅ But the tests that are passing locally are failing from a RuntimeWarning. ❌

Sep 12 '24 19:09 claresinger

Pylint is passing. ✅ But the tests that are passing locally are failing from a RuntimeWarning. ❌

Thank you, @claresinger ! In the workflow, we invoke pytest with the -We option to treat warnings as errors

Sep 12 '24 19:09 slayoo

I think the problem is that discretise_multiplicities does not expect all values to be NaN, only some. But that excludes the case where there are no initial particles, only seed particles, like we are testing.

I think this is a valid case, so we should update the function and add a unit test for this case.

Sep 12 '24 19:09 claresinger

I think the problem is that discretise_multiplicities does not expect all values to be NaN, only some. But that excludes the case where there are no initial particles, only seed particles, like we are testing.

I think this is a valid case, so we should update the function and add a unit test for this case.

Good point! Thank you!

Sep 12 '24 20:09 slayoo

codecov reports that the only single line missing test coverage is now (particulator.py):

+         if n_null == 0:
+             raise ValueError(
+                 "No available seeds to inject. Please provide particles with nan filled attributes."
+             )

Sep 12 '24 21:09 slayoo

And here's a pattern we could use to actually check what type of exception is thrown in case of errors and if the exception message matches some expected string: https://stackoverflow.com/a/77005620/1487910

Sep 12 '24 22:09 slayoo

codecov reports that the only single line missing test coverage is now (particulator.py):
+         if n_null == 0:
+             raise ValueError(
+                 "No available seeds to inject. Please provide particles with nan filled attributes."
+             )

Does it make sense to include a case with both background and seed SDs in the backend test to ensure this coverage?

Sep 12 '24 23:09 jtbuch

working on the tests now...

Sep 13 '24 12:09 slayoo

@claresinger, @jtbuch, we've made it!

Sep 14 '24 16:09 slayoo

Just ran the code and everything looks good to me!

Sep 14 '24 16:09 jtbuch