iris Split attrs

Split attrs

Open pp-mo opened this issue 1 year ago • 2 comments

🚀 Pull Request

WIP - just testing, for now

Ultimately, will address #3325

Sep 12 '22 14:09 pp-mo

Rebased + targetting new feature-branch "FEATURE_split_attrs"

Sep 15 '22 17:09 pp-mo

UPDATE: I have since converted the rough plan outlined in this comment into tickets on a new project to cover all of this work

Note : a rough plan for the future development path...

initial tests for status-quo
- now basically done, though more may still be needed / to be added later
cube metadata changes
- the cube state will contain separate 'local' + 'global' attribute dicts
cube attributes handler
- "cube.attributes" to become a wrapper object, providing access ~same as existing (combined local+global), plus specific local + global settings. (Have code for this prototyped already)
netcdf loader changes
- specifically, assign incoming attributes into relevant local+global areas when loading netcdf
- NOTE: the 'combined access' means that, at this point, saving to netcdf will have almost-same results.
netcdf saver changes
- changes to respect the local/global designations on saving (e.g. from original load provenance), where possible
- PLUS: a FUTURE flag implementation to switch new/old handling behaviour
?? cube printout ??
- COULD modify this to have an extra section.
- i.e. "Attributes:" (local) and separate "Global Attributes:"
- not yet clear if this is required / a good idea
  - NB existing listing is presented in alphabetical order
  - so this is going to change A LOT of test results
adjust "equalise_attributes"
- at least ensure that operation makes sense with the new split-form attributes
- needs to handle newly-possible collisions between matching local+global settings
- possibly provide automatic resolution of some such problems
- possibly add extra keywords to control different resolution modes
UserGuide update

other questions ...

? special additional logic in common-metadata / lenient logic, allowing combine+compare of local+global settings ??
? POSSIBLY ? more fixes / rationalisation of existing code
- adjust LimitedAttributeDict to conform to _CF_ATTRS
- modify LimitedAttributeDict to complain about mis-assignment of recognised global-only or local-only attributes
  - NOTE: this probably can't / shouldn't occur in the CubeAttributesMap
    - "should" instead occur within the individual (specific) sets
- worry about handling of "missing_value" attribute : suspect there may be an anomaly here
- rationalise attribute handling : separate logic to separate code, apart from netcdf-save ??

Sep 15 '22 17:09 pp-mo

I've now created a project for this entire effort, and re-targetted this PR onto a dedicated feature branch.

So this PR is now represents only the very first step of that (#4981), and should be evaluated, fixed and merged as it is.

Please review for sense, and completeness of the testing scope

Sep 23 '22 15:09 pp-mo

NOTE: rebased onto updated feature-branch + fixed, following the mergeback which contained #4803

Sep 28 '22 17:09 pp-mo

@lbdreyer I hope recent commits should cover a lot of your outstanding comments. But a couple of things are clearly still outstanding

whether I should be testing load- and save behaviours independently of a round-trip approach
- I suspect you may be right, but changes from this code would be quite involved, so perhaps we should consider it as a separate follow-on.
  I'm still thinking on this one.
how to fix the parallel tests
- I have no idea what the problem here is, or why it only seems to occur with Python 3.10, which feels to me like a PyTest problem.
  I will un-fix this + see what happens now. I could also try a rebase to bring in newer dependencies, but AFAICT there is nothing much in the latest iris/main lockfiles that seems like it is going to affect this.

Oct 26 '22 11:10 pp-mo

Updates ...

@lbdreyer ... a couple of things are clearly still outstanding

* whether I should be [testing load- and save behaviours independently of a round-trip approach](https://github.com/SciTools/iris/pull/4960#discussion_r987733199)

I am working on this. I intend to add some specific load- and save-behaviour tests, probably in addition to the round-trips, but still working out what.

* how to [fix the parallel tests](https://github.com/SciTools/iris/pull/4960#discussion_r987285790)

Well, you'll see that I came up with a workaround, but it's really rather horrible ! As we're working on a feature-branch here, I'd rather pause on a better solution for this, and just move forwards noting the problem ?

Oct 27 '22 14:10 pp-mo

iris iris copied to clipboard

Split attrs

🚀 Pull Request

iris
iris copied to clipboard