Merge Strategy #2766

djkirkham · 2017-09-11T13:48:57Z

It seems Iris' strategy when merging cubes is to create the maximum number of dimensions it can, even if there is an 'acceptable' cube with fewer dimensions (i.e., a cube with no anonymous dimensions). For example, cubes with scalar coordinates taken from the following sets of points will merge into a 2 x 3 cube, but could also merge into a 1D cube of size 6:

A=[1,1,1,2,2,2]
B=[1,2,3,1,2,3]
C=[1,2,3,4,5,6]

I've recently had a user report a situation where this strategy resulted in a cube which seemed wrong. A PP load operation resulted in a cube with the following signature:

Heavyside function on pressure levels / (1) (forecast_period: 7; realization: 3; forecast_reference_time: 14; latitude: 325; longitude: 432)
     Dimension coordinates:
          forecast_period                                   x               -                           -             -               -
          realization                                       -               x                           -             -               -
          forecast_reference_time                           -               -                           x             -               -
          latitude                                          -               -                           -             x               -
          longitude                                         -               -                           -             -               x
     Auxiliary coordinates:
          time                                              x               -                           x             -               -
     ...

But it seems more sensible for there to be a single time dimension, rather than separate forecast_period and forecast_reference_time dimensions.

The text was updated successfully, but these errors were encountered:

rcomer · 2017-09-12T11:17:47Z

I can't help thinking that whether this is right or wrong is in the eye of the user, and depends on how they want to use the data. The above example looks very much like the data I use, and the way Iris has organised it seems sensible enough to me.

It's possible that the user needs time to be on a single dimension (e.g. if they want to use cube.extract or cube.aggregated_by). So finding ways to give the user more control on the cube's shape would be good. I think @niallrobinson once proposed a keyword for merge that would let the user specify the desired dim-coords, but I can't seem to find the relevant issue/PR now.

I have my own function auxcoord_flatten, which reshapes a cube so that a specified 2d auxcoord becomes 1d. It involves lots of slicing and I'm sure could be done better by someone who knew what they were doing. If so could be a useful addition to e.g. iris.util?

pelson · 2018-02-14T11:36:39Z

First things first, merge is designed to maximise the number of dimensions whilst without creating missing data. The principle behind this is that the more dimensions you have, the more degrees of freedom you have to do analysis on your cube. In general, it is not possible to structure the data in a way that is always what the user wants - sometimes you want a long thin dimension, sometimes you want many short dimensions.

Let's take the above as given, I doubt we would change the behaviour of merge in this instance. It has made a heuristic choice that in many circumstances was a really good one. The logical next step is to provide functionality to the user that makes it fast to swap dimensions around - perhaps even producing higher dimensional cubes that do contain empty data.

I'm sure there are a number of good API options for this - xarray may be a good source of inspiration on the matter.

In summary: merge is unlikely to change its strategy based on this example, but I believe you are describing useful functionality that sits alongside merge/concatenate.

github-actions · 2022-01-18T00:55:18Z

In order to maintain a backlog of relevant issues, we automatically label them as stale after 500 days of inactivity.

If this issue is still important to you, then please comment on this issue and the stale label will be removed.

Otherwise this issue will be automatically closed in 28 days time.

github-actions · 2022-02-15T01:00:09Z

This stale issue has been automatically closed due to a lack of community activity.

If you still care about this issue, then please either:

Re-open this issue, if you have sufficient permissions, or
Add a comment pinging @SciTools/iris-devs who will re-open on your behalf.

pelson added the Type: Enhancement label Feb 14, 2018

duncanwp mentioned this issue May 24, 2018

Adding a utility for flattening Aux Coords #3030

Closed

pp-mo added the Votable Feature label May 1, 2019

rcomer added the Feature: Merge/Concatenate label Sep 4, 2020

github-actions bot added the Stale A stale issue/pull-request label Jan 18, 2022

github-actions bot closed this as completed Feb 15, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge Strategy #2766

Merge Strategy #2766

djkirkham commented Sep 11, 2017

rcomer commented Sep 12, 2017 •

edited

Loading

pelson commented Feb 14, 2018

github-actions bot commented Jan 18, 2022

github-actions bot commented Feb 15, 2022

Merge Strategy #2766

Merge Strategy #2766

Comments

djkirkham commented Sep 11, 2017

rcomer commented Sep 12, 2017 • edited Loading

pelson commented Feb 14, 2018

github-actions bot commented Jan 18, 2022

github-actions bot commented Feb 15, 2022

rcomer commented Sep 12, 2017 •

edited

Loading