Fix sync index column order to match order from astropy 4.0 #215

jeanconn · 2021-02-09T16:41:45Z

Description

Set the sync index column order to match the alphabetical order that astropy 4.0 produced. In astropy 4.2, the column order when initializing a Table from a list of dicts is the order of keys in the first row. Since astropy 4.2 requires Python 3.7, where dicts preserve insertion order, this makes more sense than the previous behavior of just alphabetizing.
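
To illustrate the behavior change described above, here is a minimal pure-Python sketch that emulates the two column-ordering rules. It does not call astropy, and the helper names and sample values are hypothetical; only the column names come from this PR's diff.

```python
# Emulates the two column-ordering rules described above. This is an
# illustration only, not astropy's implementation.

def columns_alphabetized(rows):
    """Older behavior as described: column names sorted alphabetically."""
    return sorted(rows[0])


def columns_from_first_row(rows):
    """Newer behavior as described: key order of the first row.

    Relies on dicts preserving insertion order (Python 3.7+).
    """
    return list(rows[0])


# Hypothetical index row whose keys are not in alphabetical order.
rows = [{'filetime0': 100, 'filetime1': 200, 'date_id': '2019-02-20T2109z',
         'row0': 0, 'row1': 50}]

old_order = columns_alphabetized(rows)
new_order = columns_from_first_row(rows)
print(old_order)
print(new_order)
```

Under the first rule `date_id` comes first regardless of how the dict was built; under the second, the dict's own key order decides, which is why the order of keys in the row dict now matters.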

Testing

Passes unit tests on linux
[N/A] Functional testing

This showed up as unit test failures with astropy 4.2 in the skare 2021.2 env on linux, so passing the unit tests is probably sufficient.

Fixes #216

taldcroft · 2021-02-09T17:35:11Z

Good catch on the issue. One Table API change is that initializing from a dict used to alphabetize the columns; now (since dicts are ordered in Py 3.7) it just uses the order from the dict. I haven't looked in our code to see if that is the source of the discrepancy.

I agree with your general approach, though I'm not sure the explicit list of columns and column order is actually necessary. My original code gets a 👎 for requiring a particular order.

jeanconn · 2021-02-09T17:39:57Z

👍 for having a test that caught this though!

jeanconn · 2021-02-09T17:42:18Z

But with regard to "though I'm not sure the explicit list of columns and column order is actually necessary.", I figured the same. Not being as familiar with this codebase, I wasn't sure if there might be some other piece of code not unit tested that had an implicit requirement... or if changing column order in the real (not test) cheta sync archive could cause processing hiccups.

jeanconn · 2021-02-09T20:48:30Z

Well, new index order could cause processing hiccups or it could just break old clients (laptops or versions run from ska3-matlab)... right?

taldcroft · 2021-02-10T14:22:57Z

@jeanconn - I added some commits and this should be good to go now. I'm approving, and you should have a fresh look as well.

taldcroft · 2021-02-10T14:24:28Z

If this were an astropy PR I might think about squashing the commits down to 1, but whatever.

jeanconn · 2021-02-10T15:01:59Z

Ska/engarchive/update_client_archive.py

@@ -476,13 +476,14 @@ def update_full_h5_files(dat, logger, msid_files, msids, opt):
 def get_full_data_sets(ft, index_tbl, logger, opt):
     # Iterate over sync files that contain new data
     dats = []
-    for date_id, filetime0, filetime1, row0, row1 in index_tbl:
+    for row in index_tbl:


And sure 'row' is fine. I probably overthought it... I avoided 'row' as the var just because the lines of the index table defined row spaces.
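
As a sketch of why looping over whole rows is more robust than positional unpacking, the following uses plain dicts in place of table rows. The sample data and the helper name `row_spans` are hypothetical and are not taken from the repository.

```python
# Plain dicts stand in for table rows here (an assumption for illustration).
index_rows = [
    {'filetime0': 100, 'filetime1': 200, 'date_id': 'a', 'row0': 0, 'row1': 50},
    {'filetime0': 200, 'filetime1': 300, 'date_id': 'b', 'row0': 50, 'row1': 90},
]


def row_spans(rows):
    """Return (date_id, number_of_rows) pairs using name-based access."""
    return [(row['date_id'], row['row1'] - row['row0']) for row in rows]


# Positional access silently depends on column order: with this key order
# the first value in each row is filetime0, not date_id.
first_by_position = list(index_rows[0].values())[0]

spans = row_spans(index_rows)
print(spans)
```

Name-based access gives the same result whatever order the columns arrive in, whereas the positional value changes with the column order.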

jeanconn · 2021-02-10T15:06:32Z

Ska/engarchive/update_server_sync.py

@@ -196,9 +196,9 @@ def get_row_from_archfiles(archfiles):
     # date like 2019-02-20T2109z, human-readable and Windows-friendly (no :) for a unique
     # identifier for this set of updates.
     date_id = get_date_id(DateTime(archfiles[0]['filetime']).fits)
-    row = {'filetime0': archfiles[0]['filetime'],
+    row = {'date_id': date_id,


And sure. Removing the explicit list in file_defs and replacing with this strategy to get the columns in the right order seems fine. You have a better handle on astropy development to know which would be more stable in the long-term.
And we presently don't see a driver for keeping the column order in file_defs for reference (which you might want if you thought this could vary based on environment).

taldcroft · 2021-02-11

Our code should always be written to be tolerant of changes to column order or potentially new columns. My original implementation was the root problem, nothing more.

In astropy we strive for API stability, but sometimes allow cleaning up legacy design mistakes (or in these cases, designs that became mistakes due to language changes).
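
One way to read "tolerant of changes to column order or potentially new columns" is a reader that selects the columns it needs by name. The sketch below assumes a CSV-style index purely for illustration (the on-disk format of the sync index is not stated in this thread); `read_index`, `REQUIRED`, and the sample text are hypothetical.

```python
import csv
import io

# Columns this hypothetical reader needs; anything else is ignored.
REQUIRED = ('date_id', 'row0', 'row1')


def read_index(text):
    """Read index rows by column name, ignoring order and extra columns."""
    reader = csv.DictReader(io.StringIO(text))
    fieldnames = reader.fieldnames or []
    missing = [name for name in REQUIRED if name not in fieldnames]
    if missing:
        raise ValueError(f'missing columns: {missing}')
    # Values stay as strings; type conversion is left out of this sketch.
    return [{name: rec[name] for name in REQUIRED} for rec in reader]


legacy = "date_id,filetime0,filetime1,row0,row1\na,100,200,0,50\n"
reordered = "row1,new_col,row0,date_id\n50,x,0,a\n"

result_legacy = read_index(legacy)
result_reordered = read_index(reordered)
print(result_legacy == result_reordered)
```

Both inputs yield the same records even though the second has a different column order and an extra column, which is the property the comment above asks for.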

jeanconn added 3 commits February 9, 2021 11:38

Define sync index column order in file_defs

5562bcf

Write sync index out using col order from file_defs

7753b8b

Use code the doesn't assume column order to read index

08b007f

jeanconn requested a review from taldcroft February 9, 2021 16:41

taldcroft added 3 commits February 10, 2021 09:04

Change row order for legacy consistency

52a3a5e

Revert the addition of sync_index_cols

2f5435a

Change entry to row

830af07

taldcroft changed the title ~~Set sync index column order explicitly~~ Fix sync index column order to match order from astropy 4.0 Feb 10, 2021

taldcroft approved these changes Feb 10, 2021

jeanconn commented Feb 10, 2021

jeanconn merged commit 7e86bbe into master Feb 10, 2021

jeanconn deleted the index_col_order branch February 10, 2021 16:45

javierggt mentioned this pull request Feb 11, 2021

Update sot/eng_archive to Release 4.52.0 sot/skare3#607

Closed

This was referenced Feb 26, 2021

2021.2 sot/skare3#599

Merged

2021.3 sot/skare3#610

Merged


taldcroft Feb 11, 2021
