Wide-scale testing on Earthscope #252

Closed
11 tasks done
kkappler opened this issue Apr 9, 2023 · 2 comments

kkappler commented Apr 9, 2023

Scripts are being kept in the earthscope_tests branch, under aurora/test_utils/earthscope for now.

Tasks include

  • Accessing XML files from SPUD
    This is done in 00_catalog_SPUD.py
  • Method for testing mt_metadata I/O on SPUD XMLs
    Done in 01_test_load_spud_tfs.py (a rough sketch of this check appears after this list)
    Results of these tests are being posted to mt_metadata issue 143
  • Method for extracting remote references from tf
    Prototypes exist but need review/discussion
  • Add list of RR stations to tf review csv
    A prototype exists, but since extraction of RR is unstable, so is this.
  • Verify that lists of RR stations are accurate -- spot-checking a few manually to start
  • Method for mth5 builds from Earthscope for processing
    03_test_download_from_earthscope.py
  • Tracking for test results of mth5 builds
  • Address Issue #275: Inconsistent coverage between metadata and data at IRIS/Earthscope
  • Address Issue #276: mda string in SPUD XML IFF IRIS/Earthscope data?
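The exact contents of 01_test_load_spud_tfs.py are not reproduced here, but a minimal sketch of the kind of pass/fail scan it performs might look like the following. The local cache directory, the output column names, and the assumption that mt_metadata's TF object exposes a read() method accepting an EMTF XML path are all illustrative, not a copy of the actual script.

```python
# Hedged sketch of a read-check over locally cached SPUD XMLs.
from pathlib import Path

import pandas as pd
from mt_metadata.transfer_functions.core import TF

SPUD_XML_DIR = Path("spud_xml")          # hypothetical local cache of SPUD XMLs
REVIEW_CSV = Path("01_spud_xml_review.csv")

rows = []
for xml_file in sorted(SPUD_XML_DIR.glob("*.xml")):
    row = {"xml": xml_file.name, "read_ok": False, "error": ""}
    try:
        tf = TF()
        tf.read(xml_file)                # parse the EMTF XML into a TF object
        row["read_ok"] = True
    except Exception as exc:             # log the failure and keep scanning
        row["error"] = str(exc)
    rows.append(row)

pd.DataFrame(rows).to_csv(REVIEW_CSV, index=False)
```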

Karl ToDo

  • Driver for aurora processing / iterator over valid XMLs with data to generate TFs
  • TF comparison methods (one candidate metric is sketched below)
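For the TF comparison item, one possible metric is sketched below: a relative RMS difference between two impedance arrays evaluated on a common period grid. This is only an illustration of the kind of comparison 05_compare_tfs.py could make; the shape convention (n_periods, 2, 2) and the interpolation-to-common-periods step that would precede it are assumptions.

```python
import numpy as np


def impedance_rms_misfit(z_test: np.ndarray, z_ref: np.ndarray) -> float:
    """Relative RMS misfit between two impedance arrays of shape (n_periods, 2, 2).

    Hedged sketch of a candidate comparison metric; both arrays are assumed
    to be sampled at the same periods (interpolate beforehand if not).
    """
    z_test = np.asarray(z_test, dtype=complex)
    z_ref = np.asarray(z_ref, dtype=complex)
    if z_test.shape != z_ref.shape:
        raise ValueError("impedance arrays must share a common period grid")
    num = np.abs(z_test - z_ref) ** 2
    den = np.abs(z_ref) ** 2
    mask = den > 0                       # avoid dividing by zero reference elements
    return float(np.sqrt(num[mask].sum() / den[mask].sum()))
```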
kkappler added a commit that referenced this issue Apr 9, 2023
This branch is not expected to modify code in aurora.
We can add the tests back in when we are ready to merge;
in the meantime it just seems silly to run tests on every commit.

When the time comes to merge, we just need to copy
.github/workflows/test.yml back into the repo.

[Issue(s): #252]
kkappler added a commit that referenced this issue Apr 9, 2023
Also add a folder to hold supporting functions if needed.

[Issue(s): #252]
kkappler added a commit that referenced this issue Jun 3, 2023
- modify stage 01 to use get_summary_table_name
- modify stage 01 to add support for remotes_2
- stage 03 is in dev -- not working yet
- stage 04 is in dev
- add EXPERIMENT_PATH as a place to store inventory/metadata (dataless h5s)
- factor get_remotes_2 out of get_remotes
- add support for summary_table filename make/load

issue #252
kkappler added a commit that referenced this issue Jun 19, 2023
There is an issue when the time intervals are incorrect;
this was handled by returning "None".
It was never expected to be encountered,
but it seems that there are mth5s with end time earlier than start time.

While this should be fixed upstream, for now, in order to avoid an
exception when building the kernel_dataset, we should at least
return a correctly shaped output.

Since the overlap() method is supposed to return a start_time and an
end_time, returning None is not acceptable, but returning
None, None is OK (at least structurally).

This fix is being inserted to support the WideScale Testing task.

[Issue(s): #252]
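A sketch of the shape of that fix is below. The function signature and names are illustrative rather than the actual aurora code, but they show why returning a (None, None) pair keeps downstream unpacking intact where a bare None would not.

```python
from typing import Optional, Tuple

import pandas as pd


def overlap(
    t1_start: pd.Timestamp, t1_end: pd.Timestamp,
    t2_start: pd.Timestamp, t2_end: pd.Timestamp,
) -> Tuple[Optional[pd.Timestamp], Optional[pd.Timestamp]]:
    """Return the (start, end) of the overlap of two time intervals.

    Illustrative sketch: if the intervals are degenerate (e.g. an mth5 whose
    end time precedes its start time) or simply do not overlap, return
    (None, None) so that callers that unpack ``start, end = overlap(...)``
    still receive a correctly shaped result.
    """
    start = max(t1_start, t2_start)
    end = min(t1_end, t2_end)
    if end <= start:
        return None, None
    return start, end
```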
kkappler added a commit that referenced this issue Jun 19, 2023
kkappler added a commit that referenced this issue Jun 29, 2023
- deprecate unused TMP_FROM_EMTF argument
- add testing control param restrict_to_first_n_rows
- make SPUD paths a dict, keyed by emtf, data, base

[Issue(s): #252]
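On the last bullet of that commit, the SPUD paths dict might be shaped roughly like the snippet below; the directory names are hypothetical, and only the keys (emtf, data, base) come from the commit message.

```python
from pathlib import Path

# Hypothetical layout for the SPUD paths dict; directory names are illustrative.
SPUD_BASE = Path("data/spud")
SPUD_PATHS = {
    "base": SPUD_BASE,
    "emtf": SPUD_BASE / "emtf_xml",   # EMTF-style TF XMLs
    "data": SPUD_BASE / "data_xml",   # data XMLs
}
```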

kkappler commented Sep 8, 2023

An entire first pass of this task has now run, with aurora results in reasonable agreement with SPUD in most cases.

A follow-up task is to take the six stages of testing:

  • 00_catalog_SPUD.py
  • 01_test_load_spud_tfs.py
  • 02_test_station_inventory_valid.py
  • 03_test_download_from_earthscope.py
  • 04_test_processing_with_aurora.py
  • 05_compare_tfs.py

and wrap them in a common framework. Towards this, I forked a widescale_test branch off of earthscope_tests, where each of the six steps can be wrapped as an instance of a WideScaleTest class. The idea is that each test has an output table (defined by a schema); the table is prepared as a dataframe, and dask then iterates over the rows.
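A minimal sketch of that pattern, assuming a pandas dataframe of work items and dask.delayed for the per-row fan-out, is below. The class and method names (WideScaleTest, prepare_dataframe, enrich_row) are placeholders rather than the actual widescale_test code.

```python
import dask
import pandas as pd


class WideScaleTest:
    """One stage of the wide-scale test: build an input df, emit an output table."""

    #: columns of the stage's output table (its "schema"); set by subclasses
    output_columns: list = []

    def prepare_dataframe(self) -> pd.DataFrame:
        """Build the table of work items (one row per XML, station, etc.)."""
        raise NotImplementedError

    def enrich_row(self, row: pd.Series) -> dict:
        """Process one row; return a dict matching output_columns."""
        raise NotImplementedError

    def run(self) -> pd.DataFrame:
        df = self.prepare_dataframe()
        # Fan the rows out as lazy dask tasks, then collect into the output table.
        tasks = [dask.delayed(self.enrich_row)(row) for _, row in df.iterrows()]
        results = dask.compute(*tasks)
        return pd.DataFrame(list(results), columns=self.output_columns)
```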

I am attaching previous result csvs from stages 00, 01, 02 here for comparison with the updated versions.

00_spud_xml_scrape.csv

01_spud_xml_review_2023-09-07_203451.csv

kkappler commented

All stages have successfully executed on gadi, as well as on my local machine.

If we were going to do this again, I would make the following updates:

  1. More testing of dask; it was not clear that dask was speeding things up much for stage 01
  2. Merge the h5 files, either into one archive or at least one archive per survey, rather than one mth5 per station (a sketch of an HDF5-level merge appears after this list)
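On point 2, a hedged sketch of what an HDF5-level merge could look like is below, using h5py's Group.copy to pull each station group out of its per-station file into a single per-survey file. The internal group path "Survey/Stations" is an assumption about the mth5 layout, and a merged file built this way would still lack mth5 file-level metadata, so in practice going through the MTH5 API would be the safer route.

```python
from pathlib import Path

import h5py

STATION_H5_DIR = Path("mth5_per_station")   # hypothetical directory of per-station mth5 files
MERGED_H5 = Path("survey_merged.h5")        # hypothetical merged output file

with h5py.File(MERGED_H5, "a") as merged:
    stations_grp = merged.require_group("Survey/Stations")
    for station_file in sorted(STATION_H5_DIR.glob("*.h5")):
        with h5py.File(station_file, "r") as src:
            for station_id in src["Survey/Stations"]:
                if station_id in stations_grp:
                    continue                 # station already merged
                # Group.copy brings over the whole station subtree (runs, channels, ...)
                src.copy(f"Survey/Stations/{station_id}", stations_grp)
```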

For reference, I zipped and attached the summary table csvs from gadi.

summary_tables_gadi_20230928.zip
