Merged Runs in Frequency Domain #184

kkappler · 2022-06-06T00:03:00Z

This relates to issues #80, #118, #132.

Using CAS04 as a test to demonstrate that data from multiple runs can be combined.

Tasks

codecov · 2022-06-06T00:07:59Z

Codecov Report

Merging #184 (9671db8) into main (b9f535a) will increase coverage by 0.92%.
The diff coverage is 78.58%.

@@            Coverage Diff             @@
##             main     #184      +/-   ##
==========================================
+ Coverage   69.50%   70.42%   +0.92%     
==========================================
  Files          97       98       +1     
  Lines        5289     5535     +246     
==========================================
+ Hits         3676     3898     +222     
- Misses       1613     1637      +24

Impacted Files	Coverage Δ
aurora/config/metadata/processing.py	`69.07% <ø> (-2.67%)`	⬇️
aurora/config/metadata/stations.py	`60.27% <ø> (ø)`
aurora/sandbox/debug_mt_metadata_issue_85.py	`0.00% <0.00%> (ø)`
aurora/sandbox/mth5_channel_summary_helpers.py	`0.00% <0.00%> (ø)`
aurora/transfer_function/plot/rho_plot.py	`0.00% <ø> (ø)`
aurora/transfer_function/weights/edf_weights.py	`97.22% <ø> (ø)`
aurora/pipelines/transfer_function_helpers.py	`88.50% <50.00%> (+4.59%)`	⬆️
aurora/transfer_function/kernel_dataset.py	`55.97% <55.97%> (ø)`
aurora/pipelines/process_mth5.py	`96.45% <76.92%> (-0.73%)`	⬇️
aurora/pipelines/run_summary.py	`78.84% <78.84%> (ø)`
... and 26 more

Legend:
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update b9f535a...9671db8.

While working on issue #80 and PR #184, I noticed that the processing config defaults to estimator.engine = "RME_RR". This is fine, but I need to specify "RME" explicitly when there is only one station. So a couple of fixes were added: 1. The Processing class now has a validate() method. If there is no RR station _and_ the estimator.engine is "RME_RR", it gets reset to "RME". 2. Added the ability to pass a kwarg called estimator to a ConfigCreator instance. The kwarg is a dict, and if "engine" is a key, it overwrites the estimator engine with the corresponding value. The parkfield SS run test was updated to use the config_creator method. The cas04 test is using validate(). [Issue(s): #80]
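The validation rule described above can be sketched roughly as follows. This is a minimal illustration and not aurora's implementation: `Estimator`, `Processing`, `make_processing`, the `remote_station_ids` attribute, and the station name "REMOTE" are hypothetical stand-ins chosen for this example.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Estimator:
    # Default engine is the remote-reference estimator, as described above.
    engine: str = "RME_RR"


@dataclass
class Processing:
    # Hypothetical stand-in for the processing config object.
    local_station_id: str
    remote_station_ids: List[str] = field(default_factory=list)
    estimator: Estimator = field(default_factory=Estimator)

    def validate(self) -> None:
        """Fall back to single-station RME when there is no remote reference."""
        if not self.remote_station_ids and self.estimator.engine == "RME_RR":
            self.estimator.engine = "RME"


def make_processing(
    local_station_id: str,
    remote_station_ids: Optional[List[str]] = None,
    estimator: Optional[Dict[str, str]] = None,
) -> Processing:
    """Hypothetical creator: an `estimator` dict may override the engine."""
    config = Processing(local_station_id, list(remote_station_ids or []))
    if estimator and "engine" in estimator:
        config.estimator.engine = estimator["engine"]
    return config


single = make_processing("CAS04")
single.validate()
print(single.estimator.engine)  # RME

paired = make_processing("CAS04", ["REMOTE"])
paired.validate()
print(paired.estimator.engine)  # RME_RR

explicit = make_processing("CAS04", estimator={"engine": "RME"})
print(explicit.estimator.engine)  # RME
```

Resetting the engine inside validate() keeps the default remote-reference behaviour for paired stations while sparing single-station callers from overriding it by hand.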

-Added windowing_scheme as a property of the decimation_level metadata object, and replaced initializations of WindowingScheme() in time_series_helpers with this property
-Cleaned up an errant print statement and tidied some docstrings [Issue(s): #42]

kkappler · 2022-07-03T22:13:00Z

The attached figures indicate that an integrated test, which uses the RR multirun clipping to process cas04 with aurora, gives results consistent with EMTF in both amplitude and phase, except in the very noisy band of periods shorter than ~30 s, which should be processed with coherence sorting.

The good agreement at long period suggests that the management of time intervals is being handled correctly. The casor test will be revisited in issue #31 but these results are enough to justify merging this PR.

add cas04 test outline for merged runs

ac27606

kkappler added 28 commits June 6, 2022 20:02

minor changes

c918726

added placeholder for clock zero

01e632f

change xr.diff to xr.differentiate in prewhitening

2d80928

First Implementation of clock-zero

918dd5e

Tested clock zero works when it comes from data, but only on a single run. [Issue(s): #42]

added another test to pkd for clock_zero_type= to satisfy codecov

45c5ee2

uncomment normal pkd tests

7badc8e

add some doc strings

ee45943

rename main to test to see if codecov will stop complaining

86510b7

Merge pull request #185 from simpeg/fix_issue_42

81a13c0

clock zero

Fix issue 178

11df046

Fixed a few docstrings, but main change was to review the math in squared coherence calculation. It turns out this is not as inefficient as I had thought, but it can be done a little cleaner. Removed an unneeded conjugation. [Issue(s): #78]

added a test for estimate_per_channel in config

58961aa

Replace matrix multiplication with einsum

d3227b8

See notes in issue #78 about the einsum motivation. Also set show_response_curves to false, since the CI tests were getting some matplotlib errors. [Issue(s): #78]
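The notes in issue #78 are not reproduced in this conversation, so the following is only a generic illustration of why einsum can be tidier than matrix multiplication for a per-band squared coherence. It assumes two complex arrays with observations along the first axis and frequency bands along the second; the array names, shapes, and random data are made up for this sketch and are not taken from aurora.

```python
import numpy as np

rng = np.random.default_rng(0)
n_obs, n_freq = 64, 5

# Hypothetical complex spectra: rows are observations, columns are bands.
X = rng.normal(size=(n_obs, n_freq)) + 1j * rng.normal(size=(n_obs, n_freq))
noise = rng.normal(size=(n_obs, n_freq)) + 1j * rng.normal(size=(n_obs, n_freq))
Y = X + 0.5 * noise

# Matrix-multiplication route: forms the full n_freq x n_freq product,
# then keeps only its diagonal.
xy_matmul = np.diag(X.conj().T @ Y)

# einsum route: sums over observations for each band directly.
xy = np.einsum("ij,ij->j", X.conj(), Y)
xx = np.einsum("ij,ij->j", X.conj(), X).real
yy = np.einsum("ij,ij->j", Y.conj(), Y).real

# Squared coherence per band, bounded by 1 via the Cauchy-Schwarz inequality.
coherence_squared = np.abs(xy) ** 2 / (xx * yy)
print(coherence_squared)
```

The einsum form avoids computing off-diagonal cross terms that are discarded anyway, and the index string makes the summation axis explicit.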

Merge pull request #186 from simpeg/fix_issue_78

48ab653

Fix issue 78

Debugging operate_aurora

e332774

Using the updated method in mth5 locally (see mth5 issue #105), am now able to process runs c and d for CAS04 as single station. Working on getting a similar h5 built in tests/cas04 [Issue(s): #31, #80]

Address handling of multiple stations in mth5

8ba79c3

Allow the request list to have multiple stations, and modify channel_summary_to_make_mth5 to group by (station, run) rather than just run. Add tests of making a multistation mth5 to the cas04 tests. [Issue(s): #80]
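To see why grouping by run alone breaks down once a request spans several stations, consider this small sketch. It uses plain dictionaries rather than the pandas DataFrame that a real channel summary would be, and the rows, the station name NVR08, and the `group_channels` helper are hypothetical.

```python
from collections import defaultdict

# Hypothetical channel-summary rows with only the columns needed here.
channel_summary = [
    {"station": "CAS04", "run": "a", "component": "ex"},
    {"station": "CAS04", "run": "a", "component": "hx"},
    {"station": "NVR08", "run": "a", "component": "ex"},
    {"station": "CAS04", "run": "b", "component": "ex"},
]


def group_channels(rows, keys):
    """Collect channel components under a tuple of the given key columns."""
    groups = defaultdict(list)
    for row in rows:
        groups[tuple(row[key] for key in keys)].append(row["component"])
    return dict(groups)


by_run = group_channels(channel_summary, ["run"])
by_station_run = group_channels(channel_summary, ["station", "run"])

# Run "a" exists at both stations, so grouping by run mixes their channels.
print(by_run[("a",)])          # ['ex', 'hx', 'ex']
print(sorted(by_station_run))  # [('CAS04', 'a'), ('CAS04', 'b'), ('NVR08', 'a')]
```

Run identifiers are only unique within a station, so the (station, run) pair is the natural key for a multistation request.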

Update Nomenclature for TF Kernel Dataset

4621282

-replace DatasetDefintion by Dataset and import as TFKDataset
-replace dataset_definition with tfk_dataset [Issue(s): #80, #132]

add method to look at channel summary while working on test

8f9019e

remove tab

d5d4406

suppress inf/nan in stft obj

e97b8a5

Factor method for run_summary from dataset.py, into tf_kernel/helpers.py

5df2271

cleanup doc (a bit)

fea9213

KernelDataset Introduced

8f92815

This is just a stage commit because all tests are passing currently. operate_aurora is not yet working. Need to decide where to put the RunSummary wrangling. [Issue(s): #80, #118, #132]

oops - add file

ef5ca41

bug fix for python 3.8 only

4e3ac6c

move run_summary wrangling into KernelDataset

2fdf720

towards fixing operate_aurora to use kernel_dataset

6609f56

kkappler added 19 commits June 25, 2022 08:23

update operate aurora to use KernelDataset

caae96e

Multiple runs now entered into TF XML

e362296

Add a method to KernelDataset to extract run info, looping over runs. Also, noticed that some synthetic tests were commented out, fixed this. Also, tidied some code in process_mth5. [Issue(s): #181]

Merge pull request #187 from simpeg/fix_issue_181

5bc935c

Multiple runs now entered into TF XML

remove base, add doc to dataset

cef7738

rm unused imports

9c805cc

modify test to use KernelDataset

3cf4879

remove call to extract_run_summaries_from_mth5s, replace with RunSummary

bf95472

merge run_summary helpers into run_summary module

3575381

tidy doc (a little)

ddfe5ea

move tf_kernel/dataset.py to transfer_function/kernel_dataset.py

3d4250a

move run_summary from tf_kernel to pipelines

db97e7e

factor issue out of synthetic tests to its own test

6bf76e2

Major cleanup of synthetic data make method

f324c52

Replaced dict with classes. Now have a SyntheticRun and a SyntheticStation. This will be used to create an example synthetic case with many runs [Issue(s): #80]

added multirun test to synthetic

fc3f1f3

remove duplicate definition of filters

80bb4a7

replace reference_station_id by remote_station_id so consistent language used

f42b66d

Fix duration bug & change sortby columns

a84a169

Change from timedelta.seconds to timedelta.total_seconds(). Remove run_id from sort_by; it should be only station, starttime. [Issue(s): #80]
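The duration bug can be reproduced with the standard library alone: timedelta.seconds holds only the seconds component left over after whole days are removed, while total_seconds() returns the full span. The start and end times below are made up for illustration.

```python
from datetime import datetime

# Hypothetical run that lasts 2 days and 1.5 hours.
start = datetime(2020, 6, 2, 19, 0, 0)
end = datetime(2020, 6, 4, 20, 30, 0)

duration = end - start

# .seconds ignores the two whole days and reports only the remainder.
print(duration.seconds)          # 5400
# .total_seconds() includes the days: 2 * 86400 + 5400.
print(duration.total_seconds())  # 178200.0
```

Any run longer than 24 hours is therefore under-reported by .seconds, which matters when run durations are used to select or merge time intervals.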

restrict_run_intervals_to_simultaneous

da8e175
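Only the name restrict_run_intervals_to_simultaneous appears in this conversation, not its body, so the following is a sketch of the general idea under stated assumptions rather than the committed code: clip local and remote run intervals to the time spans where both stations were recording. The helper names and the example times are hypothetical.

```python
from datetime import datetime


def overlap(a_start, a_end, b_start, b_end):
    """Return the (start, end) shared by two intervals, or None if disjoint."""
    start = max(a_start, b_start)
    end = min(a_end, b_end)
    return (start, end) if start < end else None


def simultaneous_intervals(local_runs, remote_runs):
    """Intersect every local run with every remote run and keep the overlaps."""
    clipped = []
    for local_start, local_end in local_runs:
        for remote_start, remote_end in remote_runs:
            shared = overlap(local_start, local_end, remote_start, remote_end)
            if shared is not None:
                clipped.append(shared)
    return sorted(clipped)


# Hypothetical runs: two at the local station, one long run at the remote.
local_runs = [
    (datetime(2020, 6, 1, 0), datetime(2020, 6, 1, 12)),
    (datetime(2020, 6, 2, 0), datetime(2020, 6, 2, 12)),
]
remote_runs = [(datetime(2020, 6, 1, 6), datetime(2020, 6, 2, 3))]

result = simultaneous_intervals(local_runs, remote_runs)
for start, end in result:
    print(start, "to", end)
```

With these inputs the first local run is clipped to 06:00 through 12:00 on June 1 and the second to 00:00 through 03:00 on June 2, which are the only spans usable for remote-reference processing.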

bug fix

9671db8

kkappler merged commit 27cdc59 into main Jul 3, 2022

kkappler deleted the fix_issue_80 branch July 3, 2022 22:13

kkappler restored the fix_issue_80 branch July 3, 2022 22:15

kkappler deleted the fix_issue_80 branch August 28, 2022 18:27
