
Review Role of local_run_obj in export_tf #181

Closed

kkappler opened this issue May 30, 2022 · 2 comments

kkappler (Collaborator) commented May 30, 2022

In process_mth5.py there is a note that says to do this review.

Basically, the following snippet of code is executed after the pipeline, to create the mt_metadata TF object:

        tf_cls = export_tf(
            tf_collection,
            station_metadata_dict=station_metadata.to_dict(),
            survey_dict=survey_dict
        )
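For context on what happens to tf_cls afterwards: it is an mt_metadata TF object, which can then be written to disk. A minimal, hedged sketch, assuming the write_tf_file method is available and treating the filename and file_type as placeholders:

# Hedged sketch: persist the TF returned by export_tf.
# Assumes mt_metadata's TF exposes write_tf_file; the path and
# file_type values below are illustrative placeholders.
tf_cls.write_tf_file(fn="local_station_tf.xml", file_type="emtfxml")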

The tf_collection is an aurora data structure that tracks the TF values per decimation level. These TFs can be estimated from many runs.
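To make that concrete, here is a small hypothetical sketch of the shape being described; the class and attribute names below are illustrative, not aurora's actual API:

from dataclasses import dataclass, field

# Hypothetical sketch of the structure described above; these names
# are illustrative, not aurora's actual tf_collection API.
@dataclass
class TFCollectionSketch:
    # decimation level (int) -> TF estimate for that level,
    # e.g. an array of transfer function values per frequency band
    tf_dict: dict = field(default_factory=dict)
    # ids of the runs whose data contributed to the estimates
    run_ids: list = field(default_factory=list)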

So, we need to make sure that the TF knows which runs were used to generate it.

In the current code, we are doing this:

# NOTE: only the first run in the dataset dataframe is consulted here
local_run_obj = dataset_df["run"].iloc[0]
station_metadata = local_run_obj.station_group.metadata
station_metadata._runs = []
run_metadata = local_run_obj.metadata
station_metadata.add_run(run_metadata)

This means that only the first run is being scraped for metadata here, but it looks like there is already a facility for adding the other runs' metadata.
Here is a comment from the code in this area:

# There is a container that can handle storage of multiple runs in xml, Anna made something like this.
# N.B. Currently, only the last run makes it into the tf object,
# but we can simply iterate over the run list here, getting run metadata
# station_metadata.add_run(run_metadata)

So I will try implementing this iterator.

kkappler added a commit that referenced this issue May 30, 2022
kkappler (Collaborator, Author) commented:

The code should look something like this, and be made a method of DatasetDefinition, called something like:

dataset_definition.get_station_metadata_for_tf_archive()

        # get a list of local runs
        cond1 = dataset_df["station_id"] == processing_config.stations.local.id
        sub_df = dataset_df[cond1]
        # sanity check: expect exactly one row per run
        run_ids = sub_df.run_id.unique()
        assert len(run_ids) == len(sub_df)
        # iterate over these runs, packing run metadata into station_metadata
        station_metadata = None
        for _, row in sub_df.iterrows():
            local_run_obj = row.run
            if station_metadata is None:
                station_metadata = local_run_obj.station_group.metadata
                station_metadata._runs = []
            run_metadata = local_run_obj.metadata
            station_metadata.add_run(run_metadata)

That will replace this block of code:

        station_metadata = local_run_obj.station_group.metadata
        station_metadata._runs = []
        run_metadata = local_run_obj.metadata
        station_metadata.add_run(run_metadata)

However, testing this method is premature, since we first need to add a test that processes multiple runs.
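For reference, a hedged sketch of what that method could look like on DatasetDefinition, folding the loop above into the suggested name (self.df, the processing_config attributes, and the row.run objects are assumptions carried over from the snippets above, not confirmed API):

class DatasetDefinition:
    # ... existing attributes; assume the dataset dataframe is stored as self.df ...

    def get_station_metadata_for_tf_archive(self, processing_config):
        """Pack metadata from every local run into station_metadata.

        Sketch only: self.df, processing_config.stations.local.id, and
        the row.run objects are assumptions carried over from the
        snippets above.
        """
        cond1 = self.df["station_id"] == processing_config.stations.local.id
        sub_df = self.df[cond1]

        # sanity check: expect exactly one row per run
        assert len(sub_df.run_id.unique()) == len(sub_df)

        station_metadata = None
        for _, row in sub_df.iterrows():
            local_run_obj = row.run
            if station_metadata is None:
                station_metadata = local_run_obj.station_group.metadata
                station_metadata._runs = []
            station_metadata.add_run(local_run_obj.metadata)
        return station_metadata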

kkappler added a commit that referenced this issue May 30, 2022
kkappler added a commit that referenced this issue Jun 25, 2022
Add a method to KernelDataset to extract run info, looping over runs.
Also noticed that some synthetic tests were commented out; fixed this.
Also tidied some code in process_mth5.

[Issue(s): #181]
kkappler (Collaborator, Author) commented:

PR #187 solves this issue.
