Ticket/2450/supplemental/columns #2451

danielsf · 2022-05-31T18:24:19Z

As part of the 2022 VBN release, we are adding some hand annotations not currently represented in the LIMS database to the ecephys_sessions table. Rather than update the schema of the LIMS database (a change that would have implications for all previously-collected ecephys data), we are adding functionality to the VBN metadata_writer that allows us to add columns to the ecephys_sessions table by hand. This PR encompasses that functionality. The new supplemental_columns entry in the metadata_writer schema should look something like this

  "supplemental_columns": [
    {
      "abnormal_activity": false,
      "ecephys_session_id": 1051155866
    },
    {
      "abnormal_activity": false,
      "ecephys_session_id": 1044385384
    },
    {
      "abnormal_activity": false,
      "ecephys_session_id": 1044594870
    },
    {
      "abnormal_activity": false,
      "abnormal_histology": [
        "Hippocampus"
      ],
      "ecephys_session_id": 1056495334
    }]

aamster · 2022-06-01T16:11:44Z

allensdk/brain_observatory/vbn_2022/metadata_writer/metadata_writer.py

+            supplemental_df = pd.DataFrame(
+                    data=self.args['supplemental_columns'])
+
+            columns_to_patch = []


Can't you just use pd.merge here instead of patch_df_from_other?

Even if we did just use pd.merge, I'd want to wrap it in a function that we could test to make sure that the columns we are adding get added the way we expect. patch_df_from_other is already tested. I'd rather keep this as it is.

aamster · 2022-06-01T16:12:15Z

allensdk/brain_observatory/vbn_2022/metadata_writer/schemas.py

@@ -47,6 +47,18 @@ class VBN2022MetadataWriterInputSchema(argschema.ArgSchema):
          "{ecephys_nwb_dir}/{ecephys_nwb_prefix}_{ecephys_session_id}.nwb")
    )

+    supplemental_columns = argschema.fields.List(


I think this argument would be better named as supplemental_data. supplemental_columns makes it seem like it is a list of column names.

aamster · 2022-06-01T16:15:45Z

allensdk/brain_observatory/vbn_2022/metadata_writer/schemas.py

@@ -47,6 +47,18 @@ class VBN2022MetadataWriterInputSchema(argschema.ArgSchema):
          "{ecephys_nwb_dir}/{ecephys_nwb_prefix}_{ecephys_session_id}.nwb")
    )

+    supplemental_columns = argschema.fields.List(


Should this rather be an input file? Passing a long list of dicts through the command line would be cumbersome.

I don't want to proliferate the number of input files we have to keep track of. These input.jsons are large, but they have the virtue of carrying everything we need in one package.

I also don't see users specifying this field (or, or that matter, probes_to_skip) on the command line.

I'd rather leave this as it is.

danielsf added 4 commits May 30, 2022 20:59

test that patch_dataframe_from_other will work as needed

df828f1

add supplemental columns to ecephys_sessions table

5b0e567

shorten metadata cli test

d1be377

test adding supplemental columns

6124aeb

aamster requested changes Jun 1, 2022

View reviewed changes

rename supplemental_columns -> supplemental_data

355425c

aamster approved these changes Jun 1, 2022

View reviewed changes

danielsf merged commit 03bc311 into vbn_2022_dev Jun 1, 2022

danielsf mentioned this pull request Jun 2, 2022

Add columns to VBN metadata tables #2450

Closed

danielsf deleted the ticket/2450/supplemental/columns branch June 8, 2022 16:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ticket/2450/supplemental/columns #2451

Ticket/2450/supplemental/columns #2451

danielsf commented May 31, 2022

aamster Jun 1, 2022

danielsf Jun 1, 2022

aamster Jun 1, 2022

danielsf Jun 1, 2022

aamster Jun 1, 2022

danielsf Jun 1, 2022

Ticket/2450/supplemental/columns #2451

Ticket/2450/supplemental/columns #2451

Conversation

danielsf commented May 31, 2022

aamster Jun 1, 2022

Choose a reason for hiding this comment

danielsf Jun 1, 2022

Choose a reason for hiding this comment

aamster Jun 1, 2022

Choose a reason for hiding this comment

danielsf Jun 1, 2022

Choose a reason for hiding this comment

aamster Jun 1, 2022

Choose a reason for hiding this comment

danielsf Jun 1, 2022

Choose a reason for hiding this comment