Create events_log table #174

lewismc · 2023-01-11T05:43:37Z

So far we have identified two buckets of anomalies which can occur during ingestions

metadata: when a key exists in the eTUFF global metadata but not in the metadata_types table, or
other: when some anomaly (in the past some time series were empty until they were padded with dummy values) exists within the file

In both of the above cases, each individual offense would generate an separate Slack alert. This can be noisy and overwhelming at time so it needs to be improved.

@renato2099 suggested that we create an anomaly table which would, in the instance of an anomaly` generate an entry detailing what the anomaly is. All anomalies for a given submission would be grouped and persisted for archival purposes. This allows for

A single Slack notification detailing an HTTP location URL of a single aggregated report containing one or more anomalies, and
The ability for the user to then Execute a GET on the URL to access the JSON anomaly report for a given submission

This task therefore requires that we

Design the anomaly table
Link the table to the submission.anomaly_report column which will be available post Augment submission table with ingestion task context and status #173
Augment the OpenAPI to facilitate anomaly report access via GET
Implement the logic to generate anomaly reports which covers the metadata and other buckets described above.
Integrate report alerts into Slack messaging
Tests which cover FAILED ingestion scenarios

The text was updated successfully, but these errors were encountered:

tagtuna · 2023-01-11T10:49:39Z

This captures well the flow - I would point out though, at least with our current design, we aim to utilize two different Slack channels, metadata_ops and deploy_ops, I wonder whether we should flag the anomaly reporting in similar categories, e.g., in the anomaly table, there is atype field with possible values such as "metadata", "missing entries". This value list will grow as we identify more buckets of anomalies?

lewismc · 2023-01-12T04:06:21Z

I really like the sound of that yes. I was also thinking that we could avoid the creation of a new table but add a report column to the submission table however getting data out becomes a bit more tricky because we have to use non-standard/complex data types to represent key-values e.g.
{"metadata", "This is a description of the metadata anomaly"}
... rather than explicit rows which make it really easy to query for all anomalies of a particular type for a given submission.
I think we can implement the dedicated anomaly table with the foreign key and types as you suggested. We don't need to make the anomaly type an ENUM right now.

tagtuna · 2023-01-12T04:09:43Z

I think an anomaly table is a cleaner way to organize and it's easier to use as well. So we don't have to bend ourselves to fit things into submission

lewismc · 2023-01-12T04:17:04Z

Agreed. Thanks. I'll implement.

vtsontos · 2023-01-13T20:07:45Z

HI guys,
thinking a bit more about this, I think could be good to have an "Events_Log" table that would capture the status of all key database event operations, and whether success or anomalies were encountered with whatever descriptive information can be recorded. A standardized event_status code table could be devised. See the attached table proposal with examples.
I think this approach allows us to breakdown and record outcomes for each step in the process in a consistent manner, and should be extensible to allow for additions/changes in future.
Let me know what you think..

Tagbase_EventsTableProposal.xlsx

lewismc · 2023-01-14T20:53:13Z

I like it @vtsontos I'll implement that.

lewismc · 2023-01-15T20:42:53Z

This issue now supersedes #173
Essentially the parts which can be cherry-picked are

CREATE TYPE status_enum AS ENUM ('FAILED', 'FINISHED', 'KILLED', 'MIGRATION', 'POSTMIGRATION', 'PREMIGRATION');

ALTER TABLE ONLY event_log
    ADD CONSTRAINT event_log_submission_fkey FOREIGN KEY (submission_id, tag_id) REFERENCES submission(submission_id, tag_id);

lewismc added enhancement New feature or request storage Anything tagbase-server storage/persistence related. labels Jan 11, 2023

lewismc added this to ICCAT Product Drive Phase 2 (2022-10-15 --> 2023-05-27) Jan 11, 2023

lewismc mentioned this issue Jan 11, 2023

Redesign Slack notifications #175

Open

lewismc changed the title ~~Create anomaly table~~ Create events_log table Jan 14, 2023

lewismc self-assigned this Jan 15, 2023

lewismc moved this to 🏗 In progress in ICCAT Product Drive Phase 2 (2022-10-15 --> 2023-05-27) Jan 15, 2023

lewismc added this to the 0.8.0 milestone Jan 15, 2023

This was referenced Jan 15, 2023

Augment submission table with ingestion task context and status #173

Closed

ISSUE-174 Create events_log table #181

Draft

lewismc removed this from the 0.8.0 milestone Feb 19, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Create events_log table #174

Create events_log table #174

lewismc commented Jan 11, 2023 •

edited

Loading

tagtuna commented Jan 11, 2023

lewismc commented Jan 12, 2023

tagtuna commented Jan 12, 2023

lewismc commented Jan 12, 2023

vtsontos commented Jan 13, 2023

lewismc commented Jan 14, 2023

lewismc commented Jan 15, 2023

Create events_log table #174

Create events_log table #174

Comments

lewismc commented Jan 11, 2023 • edited Loading

tagtuna commented Jan 11, 2023

lewismc commented Jan 12, 2023

tagtuna commented Jan 12, 2023

lewismc commented Jan 12, 2023

vtsontos commented Jan 13, 2023

lewismc commented Jan 14, 2023

lewismc commented Jan 15, 2023

lewismc commented Jan 11, 2023 •

edited

Loading