Add S3 SQS Data Event Notification message as metadata to records #3641
Labels
enhancement
New feature or request
plugin - source
A plugin to receive data from a service or location.
Is your feature request related to a problem? Please describe.
We have a requirement to produce reports showing the time difference between a file been actioned upon (uploaded, generally) in S3 and the time that we process it in data-prepper
This page outlines the data provided to "SQS" https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-content-structure.html
From that my understanding at present is that only the
Records.s3.bucket.name
andRecords.s3.object.key
are used and exported to the event.Ideally we would want the entire S3 data event message to be available to be used as metadata to be "attached" to the record. this would allow us to extend our requirement to possibly, say this file was uploaded by this "person/process/thing" at y time, we "recieved" it as x time, and the event had an original timestamp of z time.
So if this event ended up in Opensearch we may have an extra set of data along the lines of
With the above we would be able to work out total latency from event emitted to event ingested in opensearch in this example. Now if this time was prolonged we would hopefully be able to determine where the latency was introduced. This would help us measure our SLA's / SLO's accurately.
Please note I may have mixed terms etc, but hopefully I have got the gist across, of what we are looking for an why.
Describe the solution you'd like
We would like the S3 Data Event message to be attached as "metadata" to the records processed in a S3 sourced file
Describe alternatives you've considered (Optional)
We would need to build some solution, to read our final output, go search the S3 bucket, get the obect time and then update the record in the final output
Additional context
Off the back of this very brief discussion here #3626
The text was updated successfully, but these errors were encountered: