Ecarton/cumulus 3751 s3 task #3910

Open · wants to merge 409 commits into base: feature/CUMULUS-3751
Conversation

etcart (Contributor) commented Jan 28, 2025

Summary: CUMULUS-3751, just the S3 copy part
Addresses CUMULUS-3751: Move granules across collections

Changes

  • Task for copying granules' S3 files from one collection to another
  • Workflow and integration test for this S3 file copy, intended to be expanded to cover the rest of the move workflow as the remaining pieces are ready

PR Checklist

  • Update CHANGELOG
  • Unit tests
  • Ad-hoc testing - Deploy changes and test manually
  • Integration tests

etcart and others added 30 commits December 31, 2024 11:18
* Jk/cumulus 3940 18.5.x (#3877)

* Update message recovery/granule write logic to properly use esClient

This commit updates the following:

- esClient is properly passed through api lambda/lib methods such that
  write granules calls from process-s3-dead-letter-archive can pass in
  an instance of EsClient rather than relying on default
  per-granule object/client behavior
- The API endpoint and related code are updated such that maxDbPool,
  concurrency, and batchSize are exposed as endpoint options, allowing
  user customization of tuning behavior for the DLA recovery tool (see
  the schema sketch after this commit list)
- Minor typing/call fixes

* Update Core to allow DLA recovery configuration

This commit updates:

- archive/cumulus/example to pass through memory
  configuration options to the Fargate task definition

* Add api performance test

* Update docs/changelog

* Update CHANGELOG and documentation

* Update CHANGELOG

* Fix linting

* Fix units

* Update dead letter archive feature doc

* Update test spec

* Update logging, make perf test script executable

* Fix broken package.json ava exclusion configuration

* Add zod parsing to dead letter endpoint

* Update tf-modules/archive/async_operation.tf

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>

* Update tf-modules/archive/async_operation.tf

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>

* Address db pool configuration concern in PR

* Update env config passthroughs/make log/docs consistent

* Update tf-modules/archive/async_operation.tf

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>

* Update tf-modules/archive/async_operation.tf

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>

* Update per PR suggestion

* Update concurrency defaults for consistency

* Update startAsyncOperations to allow for optional container names

* Update dead letter archive endpoint to specify new container name

* Update API defaults/units to 30 to match system defaults

* Fix defaults for endpoint tests

* Add changed params to demonstrate payload handling

* Update coverage metric

Updated code in this module doesn't significantly impact test
coverage, other than increasing the denominator.

* fixup

* Update performance tests to match documented defaults

---------

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>

* Update docs to add variables.tf link to default values for new config options

* Minor/formatting edit

* Fix bad merge/remove invalid jsdoc param

* Minor edit/add space to variables file

---------

Co-authored-by: jennyhliu <34660846+jennyhliu@users.noreply.github.com>
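The maxDbPool/concurrency/batchSize options and the zod parsing added in the commits above suggest the endpoint validates its tuning payload up front. A minimal sketch of what that validation could look like, assuming the field names from the commit messages and the default of 30 noted above; the actual schema in the diff may differ:

```ts
import { z } from 'zod';

// Hedged sketch: tuning options for the DLA recovery endpoint, named after
// the options exposed in the commits above. Optionality and the default of
// 30 are assumptions drawn from the commit notes, not from the diff.
const dlaRecoveryOptionsSchema = z.object({
  maxDbPool: z.number().int().positive().optional(),
  concurrency: z.number().int().positive().default(30),
  batchSize: z.number().int().positive().optional(),
});

type DlaRecoveryOptions = z.infer<typeof dlaRecoveryOptionsSchema>;

// An endpoint handler can reject malformed payloads before starting the
// async operation; zod's parse throws on invalid input.
const parseRecoveryPayload = (body: unknown): DlaRecoveryOptions =>
  dlaRecoveryOptionsSchema.parse(body);
```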
cmrObjects: { [granuleId: string]: Object },
s3MultipartChunksizeMb?: number,
}): Promise<void> {
const sourceGranulesById = keyBy(sourceGranules, 'granuleId');
Member:
One future-proofing thought: the duplicate-granule work will very likely result in granules that are not unique by granuleId. Our datastore already doesn't enforce that uniqueness, only our API and ingest code do. This task is fine in context, but we should be careful in the rest of the PR that we're not burying a related concern.

Contributor Author:
How else might a granule be uniquely identified in order to sync them? That will certainly be necessary in the duplicate-granule work. granuleId + collection is the obvious way, but it won't work here.
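For illustration, the composite identity under discussion might look like the following (hypothetical helper, not part of the PR); the catch, as noted above, is that the collection half of the key is exactly what changes during a move:

```ts
// Hypothetical sketch of the granuleId + collection composite key discussed
// above. Unique in the datastore, but the collectionId changes mid-move, so
// it can't correlate source and target granules in this task.
interface GranuleRef {
  granuleId: string;
  collectionId: string; // e.g. 'MOD09GQ___006' (illustrative format)
}

const compositeKey = (g: GranuleRef): string =>
  `${g.granuleId}__${g.collectionId}`;
```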

t.is(updated.Granule.Collection.VersionId, 'b');
});

test('updateCmrFileCollections updates Echo10Files when missing', (t) => {
Member:
Change test title

Member:
Also, if there's a granule flag but no collection, it's still writing that, right? Add test coverage if so.

});
});

test('updateCmrFileCollections updates umm meta file', (t) => {
Member:
It's not really updating a meta file; it's updating the passed-in CMR meta object. Update the test title.
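To make the distinction concrete, here is a sketch of the behavior being described, with the metadata shape inferred from the test assertion above (hypothetical types, not the task's actual implementation):

```ts
// Inferred Echo10 metadata shape, based on the assertion
// `updated.Granule.Collection.VersionId` in the test above.
type Echo10Metadata = {
  Granule: {
    Collection: { ShortName: string; VersionId: string };
  };
};

// The function operates on the passed-in CMR metadata object and returns the
// updated object; no file is written at this point.
const updateCollectionRef = (
  meta: Echo10Metadata,
  target: { ShortName: string; VersionId: string }
): Echo10Metadata => ({
  ...meta,
  Granule: { ...meta.Granule, Collection: { ...target } },
});
```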

}
}
},
"oldGranules": {
Member:
Task: double check this matches before merging

} from '@cumulus/cmrjs';
import { runCumulusTask } from '@cumulus/cumulus-message-adapter-js';
import { s3 } from '@cumulus/aws-client/services';
import { BucketsConfig } from '@cumulus/common';
Member:
Fix @cumulus/common multi-import

Contributor Author:
done (18176c7)

Bucket: targetFile.bucket,
Key: targetFile.key,
}),
{ retries: 5, minTimeout: 2000, maxTimeout: 2000 }
Member (@Jkovarik, Feb 14, 2025):
I think we need to add logging to these retries, e.g. https://github.com/sindresorhus/p-retry
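A minimal sketch of what that logging could look like with p-retry's onFailedAttempt hook, reusing the retry settings from the diff; the logger setup and the wrapped call are stand-ins, not the PR's actual code:

```ts
import pRetry from 'p-retry';
import Logger from '@cumulus/logger';

// Hypothetical sender name; `operation` stands in for the S3 call retried
// in the diff above.
const log = new Logger({ sender: '@cumulus/copy-granules-task' });

const withRetryLogging = <T>(operation: () => Promise<T>): Promise<T> =>
  pRetry(operation, {
    retries: 5,
    minTimeout: 2000,
    maxTimeout: 2000,
    // p-retry calls this after each failed attempt, before the next retry.
    onFailedAttempt: (error) => {
      log.warn(
        `Attempt ${error.attemptNumber} failed; ${error.retriesLeft} retries left: ${error.message}`
      );
    },
  });
```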

Member:
Nit: consider making the retry count and timing deploy-time configurable.

etcart and others added 5 commits February 14, 2025 16:20
Co-authored-by: Jonathan Kovarik <Jkovarik@users.noreply.github.com>
* Add pRetry logging to all pRetry calls

* Add optional chaining to logstrings