Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

As a Node Operator, I want to specify an alternate file paths for 1 or more archive products #123

Open
jordanpadams opened this issue May 19, 2023 · 0 comments

Comments

@jordanpadams
Copy link
Member

jordanpadams commented May 19, 2023

Checked for duplicates

Yes - I've already checked

πŸ§‘β€πŸ”¬ User Persona(s)

Node Operator, Data User

πŸ’ͺ Motivation

...so that I can load data into the registry where we compressed the archive product and removed the uncompressed version from the filesystem prior to harvesting the metadata

πŸ“– Additional Details

No response

Acceptance Criteria

Given
When I perform
Then I expect

βš™οΈ Engineering Details

Proposed design:

  1. New β€œ--compressed-mapping” argument to Harvest - CSV table that includes LIDVID,compressed_filename

  2. Harvest will attempt to load all products in bundle/directory like normal

  3. If product LIDVID == LIDVID from CSV table
    a. Add additional step to read the compressed_filename and add the following metadata to the Registry:

             "ops:Data_File_Compressed_Info/ops:creation_date_time": { "type": "date" },
             "ops:Data_File_Compressed_Info/ops:file_ref": { "type": "keyword" },
             "ops:Data_File_Compressed_Info/ops:file_name": { "type": "keyword" },
             "ops:Data_File_Compressed_Info/ops:file_size": { "type": "long" },
             "ops:Data_File_Compressed_Info/ops:md5_checksum": { "type": "keyword" },
             "ops:Data_File_Compressed_Info/ops:mime_type": { "type": "keyword" },
    

    b. If //File/file_name not found, decompress compressed_filename into temp space, get the applicable ops:Data_File_Info metadata, and cleanup

    • Note: This should support .zip, .tar.gz, .gz, and .fz

Some questions:

  • Should this be separate ops:Data_File_Compressed_Info class or under ops:Data_File_Info as something like ops:Data_File_Info/ops:compressed_file_ref?
@jordanpadams jordanpadams self-assigned this May 19, 2023
@jordanpadams jordanpadams changed the title As a Node Operator, I want to specify an alternate compressed filename for 1 or more archive products As a Node Operator, I want to specify an alternate file paths for 1 or more archive products Sep 22, 2023
@github-project-automation github-project-automation bot moved this to Release Backlog in B14.1 Sep 22, 2023
@github-project-automation github-project-automation bot moved this to Release Backlog in EN Portfolio Backlog Nov 20, 2023
@github-project-automation github-project-automation bot moved this to Release Backlog in B15.0 Apr 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: Release Backlog
Status: ToDo
Status: Release Backlog
Development

No branches or pull requests

1 participant