-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Scalable Harvest does not replace file paths with the appropriate URL prefix #64
Scalable Harvest does not replace file paths with the appropriate URL prefix #64
Comments
When would an ETA be known for this fix? Can I get an order of magnitude? (days, weeks, months)? Pretty sure this takes the registry out of commission for us, except for the smaller bundles, until a fix comes in. |
Hi @mdrum , I did not look at the bug yet, but I am not expecting something hard to resolve. We are also willing to provide you with a script to bulk update the dat which would have been previously loaded by you. Thanks |
That would be great, thanks @tloubrieu-jpl! |
Hi @mdrum , I looked at this issue and from what I am seeing (this is not my code), the configuration of the job is read in a different way in the standalone harvest and the scalable havest and the This is a bug and we will add that feature, but since:
That will take a bit longer than expected. You can use standalone harvest in the meantime, sorry about the frustration of having a brand new tool but missing a critical feature.... I'll give you a new ETA by the end of day. |
🐛 Describe the bug
Using the Scalable Harvest suite of tools, the objects eventually uploaded to the registry do not conform to the replacePrefix directives in the harvest config file. Therefore, the fields that appear in the registry map to the file paths instead of their reachable URLs. I tested the exact same config file on standalone harvest, and the paths were updated correctly.
Resultant example:
📜 To Reproduce
./registry-harvest-cli-1.0.0/bin/harvest-client harvest -j ./harvest-test.xml
with archive bundles at the path specified (/dsk1/www/archive/pds4/non_mission/)🕵️ Expected behavior
Expect fields to look like this:
📚 Version of Software Used
registry-crawler-service-1.0.0
registry-harvest-cli-1.0.0
registry-harvest-service-1.0.0
registry-manager-4.4.0
🩺 Test Data / Additional context
Attached the config used for testing both standalone and scalable harvest
harvest-text.xml.txt
🏞Screenshots
🖥 System Info
🦄 Related requirements
⚙️ Engineering Details
The text was updated successfully, but these errors were encountered: