-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
component loaders (and data entry) #2974
Comments
I'm up for trying this method. Anything we can do to simplify and make the process consistent across tools would be nice. |
The basics of this are running in test with bulkload identifications. I think its worked out even better than anticipated, but timely feedback would be appreciated.
The form is limited to manage_collection in order to safely (I hope!) accommodate this, and there's a new "shares collection" function which DOES NOT exclude users with locked accounts (so you can load things created by former techs & etc.).
This is implemented and tested, needs propagated to all other loaders "validate" is part of the load process; there's no pre-validation. (Having this as a separate step has been a source of confusion for some time, this process facilitates a much simpler go/nogo approach.) Todo, pending nobody finding a reason to go in a different direction:
|
Now I gotta dig up some stuff to load.... |
/remind me to work on this tomorrow |
@Jegelewicz set a reminder for Jul 31st 2020 |
Cool. I will try too.
…On Thu, Jul 30, 2020, 2:25 PM reminders[bot] ***@***.***> wrote:
* [EXTERNAL]*
@Jegelewicz <https://github.com/Jegelewicz> set a reminder for *Jul 31st
2020*
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#2974 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBHA34YUBTKIKFLHGHDR6HJKJANCNFSM4PL36M5A>
.
|
👋 @Jegelewicz, work on this |
Another major point for this model: it makes replication easy, there's now a testable locality-loader. I'll stop until I get some feedback, I don't want to replicate any problems. The loader-scripts aren't scheduled, you can just open http://test.arctos.database.museum/ScheduledTasks/component_loader.cfm to process from the two new loaders. |
Yea it's not very interactive - check back with the data, should be different. https://github.com/ArctosDB/internal/issues/65
Sorry, I broke it! |
OK, one more observation, when stuff won't load, it would help to get the error along with the csv when you download to fix stuff. So I was able to load 10 localities - none had coordinates - I'll see if I can find a couple that do to try. Also http://test.arctos.database.museum/ScheduledTasks/component_loader.cfm to process from the two new loaders. Needs to have some kind of interactivity...once you go there, you don't get out and we need people to understand that they have accomplished something. Assuming this will be true in production. |
wilco
That's just test - it'll be on the scheduler in production, loading (or errors) will just happen (including for any number of records). |
Clarification - So when I load a file directly to the tool, if stuff passes all the triggers, does it just load or will it always show up in the "manage" page first. Don't know why I can't decide what happens.... |
You can load with status, and if you load with it as "autoload" then Arctos will take care of the rest (or make errors). If you follow the instructions and load from a fresh template then you'd need to set status (which gives you an opportunity to notice that you've just loaded 4582 duplicates...). How that's implemented and documented is a little waffly at the moment, but the potential for "stuff just happens" exists. |
This is in prod, need to integrate eg #2967 (comment) and rebuild all component-loaders under this umbrella. Dropping priority. |
Need to check throttle; currently set for 10 records per run, can be upped significantly but needs monitored as things are added. |
Hey Arctos - Reminder to try and test this by next Thursday! |
See #3300 - make sure status (which can be errors) is urlencoded when necessary |
This has served its purpose, there's a template, it's awesome, closing. |
The next two tasks on my list (#2556, #2442) rely on this template. I can't seem to reconcile #3413 and the related AWG discussion. Do we love this or hate it? Can I keep building these things or do we need more discussion? Do I need to change something going forward? Do I need to change something with the ~dozen loaders I've already built under this model? |
I think the tool is fine. It is just the related "documentation" that needs update, but others should weigh in. |
I think we could just add to the documentation, and I also suggested that
rather than change the text on all the different component loaders, we
change the main bulkloader to use the same terminology, e.g. mark to
autoload vs mark to load etc - with a few sentences explanation on that
form only.
…On Tue, Feb 16, 2021 at 1:51 PM Teresa Mayfield-Meyer < ***@***.***> wrote:
* [EXTERNAL]*
I think the tool is fine. It is just the related "documentation" that
needs update, but others should weigh in.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#2974 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/ADQ7JBEV7MQA2RFSXEBCW53S7LLDZANCNFSM4PL36M5A>
.
|
If you mean the catalog record bulkloader, these are fundamentally different tools. The catalog record bulkloader is an independent tool - things in it load or error, that's it. "Component loaders" can have dependencies - things can hang around with 'autoload: ....' for weeks, then be processed after related data becomes available. That is, component loaders have three exits:
I would be in favor of changing the actionable value of |
Maybe we need to think about some way to let people jump to a specific set of data in tools like https://arctos.database.museum/tools/BulkloadOtherId.cfm Currently there is an extra-long list of errors in there and if my username was after this person alphabetically, I'd have to scroll forever to get to my stuff. Maybe just a table at the top that lists the usernames and lets you jump to a specific user's stuff? |
See https://github.com/ArctosDB/data-migration/issues/450#issuecomment-784555912 - verbose errors 400 lucee, need to POST or truncate errors or something. Untested workaround: filter only on username, change status to something shorter. |
Thanks, that worked for my stuff... |
v1.1: csv download should include this to strip unnecessary columns
|
Moved unfulfilled requests to #3463, closing (again). |
The "data entry extras" functionality isn't as good as it could be, loading large batches of various components (identifiers, parts, identifications, etc.) causes timeouts then problems/confusion, the "claim" process for managing data entered via 'data entry extras' causes problems, etc. Let's fix it.
Very tentative suggestions, which may or may not hold up to reality:
status
in "not-yours" records (which would necessarily come with access to records created by users with whom you share collections)A normal load would then be
"Approving" records loaded by you or your students/techs/associates, via any process including data entry extras, would be
I think that would be a significant simplification in both the code and the user experience. "Manage your..." might come with a "pick users" option (a slight increase in complexity), but most of the rest of the complexity (claim, find guid, etc.) that's been introduced for various reasons could be removed.
This has some urgency, I'd like to use #727 as a proof of concept, so I'm adding scary labels and will interpret a lack of immediate objections as enthusiastic approval.
The text was updated successfully, but these errors were encountered: