-
-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Delay in viewing specimen records after entry #6705
Comments
Friday Sep 1 after Arctos came back on line, I added a new identification to 4 specimens but searching now I am not finding them - with many different search strategies to find them. It's as if the IDs I added vanished. |
One has appeared now. Probably rookie question on my part @dustymc - but can you help me understand what happened? Just a general backlog, or some error in data entry that slowed it down? The reason being I was hoping to enter a mineral a day for the "Mineral Cup" 2023 bracket! My fossil records all appeared pretty quickly. But if I can't link to the mineral record on the same day I'll abandon that ship and focus on the real work (random data entry by bracket not exactly the most efficient approach anyway) |
@ufarrell I'm not sure how to say it better than what's on the top of the page, but clearly someone needs to! I have limited resources so I have to pay the costs up front (when something changes, rather than when something's requested) by caching the core of the complex data into a simpler (so cheaper to query) structure. 90-some-odd-something percent of the time, that happens within a minute. When someone makes some huge change (or occasionally when there's some sort of problem) it can take days to catch up. I'm not sure if this is related or not, but around this time I noticed that someone messed with https://arctos.database.museum/name/unidentifiable#Arctos which set a millionish records to update. If anything ever happens on #6111 it'll take several days to catch up. Etc. I usually have some flexibility in my side of that, and I can prioritize one (or a dozen, or MAYBE a hundred) records ahead of some huge mass by request. @ArctosDB/documentation (I think maybe that's not the current project - help @ebraker @Jegelewicz !) - help making the blurb at the top of https://arctos.database.museum/info/flat_status.cfm better (or something to use as pagehelp or whatever) please? |
Thanks Dusty, I get it now. Yes, I think a definition of status would be good - 'status X = in progress' Seems obvious now, but when I returned to this I had a vague memory of a third status, but couldn't remember what it said, and I wasn't sure if there was some error indicated by the error-in-processing that maybe I needed to go away and find. Also had a brief moment of wondering if "current" meant "currently in progress". If possible, maybe keeping that third status visible, even when NumberRecords = 0 would be informative as well |
That's helpful, but the next step for me is what are the errors? Are they mine? what should I be doing? Any way to give us that? |
If you can do anything about them then you'll have no problem finding them.... Just ping me. I try to keep on top of errors, it doesn't always work. (That one was what almost all of them are, someone swaparooing a GUID around. I don't have a technical and sustainable solution for that and the situation would not be allowed if I had my way so I just cringe and force-refresh.) |
What does that mean exactly? |
Can we get some description on the status page as to what current, flat, filtered flat etc mean, and what it means if there are large vs small numbers there? It still isn't obvious what numbers in those various columns mean in terms of potential wait times. |
The form help text at the top of the page is better (except for a typo)
Only comment is to fix typo ( |
Commenting here because this Issue is linked from flat status. Someone (or some intersection, I'm not sure) loaded a bunch of 'no data' records, which resulted in a very large locality (and maybe event) merge, after which tens (hundreds?) of thousands of records need to refresh. This resulted in
How can we better communicate any of this? |
Addressing @AdrienneRaniszewski comment from #7946 (comment) here in an attempt to centralize information.
Can someone please help with appropriate text?
I don't have that information. This is often caused by incoming changes - the count is going up and down at the same time, occasionally from multiple sources. The rate of change also influences both the rate at which other things can change, and the rate at which refreshes can be processed.
I don't have that information. The processor just uses what resources it can, and the data across records is wildly different. I could make it more predictable, but I'd have to aim that at the worst case scenario. Wild guess, that's maybe 500 records per minute, and for comparison it's running at around 3000 right now. I've done everything I can to make things more efficient, and part of the cost is a lack of transparency.
Absolutely no idea. It tries for first in / first out, but that's not very informative.
Very open to better ideas, no matter how radical. (Not allowing public access to the primary DB might mean - after a LOT of code rewrites - that we don't need a cache, or we could potentially try to make updates less often in larger chunks, or ????????? Or throwing massive amounts of hardware at the current environment is usually the cheapest option, if someone wants to tackle that.)
I think still something well over 90% of the time the cache is less than a minute stale. I understand that's not very comforting when you're waiting for something to happen. |
We hear this over and over that the cause of so many problems is "limited resources". What do we do to fix this? What "resources" do we need to acquire? What grant do we need to write or what hardware or cloud storage do we need to pay for? Do we have a list of this somewhere? |
https://github.com/ArctosDB/internal/issues/330
https://github.com/ArctosDB/PG/issues/30 and from #7946 (comment):
Also #7953 from the same comment. |
Issue Documentation is http://handbook.arctosdb.org/how_to/How-to-Use-Issues-in-Arctos.html
Describe the bug
I entered two individual mineral specimens (catalogue numbers 1575 and 1595) into the Trinity College Dublin Mineral collection (guid_prefix: TCDGM:Mineral). They both made it through 'autoload_core' but then went missing.
To Reproduce
Steps to reproduce the behavior:
Expected behavior
The specimen is entered and appears in searches
Screenshots
This is a screenshot of what I get when I search for the record by catalogue number. The blue arrow goes to this address: 'https://arctos.database.museum/guid' (i.e. missing the actual guid) before redirecting to the search page. If I search by the collection 'TCDGM:Mineral' I get no results.
Data
N/A
Desktop (please complete the following information):
Additional context
Priority
The text was updated successfully, but these errors were encountered: