
edu.harvard.iq.dataverse.api.FilesIT.testForceReplaceAndUpdate Test Failing #6846

Closed
djbrooke opened this issue Apr 21, 2020 · 18 comments · Fixed by #8858

@djbrooke (Contributor):

Expected status code <200> doesn't match actual status code <500>.

See build 425 on dataverse.org Jenkins for more info.

@pdurbin pdurbin self-assigned this Apr 22, 2020
@pdurbin (Member) commented Apr 22, 2020

At first I could reproduce the FilesIT.testForceReplaceAndUpdate failure locally, but the error went away after I doubled the following JVM options:

-XX:MaxMetaspaceSize=512m
-XX:MetaspaceSize=256m

(I did this because I was getting java.lang.OutOfMemoryError: Metaspace in my logs when running even a single API test.)
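For reference, a sketch of how the doubled values could be applied to a Payara/Glassfish domain via asadmin (the 1024m/512m values are assumed to be the doubled ones implied above; the colon in -XX options must be escaped for asadmin):

```shell
# Sketch: double the Metaspace sizing on a running domain (assumed defaults).
# Remove the old values first, then set the doubled ones.
./asadmin delete-jvm-options "-XX\:MaxMetaspaceSize=512m"
./asadmin delete-jvm-options "-XX\:MetaspaceSize=256m"
./asadmin create-jvm-options "-XX\:MaxMetaspaceSize=1024m"
./asadmin create-jvm-options "-XX\:MetaspaceSize=512m"
```

A restart of the domain is needed for the new options to take effect.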

Now I can't reproduce the original errors, but when I run the whole API test suite (which I couldn't do with Glassfish 4.1 without it falling over), I get a seemingly random assortment of failing tests, just like we're seeing on Jenkins.

When I look at my logs, I see many cases of org.postgresql.util.PSQLException: ERROR: deadlock detected which we discussed but never fixed in #2460. I think these deadlocks should be discussed in tech hours.

I'm not sure what to do with this issue.

@pdurbin pdurbin removed their assignment Apr 22, 2020
@djbrooke (Contributor, author) commented Apr 22, 2020

Thanks @pdurbin for the details.

@scolapasta can you get automated testing on the docket for a future tech hours? We've been picking off the failing tests one by one, but I saw on some recent runs that 4 new ones were failing so I'm not sure if we should re-evaluate. Generally I'm happy to spend sprint time in the automated test area. :)

@poikilotherm (Contributor) commented Apr 23, 2020

About MetaspaceSize and MaxMetaspaceSize: I stumbled over https://dzone.com/articles/permgen-and-metaspace, and this passage caught my attention:

Whenever there is a need to resize PermGen/Metaspace, the JVM will do it as it does with the standard heap. Resizing those spaces requires a full GC, which is always an expensive operation. It can usually be observed during startup, when a lot of classes are being loaded, especially if the application depends on many external libraries. If there are a lot of full GCs during startup, it's usually because of that. In that case, increasing the initial size can boost startup performance.

Has anyone taken a look at this during startup? While playing with container memory limits, I noticed memory use rising and then dropping suddenly a few times. I wonder whether this is related to GC, and whether the metaspace contributes to our very long deploy times, on top of the massive number of beans we are loading...

@donsizemore (Contributor) commented:

@poikilotherm I sent the Metaspace settings along, together with https://blog.payara.fish/fine-tuning-payara-server-5-in-production, to be rolled in as part of the switch to Payara 5:

./asadmin $ASADMIN_OPTS create-jvm-options "-XX\:MaxMetaspaceSize=512m"

While tinkering with Prometheus I did see a series of fairly steep memory reclamations during garbage collection; I can try again with some test Metaspace settings during deployment.

@djbrooke (Contributor, author) commented May 20, 2020

@pdurbin (Member) commented Jul 28, 2020

@djbrooke @scolapasta now that #6865 is closed (🎉 ) should we close this issue as well?

@scolapasta (Contributor) commented:

I am not sure if this was caused by deadlocks, but more importantly, the past two Jenkins builds (I'm not including the one that couldn't connect) passed. So unless we see this occur again, yes, we can close.

@djbrooke (Contributor, author) commented:

@pdurbin Sure, we can reopen if we start to see consistent failures from this test!

@djbrooke (Contributor, author) commented:

@scolapasta 👍

@scolapasta (Contributor) commented:

@djbrooke beat me to it, while I was typing my comment! (but I got the comment in first at least :))

@pdurbin (Member) commented Mar 3, 2022

We just noticed another case of testForceReplaceAndUpdate failing at https://jenkins.dataverse.org/blue/organizations/jenkins/IQSS-Dataverse-Develop-PR/detail/PR-8440/2/tests for pull request #8440.

@pdurbin (Member) commented Mar 15, 2022

Another case of testForceReplaceAndUpdate failing at https://jenkins.dataverse.org/blue/organizations/jenkins/IQSS-Dataverse-Develop-PR/detail/PR-8486/2/tests for PR #8486.

@pdurbin (Member) commented May 5, 2022

testForceReplaceAndUpdate just failed at https://jenkins.dataverse.org/blue/organizations/jenkins/IQSS-Dataverse-Develop-PR/detail/PR-8624/4/tests for #8624.

That's it. Time to reopen this issue. 😄

@donsizemore (Contributor) commented:

testForceReplaceAndUpdate failure seen again at https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/view/change-requests/job/PR-8689/1/consoleFull

The Payara log doesn't seem particularly helpful, sorry.

@pdurbin (Member) commented May 24, 2022

https://jenkins.dataverse.org/job/IQSS-Dataverse-Develop-PR/view/change-requests/job/PR-8689/1/testReport/edu.harvard.iq.dataverse.api/FilesIT/testForceReplaceAndUpdate/ is showing a 500 error on FilesIT.testForceReplaceAndUpdate(FilesIT.java:668)

That's this line:

assertEquals(OK.getStatusCode(), updateMetadataResponse.getStatusCode());

From server.log.txt we see that the metadata of the file couldn't be edited due to a dataset lock:

[#|2022-05-11T13:30:47.039+0000|WARNING|Payara 5.2021.6|edu.harvard.iq.dataverse.api.Files|_ThreadID=69;_ThreadName=http-thread-pool::http-listener-1(2);_TimeMillis=1652275847039;_LevelValue=900;|
Dataset publication finalization: exception while exporting:{0}
edu.harvard.iq.dataverse.api.AbstractApiBean$WrappedResponse: edu.harvard.iq.dataverse.engine.command.exception.IllegalCommandException: Dataset cannot be edited due to dataset lock.
at edu.harvard.iq.dataverse.api.AbstractApiBean.execCommand(AbstractApiBean.java:633)
at edu.harvard.iq.dataverse.api.Files.updateFileMetadata(Files.java:423)

So, I think we just need to add our normal UtilIT.sleepForLock in the test code just before the failing line.
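The real helper lives in Dataverse's UtilIT, and its exact signature isn't shown in this thread, so here is a minimal, self-contained sketch of the poll-until-unlocked pattern it implements; the BooleanSupplier is a hypothetical stand-in for the API call that checks the dataset's locks:

```java
import java.util.function.BooleanSupplier;

public class SleepForLockSketch {

    // Poll once per second until the lock check reports "no lock" or the
    // timeout elapses. Returns true if the lock cleared within maxSeconds.
    static boolean sleepForLock(BooleanSupplier isLocked, int maxSeconds)
            throws InterruptedException {
        for (int i = 0; i < maxSeconds; i++) {
            if (!isLocked.getAsBoolean()) {
                return true;
            }
            Thread.sleep(1000);
        }
        return !isLocked.getAsBoolean();
    }

    public static void main(String[] args) throws InterruptedException {
        // Simulated lock that clears after two checks (stand-in for the
        // finalize-publication lock the test is racing against).
        int[] checksLeft = {2};
        boolean cleared = sleepForLock(() -> --checksLeft[0] >= 0, 5);
        System.out.println("lock cleared: " + cleared);
        // prints "lock cleared: true"
    }
}
```

In the test itself, the idea is to call the helper just before the failing assertEquals so the update-metadata request isn't sent while the publication-finalization lock is still held.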

@mreekie commented May 25, 2022

sprint

  • sized small

@mreekie commented Jun 8, 2022

Sprint:

  • pm.sprint.2022_05_25 ended OnDeck

@pdurbin pdurbin self-assigned this Jul 25, 2022
@pdurbin pdurbin removed their assignment Jul 25, 2022
kcondon added a commit that referenced this issue Jul 27, 2022
add sleep in FilesIT, clean up assertions #6846
@pdurbin pdurbin added this to the 5.12 milestone Aug 2, 2022