Ascent appears to cause a segfault #3873
Comments
Is this running on GPU? Some Ascent actions (data binning, in particular) assume that GPU memory is accessible from the CPU (as is the case on Summit, Frontier, and other unified memory systems). See: Alpine-DAV/ascent#1122.
@cyrush Cyrus, do you have any suggestions?
I think I misunderstood your original issue. Your issue actually seems very similar to a different issue I reported here: #2994. I never fully understood why that happened, but it was somehow (?) caused by an unrelated global variable.
I cannot reproduce the issue with the heat equation test. This is what I did.
I also tried
@robertsawko do you have any custom classes like those mentioned in #2994? If it's a C++ static init and finalize class issue, those are very hard to reason about. (The order in which things are deallocated is not guaranteed.) I will look into
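To illustrate the static init and finalize concern raised above, here is a minimal C++ sketch. It is not taken from AMReX, Ascent, or the reporter's code: `runtime`, `Holder`, `events`, and `simulate` are hypothetical names, and nested scopes stand in for real global objects so that the ordering can be observed inside one function. The point is only that an object which outlives an explicit finalize call still runs its destructor afterwards. In a real program that destructor might touch state the library has already freed, which is undefined behaviour and one possible source of a crash at exit.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a library with explicit initialize/finalize calls.
namespace runtime {
    inline bool& alive() { static bool flag = false; return flag; }
    inline void initialize() { alive() = true; }
    inline void finalize() { alive() = false; }
}

// Event log. Leaked on purpose so it is never destroyed before its users.
inline std::vector<std::string>& events() {
    static auto* log = new std::vector<std::string>();
    return *log;
}

// Hypothetical resource holder whose destructor depends on the runtime.
struct Holder {
    const char* name;
    explicit Holder(const char* n) : name(n) {}
    ~Holder() {
        if (runtime::alive()) {
            events().push_back(std::string(name) + ": released cleanly");
        } else {
            // A real destructor would be touching finalized library state here.
            events().push_back(std::string(name) + ": destroyed after finalize");
        }
    }
};

// Simulates a program lifetime. 'late' plays the role of a global object
// that is still alive when finalize() is called.
std::vector<std::string> simulate() {
    events().clear();
    runtime::initialize();
    {
        Holder late("global-like");
        {
            Holder scoped("scoped");
        }  // 'scoped' is destroyed here, while the runtime is still alive
        runtime::finalize();
    }  // 'late' is destroyed here, after finalize()
    return events();
}
```

Calling `simulate()` records one event per destructor, in the order the destructors ran. If the real code has a global or static object holding library resources, limiting its lifetime to a scope that ends before the finalize call is one way to rule this pattern out.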
I am really sorry for taking so long to respond. Our HPC system has serious I/O issues (which is why working on in situ is even more relevant!), and it is currently unusable. If it is still like this next week, I will reproduce the environment locally and retry.
@robertsawko -- Closing this issue due to lack of activity. Please re-open it if there is still a problem.
Hello,
Together with two colleagues, I have been using AMReX's built-in Ascent integration. We religiously followed the three Blueprint tutorials and implemented a function which uses our finest mesh to produce a mesh blueprint with SingleLevelToBlueprint and passes it to Ascent with some actions to execute. The simulations run okay and generate images as expected, but end with a segfault:
This is strange to me in many ways. Firstly, it looks like AMReX actually finalizes fine. I am used to segfaults being quite fatal to running programs.
The problem happens both in parallel and in serial. I cannot reproduce it with the heat equation tutorial, but one of my colleagues reports that he saw something like this in the 2D heat equation too.
The code itself is not even very interesting:
I've run a backtrace on the core dump, but I am still none the wiser:
Could you please give us any suggestions as to what might be going wrong?