Different behaviour between fds and fds_openmp #13207

Closed
rupole1185 opened this issue Jul 23, 2024 · 12 comments
@rupole1185

Describe the bug
SEGMENTATION FAULT

forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
libpthread-2.31.s 000014909224D420 Unknown Unknown Unknown
fds 000000000738DED9 Unknown Unknown Unknown
fds 000000000738D170 Unknown Unknown Unknown
fds 0000000006AF79E7 Unknown Unknown Unknown
fds 0000000006AB6BCE Unknown Unknown Unknown
fds 000000000040A71D Unknown Unknown Unknown
libc-2.31.so 000014909030E083 __libc_start_main Unknown Unknown
fds 000000000040A636 Unknown Unknown Unknown

= BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
= RANK 0 PID 11757 RUNNING AT hpc-server49403
= KILLED BY SIGNAL: 9 (Killed)

To Reproduce
Use the attached file [42122-sc6-A-02.fds]
Run FDS 6.9.1 on 60 cores, single-threaded (mpiexec -np 60 fds file_name.fds)
The issue seems to be related to the large number of files opened by FDS. I also tried raising the open-file limit with: ulimit -n 99999

But I still got the error.
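
One way to check whether a raised limit actually reaches a process is to query it from inside that process. The Python sketch below is illustrative only and is not part of FDS: it assumes Linux (it reads /proc), and the function names are hypothetical. `ulimit -n` changes the limit for the current shell and the processes it starts, so ranks launched on other hosts or through a scheduler may see a different value.

```python
import os
import resource


def open_file_limits():
    """Return the (soft, hard) limit on open file descriptors for this process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def count_open_fds(pid="self"):
    """Count open descriptors of a process via /proc (Linux only).

    Returns None when /proc is unavailable or the process cannot be inspected.
    """
    try:
        return len(os.listdir(f"/proc/{pid}/fd"))
    except OSError:
        return None


if __name__ == "__main__":
    soft, hard = open_file_limits()
    print(f"soft limit: {soft}, hard limit: {hard}")
    print(f"descriptors currently open: {count_open_fds()}")
```

Passing the PID of a running rank to `count_open_fds` (for example the PID reported in the termination message) would show how close that rank is to its soft limit, provided the caller has permission to inspect it.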

If you instead run the very same case with 2 threads (mpiexec -np 60 fds_openmp file_name.fds), the simulation runs fine.
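
The two launch configurations compared above can be written down side by side. This is a minimal Python sketch, not an official FDS launcher: `build_run` is a hypothetical helper, and setting the thread count through `OMP_NUM_THREADS` (the standard OpenMP environment variable) is an assumption, since the report does not say how the 2 threads were selected.

```python
import os
import shlex


def build_run(binary, case_file, nprocs, omp_threads=None):
    """Build the mpiexec argument list and environment for one run.

    omp_threads is only applied when given; otherwise the inherited
    environment is passed through unchanged.
    """
    argv = ["mpiexec", "-np", str(nprocs), binary, case_file]
    env = dict(os.environ)
    if omp_threads is not None:
        # Assumption: threads are controlled via OMP_NUM_THREADS.
        env["OMP_NUM_THREADS"] = str(omp_threads)
    return argv, env


if __name__ == "__main__":
    for binary, threads in (("fds", None), ("fds_openmp", 2)):
        argv, env = build_run(binary, "file_name.fds", 60, threads)
        print(shlex.join(argv), "| OMP_NUM_THREADS =", env.get("OMP_NUM_THREADS", "<unset>"))
```

The returned pair can be handed to `subprocess.run(argv, env=env)` so that both variants are started with identical arguments apart from the binary name and thread count.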

Expected behavior
The simulation should run both single-threaded and multi-threaded.

Desktop (please complete the following information):

  • OS: Ubuntu
  • Version: 20.04

42122-sc6-A-02.fds.txt

drjfloyd self-assigned this Jul 23, 2024
@drjfloyd
Contributor

Fortran limits the number of open files per process, but I don't think your case comes close to that limit: the open files would be the error, out, and csv files, which are just a handful. SLCF files are opened and closed on each write. When does your case fail? I am running it now with 6.9.1 and it is out to over 2 s.
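
The open-and-close-per-write pattern mentioned for SLCF output can be illustrated with a small Python sketch. It is not FDS code (FDS is written in Fortran); the file names and helper names are hypothetical, and the descriptor count relies on /proc, so it assumes Linux. The point it demonstrates is that the number of descriptors held open does not grow with the number of output files.

```python
import os
import tempfile


def append_record(path, payload):
    """Open the file, append one record, and close it again."""
    with open(path, "ab") as f:
        f.write(payload)


def open_fd_count():
    """Number of descriptors this process has open (Linux only), else None."""
    try:
        return len(os.listdir("/proc/self/fd"))
    except OSError:
        return None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as workdir:
        paths = [os.path.join(workdir, f"slice_{i}.bin") for i in range(200)]
        before = open_fd_count()
        for _ in range(3):              # three output steps
            for path in paths:          # 200 hypothetical slice files
                append_record(path, b"\x00" * 16)
        after = open_fd_count()
        print("open descriptors before:", before, "after:", after)
```

With this pattern, only files that stay open for the whole run count against the per-process limit, which is the reasoning behind the "just a handful" estimate above.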

@rupole1185
Author

rupole1185 commented Jul 23, 2024 via email

@drjfloyd
Contributor

Are you running the precompiled binaries or have you compiled on your own?

@rupole1185
Author

rupole1185 commented Jul 23, 2024 via email

@drjfloyd
Contributor

This is running without issues for me. What does the start of the .out file say for the case that fails?

Fire Dynamics Simulator

Current Date : July 23, 2024 07:04:18
Revision : FDS-6.9.1-0-g9743202-release
Revision Date : Fri Apr 5 16:53:17 2024 -0400
Compiler : Intel(R) Fortran Intel(R) 64 Compiler Classic for applications running on Intel(R) 64, Version 2021.7.1 Build 20221019_000000
Compilation Date : Apr 06, 2024 04:35:02

Number of MPI Processes: 60

@rupole1185
Author

rupole1185 commented Jul 23, 2024 via email

@drjfloyd
Contributor

@mcgratta @rmcdermo @gforney Does this case run for you? I can't reproduce the error on our system. It is running for me with both the release and the current source.

@rmcdermo
Contributor

The latest compiled source ran fine for me, but the 6.9.1 release failed; see test.err.txt below.
test.err.txt
test.fds.txt

@drjfloyd
Contributor

@rupole1185 Can you try the latest test release and see if that works for you?

https://github.com/firemodels/test_bundles

@rupole1185
Author

rupole1185 commented Jul 23, 2024 via email

@rupole1185
Author

rupole1185 commented Jul 29, 2024 via email

@rmcdermo
Contributor

Thanks for letting us know.
