gRPC stability issues #3313
I'm tracking down the issue to:
In another run, the failure happened either while waiting for the thread ( It might be the second one. More info:
I0726 07:12:46.125645533 592846 call.cc:1553] ops[0]: RECV_CLOSE_ON_SERVER cancelled=0x150e00007e6c
>> Received Command : /DELETE,tmp_ezkhtqopya.inp,,,
/OUTPUT FILE=
>> Msg Size: 40
I0726 07:12:46.155433021 592846 call.cc:3710] grpc_call_start_batch(call=0x150e000051f0, ops=0x150e10bdd220, nops=3, tag=0x150e10bdd5e0, reserved=(nil))
I0726 07:12:46.155443836 592846 call.cc:1553] ops[0]: SEND_INITIAL_METADATA(nil)
I0726 07:12:46.155447260 592846 call.cc:1553] ops[1]: SEND_MESSAGE ptr=0x150e00004870
I0726 07:12:46.155450084 592846 call.cc:1553] ops[2]: SEND_STATUS_FROM_SERVER status=0 details=(null)(nil)
I0726 07:12:46.155508894 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e000049c0, tag=0x150e00007e00, error=OK, done=0x150e3e379890, done_arg=0x150e00007f60, storage=0x150e00007fb0)
I0726 07:12:46.155516675 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e000049c0, tag=0x150e10bdd5e0, error=OK, done=0x150e3e379890, done_arg=0x150e00008030, storage=0x150e00008080)
I0726 07:12:46.155522426 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e000049c0]: OP_COMPLETE: tag:0x150e10bdd5e0 OK
I0726 07:12:46.155527646 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150e000049c0)
I0726 07:12:46.155531608 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e000049c0]: OP_COMPLETE: tag:0x150e00007e00 OK
I0726 07:12:46.155534778 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e000049c0]: QUEUE_SHUTDOWN
I0726 07:12:46.155537564 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e00001168)
I0726 07:12:46.155540117 592846 call.cc:967] grpc_call_unref(c=0x150e000051f0)
I0726 07:12:46.155543101 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e000061c8)
I0726 07:12:46.155555437 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e00000f50)
I0726 07:12:46.155559362 592846 completion_queue.cc:1403] grpc_completion_queue_destroy(cq=0x150e000049c0)
I0726 07:12:46.155561297 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150e000049c0)
I0726 07:12:46.155565419 592846 completion_queue.cc:965] grpc_completion_queue_next(cq=0x185bcb0, deadline=gpr_timespec { tv_sec: 617735, tv_nsec: 454733457, clock_type: 0 }, reserved=(nil))
I0726 07:12:46.283204648 592766 call.cc:1553] ops[0]: RECV_INITIAL_METADATA ptr=0x150de40062c8
I0726 07:12:46.283269132 592766 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150de40062a0
I0726 07:12:46.283301731 592766 completion_queue.cc:521] grpc_completion_queue_create_internal(completion_type=1, polling_type=0)
I0726 07:12:46.283307051 592766 metadata_array.cc:30] grpc_metadata_array_init(array=0x150de4004900)
I0726 07:12:46.283312271 592766 completion_queue.cc:710] cq_end_op_for_next(cq=0x185bcb0, tag=0x150de40048c0, error=OK, done=0x150e3e3a35b0, done_arg=0x150de40010b0, storage=0x150de40010d8)
I0726 07:12:46.283392630 592846 completion_queue.cc:1069] RETURN_EVENT[0x185bcb0]: OP_COMPLETE: tag:0x150de40048c0 OK
I0726 07:12:46.283410667 592846 call.cc:3710] grpc_call_start_batch(call=0x150de40052f0, ops=0x150e10bdd810, nops=1, tag=0x150de4007f00, reserved=(nil))
I0726 07:12:46.283415966 592846 call.cc:1553] ops[0]: RECV_CLOSE_ON_SERVER cancelled=0x150de4007f6c
>> Received Command : __PYMAPDL_SESSION_ID__='c9c33bd67d464f33b04b8d7a947'
/OUTPUT FILE=
>> Msg Size: 72
I0726 07:12:46.290713181 592846 call.cc:3710] grpc_call_start_batch(call=0x150de40052f0, ops=0x150e10bdd220, nops=3, tag=0x150e10bdd5e0, reserved=(nil))
I0726 07:12:46.290731054 592846 call.cc:1553] ops[0]: SEND_INITIAL_METADATA(nil)
I0726 07:12:46.290738425 592846 call.cc:1553] ops[1]: SEND_MESSAGE ptr=0x150e00004870
I0726 07:12:46.290745489 592846 call.cc:1553] ops[2]: SEND_STATUS_FROM_SERVER status=0 details=(null)(nil)
I0726 07:12:46.290833731 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150de4004e20, tag=0x150de4007f00, error=OK, done=0x150e3e379890, done_arg=0x150de4008060, storage=0x150de40080b0)
I0726 07:12:46.290850374 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150de4004e20, tag=0x150e10bdd5e0, error=OK, done=0x150e3e379890, done_arg=0x150de4008130, storage=0x150de4008180)
I0726 07:12:46.290862377 592846 completion_queue.cc:1299] RETURN_EVENT[0x150de4004e20]: OP_COMPLETE: tag:0x150e10bdd5e0 OK
I0726 07:12:46.290872722 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150de4004e20)
I0726 07:12:46.290882226 592846 completion_queue.cc:1299] RETURN_EVENT[0x150de4004e20]: OP_COMPLETE: tag:0x150de4007f00 OK
I0726 07:12:46.290889530 592846 completion_queue.cc:1299] RETURN_EVENT[0x150de4004e20]: QUEUE_SHUTDOWN
I0726 07:12:46.290897105 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150de4004b18)
I0726 07:12:46.290904051 592846 call.cc:967] grpc_call_unref(c=0x150de40052f0)
I0726 07:12:46.290911660 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150de40062c8)
I0726 07:12:46.290931772 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150de4004900)
I0726 07:12:46.290939304 592846 completion_queue.cc:1403] grpc_completion_queue_destroy(cq=0x150de4004e20)
I0726 07:12:46.290945142 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150de4004e20)
I0726 07:12:46.290956556 592846 completion_queue.cc:965] grpc_completion_queue_next(cq=0x185bcb0, deadline=gpr_timespec { tv_sec: 617735, tv_nsec: 590122631, clock_type: 0 }, reserved=(nil))
I0726 07:12:46.435088177 592762 call.cc:1553] ops[0]: RECV_INITIAL_METADATA ptr=0x150df40062c8
I0726 07:12:46.435149850 592762 completion_queue.cc:521] grpc_completion_queue_create_internal(completion_type=1, polling_type=0)
I0726 07:12:46.435178654 592762 metadata_array.cc:30] grpc_metadata_array_init(array=0x150df4004530)
I0726 07:12:46.435192186 592762 completion_queue.cc:710] cq_end_op_for_next(cq=0x185bcb0, tag=0x150df40044f0, error=OK, done=0x150e3e3a35b0, done_arg=0x150df4005060, storage=0x150df4005088)
I0726 07:12:46.435271734 592846 completion_queue.cc:1069] RETURN_EVENT[0x185bcb0]: OP_COMPLETE: tag:0x150df40044f0 OK
I0726 07:12:46.435300243 592846 call.cc:3710] grpc_call_start_batch(call=0x150df40052f0, ops=0x150e10bdd810, nops=1, tag=0x150df4007e00, reserved=(nil))
I0726 07:12:46.435311470 592846 call.cc:1553] ops[0]: RECV_CLOSE_ON_SERVER cancelled=0x150df4007e6c
I0726 07:12:46.435366099 592846 call.cc:3710] grpc_call_start_batch(call=0x150df40052f0, ops=0x150e10bdcbc0, nops=1, tag=0x150e10bdd1d8, reserved=(nil))
I0726 07:12:46.435375559 592846 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150e10bdd1f0
I0726 07:12:46.515994160 592782 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 515982833, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516009628 592780 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516001378, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516004315 592776 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 515992831, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516018757 592774 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516010607, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516087958 592783 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516079969, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516090318 592772 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516082325, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516106139 592779 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516098100, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516115230 592781 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516111659, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516136035 592778 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516128189, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516192041 592777 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516183708, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.516964371 592775 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 516959559, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.517957758 592773 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992367, tv_nsec: 517955391, clock_type: 1 }, reserved=(nil))
I0726 07:12:46.573224336 592768 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150df4004a50, tag=0x150e10bdd1d8, error=OK, done=0x150e3e379890, done_arg=0x150df4008030, storage=0x150df4008080)
I0726 07:12:46.573325602 592846 completion_queue.cc:1299] RETURN_EVENT[0x150df4004a50]: OP_COMPLETE: tag:0x150e10bdd1d8 OK
>> Open File tmp_yhmzjiqnwb.inp
I0726 07:12:46.574772302 592846 call.cc:3710] grpc_call_start_batch(call=0x150df40052f0, ops=0x150e10bdcbc0, nops=1, tag=0x150e10bdd358, reserved=(nil))
I0726 07:12:46.574780467 592846 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150e10bdd370
I0726 07:12:46.585773173 592755 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150df4004a50, tag=0x150e10bdd358, error=OK, done=0x150e3e379890, done_arg=0x150df4008030, storage=0x150df4008080)
I0726 07:12:46.585818947 592846 completion_queue.cc:1299] RETURN_EVENT[0x150df4004a50]: OP_COMPLETE: tag:0x150e10bdd358 OK
>> File generated
I0726 07:12:46.586413845 592846 call.cc:3710] grpc_call_start_batch(call=0x150df40052f0, ops=0x150e10bdd260, nops=3, tag=0x150e10bdd5e0, reserved=(nil))
I0726 07:12:46.586418917 592846 call.cc:1553] ops[0]: SEND_INITIAL_METADATA(nil)
I0726 07:12:46.586421280 592846 call.cc:1553] ops[1]: SEND_MESSAGE ptr=0x150e00004870
I0726 07:12:46.586423856 592846 call.cc:1553] ops[2]: SEND_STATUS_FROM_SERVER status=0 details=(null)(nil)
I0726 07:12:46.586457264 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150df4004a50, tag=0x150df4007e00, error=OK, done=0x150e3e379890, done_arg=0x150df4007f60, storage=0x150df4007fb0)
I0726 07:12:46.586462913 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150df4004a50, tag=0x150e10bdd5e0, error=OK, done=0x150e3e379890, done_arg=0x150df4008100, storage=0x150df4008150)
I0726 07:12:46.586466720 592846 completion_queue.cc:1299] RETURN_EVENT[0x150df4004a50]: OP_COMPLETE: tag:0x150e10bdd5e0 OK
I0726 07:12:46.586470425 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150df4004a50)
I0726 07:12:46.586473568 592846 completion_queue.cc:1299] RETURN_EVENT[0x150df4004a50]: OP_COMPLETE: tag:0x150df4007e00 OK
I0726 07:12:46.586476367 592846 completion_queue.cc:1299] RETURN_EVENT[0x150df4004a50]: QUEUE_SHUTDOWN
I0726 07:12:46.586478807 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150df4004748)
I0726 07:12:46.586481215 592846 call.cc:967] grpc_call_unref(c=0x150df40052f0)
I0726 07:12:46.586483869 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150df40062c8)
I0726 07:12:46.586492013 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150df4004530)
I0726 07:12:46.586494893 592846 completion_queue.cc:1403] grpc_completion_queue_destroy(cq=0x150df4004a50)
I0726 07:12:46.586497040 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150df4004a50)
I0726 07:12:46.586501087 592846 completion_queue.cc:965] grpc_completion_queue_next(cq=0x185bcb0, deadline=gpr_timespec { tv_sec: 617735, tv_nsec: 885669370, clock_type: 0 }, reserved=(nil))
I0726 07:12:47.355654865 592757 call.cc:1553] ops[0]: RECV_INITIAL_METADATA ptr=0x150e0c006988
I0726 07:12:47.355717554 592757 completion_queue.cc:521] grpc_completion_queue_create_internal(completion_type=1, polling_type=0)
I0726 07:12:47.355726926 592757 metadata_array.cc:30] grpc_metadata_array_init(array=0x150e0c0052f0)
I0726 07:12:47.355734977 592757 completion_queue.cc:710] cq_end_op_for_next(cq=0x185bcb0, tag=0x150e0c0052b0, error=OK, done=0x150e3e3a35b0, done_arg=0x150e0c004c00, storage=0x150e0c004c28)
I0726 07:12:47.355805663 592846 completion_queue.cc:1069] RETURN_EVENT[0x185bcb0]: OP_COMPLETE: tag:0x150e0c0052b0 OK
I0726 07:12:47.355825236 592846 call.cc:3710] grpc_call_start_batch(call=0x150e0c0059b0, ops=0x150e10bdd810, nops=1, tag=0x150e0c0084c0, reserved=(nil))
I0726 07:12:47.355831023 592846 call.cc:1553] ops[0]: RECV_CLOSE_ON_SERVER cancelled=0x150e0c00852c
I0726 07:12:47.355886267 592846 call.cc:3710] grpc_call_start_batch(call=0x150e0c0059b0, ops=0x150e10bdcbc0, nops=1, tag=0x150e10bdd1d8, reserved=(nil))
I0726 07:12:47.355890988 592846 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150e10bdd1f0
I0726 07:12:47.359560590 592764 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e0c004a00, tag=0x150e10bdd1d8, error=OK, done=0x150e3e379890, done_arg=0x150e0c0086f0, storage=0x150e0c008740)
I0726 07:12:47.359651878 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e0c004a00]: OP_COMPLETE: tag:0x150e10bdd1d8 OK
>> Open File _input_tmp_fwqhsgxfkw_.inp
I0726 07:12:47.361279615 592846 call.cc:3710] grpc_call_start_batch(call=0x150e0c0059b0, ops=0x150e10bdcbc0, nops=1, tag=0x150e10bdd358, reserved=(nil))
I0726 07:12:47.361288144 592846 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150e10bdd370
I0726 07:12:47.361301455 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e0c004a00, tag=0x150e10bdd358, error=OK, done=0x150e3e379890, done_arg=0x150e0c0086f0, storage=0x150e0c008740)
I0726 07:12:47.361307719 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e0c004a00]: OP_COMPLETE: tag:0x150e10bdd358 OK
>> File generated
I0726 07:12:47.363793787 592846 call.cc:3710] grpc_call_start_batch(call=0x150e0c0059b0, ops=0x150e10bdd260, nops=3, tag=0x150e10bdd5e0, reserved=(nil))
I0726 07:12:47.363799342 592846 call.cc:1553] ops[0]: SEND_INITIAL_METADATA(nil)
I0726 07:12:47.363801663 592846 call.cc:1553] ops[1]: SEND_MESSAGE ptr=0x150e00004870
I0726 07:12:47.363804277 592846 call.cc:1553] ops[2]: SEND_STATUS_FROM_SERVER status=0 details=(null)(nil)
I0726 07:12:47.363854303 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e0c004a00, tag=0x150e0c0084c0, error=OK, done=0x150e3e379890, done_arg=0x150e0c008620, storage=0x150e0c008670)
I0726 07:12:47.363861167 592846 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150e0c004a00, tag=0x150e10bdd5e0, error=OK, done=0x150e3e379890, done_arg=0x150e0c0087c0, storage=0x150e0c008810)
I0726 07:12:47.363864971 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e0c004a00]: OP_COMPLETE: tag:0x150e10bdd5e0 OK
I0726 07:12:47.363869113 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150e0c004a00)
I0726 07:12:47.363872744 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e0c004a00]: OP_COMPLETE: tag:0x150e0c0084c0 OK
I0726 07:12:47.363875421 592846 completion_queue.cc:1299] RETURN_EVENT[0x150e0c004a00]: QUEUE_SHUTDOWN
I0726 07:12:47.363878145 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e0c005508)
I0726 07:12:47.363880656 592846 call.cc:967] grpc_call_unref(c=0x150e0c0059b0)
I0726 07:12:47.363883147 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e0c006988)
I0726 07:12:47.363898215 592846 metadata_array.cc:35] grpc_metadata_array_destroy(array=0x150e0c0052f0)
I0726 07:12:47.363901768 592846 completion_queue.cc:1403] grpc_completion_queue_destroy(cq=0x150e0c004a00)
I0726 07:12:47.363903664 592846 completion_queue.cc:1398] grpc_completion_queue_shutdown(cq=0x150e0c004a00)
I0726 07:12:47.363908302 592846 completion_queue.cc:965] grpc_completion_queue_next(cq=0x185bcb0, deadline=gpr_timespec { tv_sec: 617736, tv_nsec: 663076263, clock_type: 0 }, reserved=(nil))
I0726 07:12:47.515266432 592765 call.cc:1553] ops[0]: RECV_INITIAL_METADATA ptr=0x150df0006588
I0726 07:12:47.515333109 592765 call.cc:1553] ops[0]: RECV_MESSAGE ptr=0x150df0006560
I0726 07:12:47.515352885 592765 completion_queue.cc:521] grpc_completion_queue_create_internal(completion_type=1, polling_type=0)
I0726 07:12:47.515360519 592765 metadata_array.cc:30] grpc_metadata_array_init(array=0x150df0001790)
I0726 07:12:47.515367937 592765 completion_queue.cc:710] cq_end_op_for_next(cq=0x185bcb0, tag=0x150df0001750, error=OK, done=0x150e3e3a35b0, done_arg=0x150df0001eb0, storage=0x150df0001ed8)
I0726 07:12:47.515385958 592846 completion_queue.cc:1069] RETURN_EVENT[0x185bcb0]: OP_COMPLETE: tag:0x150df0001750 OK
I0726 07:12:47.515407163 592846 call.cc:3710] grpc_call_start_batch(call=0x150df00055b0, ops=0x150e10bdd810, nops=1, tag=0x150df0008450, reserved=(nil))
I0726 07:12:47.515412219 592846 call.cc:1553] ops[0]: RECV_CLOSE_ON_SERVER cancelled=0x150df00084bc
>> Entering: Mapdl::InputFileS
>> InputFile Name : _input_tmp_fwqhsgxfkw_.inp
>> Mapdl::InputFileS: Not mute.
/OUTPUT FILE=
I0726 07:12:47.570766831 592984 call.cc:3710] grpc_call_start_batch(call=0x150df00055b0, ops=0x150e109dc460, nops=2, tag=0x150df0001a60, reserved=(nil))
I0726 07:12:47.570783782 592984 call.cc:1553] ops[0]: SEND_INITIAL_METADATA(nil)
I0726 07:12:47.570787278 592984 call.cc:1553] ops[1]: SEND_MESSAGE ptr=0x150d88001180
I0726 07:12:47.570834558 592984 completion_queue.cc:784] cq_end_op_for_pluck(cq=0x150df0001cb0, tag=0x150df0001a60, error=OK, done=0x150e3e379890, done_arg=0x150df0008680, storage=0x150df00086d0)
I0726 07:12:47.570844429 592984 completion_queue.cc:1299] RETURN_EVENT[0x150df0001cb0]: OP_COMPLETE: tag:0x150df0001a60 OK
I0726 07:12:47.616965839 592776 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 616959995, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.616972546 592777 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 616967004, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.616979852 592780 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 616975833, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.616987025 592774 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 616982749, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.616994259 592782 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 616990178, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617020602 592783 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617016532, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617032718 592781 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617028567, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617060243 592779 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617056524, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617065905 592778 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617063902, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617083323 592772 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617079767, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.617950739 592775 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 617948157, clock_type: 1 }, reserved=(nil))
I0726 07:12:47.618953630 592773 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992368, tv_nsec: 618949750, clock_type: 1 }, reserved=(nil))
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/POST1 '
*****MAPDL VERIFICATION RUN ONLY*****
DO NOT USE RESULTS FOR PRODUCTION
***** MAPDL RESULTS INTERPRETATION (POST1) *****
USE LAST SUBSTEP ON RESULT FILE FOR LOAD CASE 0
SET COMMAND GOT LOAD STEP= 1 SUBSTEP= 1 CUMULATIVE ITERATION= 1
TIME/FREQUENCY= 1.0000
TITLE= VM1, STATICALLY INDETERMINATE REACTION FORCE ANALYSIS
>> Mapdl::InputFileS: Running command: 'NSEL,S,LOC,Y,10 '
SELECT FOR ITEM=LOC COMPONENT=Y BETWEEN 10.000 AND 10.000
KABS= 0. TOLERANCE= 0.500000E-01
1 NODES (OF 4 DEFINED) SELECTED BY NSEL COMMAND.
>> Mapdl::InputFileS: Running command: 'ESLN '
SELECT ALL ELEMENTS HAVING ANY NODE IN NODAL SET.
1 ELEMENTS (OF 3 DEFINED) SELECTED FROM
1 SELECTED NODES BY ESLN COMMAND.
>> Mapdl::InputFileS: Running command: 'FSUM '
***** SUMMATION OF TOTAL FORCES AND MOMENTS IN THE GLOBAL COORDINATE SYSTEM *****
FX = 0.000000
FY = -900.0000
FZ = 0.000000
MX = 0.000000
MY = 0.000000
MZ = 0.000000
SUMMATION POINT= 0.0000 0.0000 0.0000
>> Mapdl::InputFileS: Running command: '*GET,REAC_1,FSUM,,ITEM,FY '
*GET REAC_1 FROM FSUM ITEM=ITEM FY VALUE= -900.000000
>> Mapdl::InputFileS: Running command: 'NSEL,S,LOC,Y,0 '
SELECT FOR ITEM=LOC COMPONENT=Y BETWEEN 0.0000 AND 0.0000
KABS= 0. TOLERANCE= 0.100000E-05
1 NODES (OF 4 DEFINED) SELECTED BY NSEL COMMAND.
>> Mapdl::InputFileS: Running command: 'ESLN '
SELECT ALL ELEMENTS HAVING ANY NODE IN NODAL SET.
1 ELEMENTS (OF 3 DEFINED) SELECTED FROM
1 SELECTED NODES BY ESLN COMMAND.
>> Mapdl::InputFileS: Running command: 'FSUM '
***** SUMMATION OF TOTAL FORCES AND MOMENTS IN THE GLOBAL COORDINATE SYSTEM *****
FX = 0.000000
FY = -600.0000
FZ = 0.000000
MX = 0.000000
MY = 0.000000
MZ = 0.000000
SUMMATION POINT= 0.0000 0.0000 0.0000
>> Mapdl::InputFileS: Running command: '*GET,REAC_2,FSUM,,ITEM,FY '
*GET REAC_2 FROM FSUM ITEM=ITEM FY VALUE= -600.000000
>> Mapdl::InputFileS: Running command: ' '
>> Mapdl::InputFileS: Running command: '*DIM,LABEL,CHAR,2 '
SET PARAMETER DIMENSIONS ON LABEL TYPE=CHAR DIMENSIONS= 2 1 1
>> Mapdl::InputFileS: Running command: '*DIM,VALUE,,2,3 '
SET PARAMETER DIMENSIONS ON VALUE TYPE=ARRA DIMENSIONS= 2 3 1
>> Mapdl::InputFileS: Running command: 'LABEL(1) = 'R1, lb','R2, lb ' '
PARAMETER LABEL(1) = R1, lb R2, lb
>> Mapdl::InputFileS: Running command: '*VFILL,VALUE(1,1),DATA,900.0,600.0 '
VECTOR FILL OPERATION *VFILL
VALUE(1,1) = DATA
VECTOR LENGTH= 2
VALUES= 900.00 600.00
>> Mapdl::InputFileS: Running command: '*VFILL,VALUE(1,2),DATA,ABS(REAC_1),ABS(REAC_2) '
VECTOR FILL OPERATION *VFILL
VALUE(1,2) = DATA
VECTOR LENGTH= 2
VALUES= 900.00 600.00
>> Mapdl::InputFileS: Running command: '*VFILL,VALUE(1,3),DATA,ABS(REAC_1 / 900) ,ABS( REAC_2 / 600) '
VECTOR FILL OPERATION *VFILL
VALUE(1,3) = DATA
VECTOR LENGTH= 2
VALUES= 1.0000 1.0000
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/OUT,vm1,vrt '
/OUTPUT FILE=
>> Mapdl::InputFileS: Running command: 'FINISH '
EXIT THE MAPDL POST1 DATABASE PROCESSOR
***** ROUTINE COMPLETED ***** CP = 0.000
>> Mapdl::InputFileS: Running command: '*LIST,vm1,vrt '
LISTING OF THE DATA ON FILE
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM '
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM,------------------- VM1 RESULTS COMPARISON --------------------- '
------------------- VM1 RESULTS COMPARISON ---------------------
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM, '
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM, | TARGET | Mechanical APDL | RATIO '
| TARGET | Mechanical APDL | RATIO
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM, '
>> Mapdl::InputFileS: Running command: '*VWRITE,LABEL(1),VALUE(1,1),VALUE(1,2),VALUE(1,3) '
R1, lb 900.0 900.0 1.000
R2, lb 600.0 600.0 1.000
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/COM,---------------------------------------------------------------- '
----------------------------------------------------------------
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/OUT '
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/CLE,NOSTART '
CLEAR MAPDL DATABASE AND RESTART
Ansys Mechanical Enterprise
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/HEF C^? '
>> Mapdl::InputFileS: if 'StrCmd[0] == '/''
>> Mapdl::InputFileS: Running command: '/HEF C^? '
>> Mapdl::InputFileS: Reading output file'
>> Mapdl::InputFileS: Output file read'
>> Mapdl::InputFileS: Waiting for thread to finish.'
*** ERROR *** CP = 0.000 TIME= 00:00:00
An unexpected error ( Segmentation Violation ) has occurred... MAPDL
internal data has been corrupted. MAPDL is unable to recover and will
terminate. Previously saved files are unaffected. Please send the
data leading to this operation to your technical support provider, as
this will allow ANSYS, Inc to improve the program.
*** NOTE *** CP = 0.000 TIME= 00:00:00
Additional information about signal generated errors can be found in
QSView -> Dept/Team Info -> Core ANSYS Team -> DEBUGGING.
Current ANSYS Traceback:
ANSYS
GRPC_Start
gRPCMapdlStart
ANSYS
+--------------------- M A P D L S T A T I S T I C S ------------------------+
Release: 2024 R2 Build: 24.2BETA Update: UP20240603 Platform: LINUX x64
Using: DEVELOPMENT Build
Date Run: 07/26/2024 Time: 07:12 Process ID: 592654
Operating System: Red Hat Enterprise Linux release 9.4 (Plow)
Processor Model: Intel(R) Xeon(R) Gold 6136 CPU @ 3.00GHz
Compiler: Intel(R) Fortran Compiler Classic Version 2021.9 (Build: 20230302)
Intel(R) C/C++ Compiler Classic Version 2021.9 (Build: 20230302)
Intel(R) oneAPI Math Kernel Library Version 2023.1-Product Build 20230303
Number of processes requested : 4
Number of threads per process requested : 1
Total number of cores requested : 4 (Distributed Memory Parallel)
MPI Type: INTELMPI
MPI Version: Intel(R) MPI Library 2021.11 for Linux* OS
GPU Acceleration: Not Requested
Job Name: file
Core Machine Name Working Directory
-----------------------------------------------------
0 lyolinlst5.ansys.com /home/gayuso/tmp
1 lyolinlst5.ansys.com /home/gayuso/tmp
2 lyolinlst5.ansys.com /home/gayuso/tmp
3 lyolinlst5.ansys.com /home/gayuso/tmp
Latency time from master to core 1 = 3.138 microseconds
Latency time from master to core 2 = 3.261 microseconds
Latency time from master to core 3 = 3.054 microseconds
Communication speed from master to core 1 = 8427.67 MB/sec
Communication speed from master to core 2 = 3983.21 MB/sec
Communication speed from master to core 3 = 3991.54 MB/sec
Total CPU time for main thread : 0.0 seconds
Total CPU time summed for all threads : 6.9 seconds
Elapsed time spent obtaining a license : 2.7 seconds
Elapsed time spent pre-processing model (/PREP7) : 0.0 seconds
Elapsed time spent solution - preprocessing : 0.0 seconds
Elapsed time spent computing solution : 0.0 seconds
Elapsed time spent solution - postprocessing : 0.0 seconds
Elapsed time spent post-processing model (/POST1) : 0.0 seconds
Elapsed time spent post-processing model (/POST26): 0.3 seconds
Equation solver used : Sparse (symmetric)
+------------------ E N D M A P D L S T A T I S T I C S -------------------+
*-----------------------------------------------------------------------------*
| |
| RUN COMPLETED |
| |
|-----------------------------------------------------------------------------|
| |
| Ansys MAPDL 2024 R2 Build 24.2BETA UP20240603 LINUX x64 |
| |
|-----------------------------------------------------------------------------|
| |
| Database Requested(-db) 1024 MB Scratch Memory Requested 1024 MB |
| Maximum Database Used 1 MB Maximum Scratch Memory Used 46 MB |
| |
|-----------------------------------------------------------------------------|
| |
| CP Time (sec) = 6.894 Time = 07:12:48 |
| Elapsed Time (sec) = 30.000 Date = 07/26/2024 |
| |
*-----------------------------------------------------------------------------*
I0726 07:12:48.718028046 592779 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718016111, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718052499 592782 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718045158, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718033900 592778 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718022570, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718036176 592776 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718023572, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718079224 592780 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718071051, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718123037 592781 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718112957, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718140697 592783 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718132668, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718146127 592774 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718137964, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718134345 592777 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718126596, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718201919 592772 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718193710, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.718958500 592775 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 718954756, clock_type: 1 }, reserved=(nil))
I0726 07:12:48.719952955 592773 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992369, tv_nsec: 719949156, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.818985531 592774 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 818975394, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.818989384 592782 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 818972646, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.818988743 592776 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 818979975, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819034032 592777 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819025706, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819059077 592772 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819048695, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819070779 592778 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819061633, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819081412 592779 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819073245, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819075831 592781 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819066262, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819116336 592780 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819107909, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819085525 592783 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819077489, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.819982367 592775 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 819977932, clock_type: 1 }, reserved=(nil))
I0726 07:12:49.820960490 592773 completion_queue.cc:965] grpc_completion_queue_next(cq=0x194d800, deadline=gpr_timespec { tv_sec: 1721992370, tv_nsec: 820958128, clock_type: 1 }, reserved=(nil))
Abort(33) on node 0 (rank 0 in comm 0): application called MPI_Abort(MPI_COMM_WORLD, 33) - process 0 |
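The trace above interleaves gRPC core log lines with MAPDL's own output, which makes it hard to see which thread was doing what just before the crash. A minimal sketch of a triage helper follows. It assumes the line layout seen in this log (severity and date, time, thread id, `file:line]`, message); the names `TRACE_LINE`, `parse_trace` and `summarize` are hypothetical and not part of gRPC or PyMAPDL.

```python
import re
from collections import Counter

# Matches gRPC core trace lines such as:
# I0726 07:12:46.155433021 592846 call.cc:3710] grpc_call_start_batch(...)
TRACE_LINE = re.compile(
    r"^(?P<severity>[IWEF])(?P<date>\d{4})\s+"
    r"(?P<time>\d{2}:\d{2}:\d{2}\.\d+)\s+"
    r"(?P<thread>\d+)\s+"
    r"(?P<source>[\w.]+:\d+)\]\s+"
    r"(?P<message>.*)$"
)


def parse_trace(lines):
    """Yield one dict per gRPC trace line; MAPDL output lines are skipped."""
    for line in lines:
        match = TRACE_LINE.match(line.strip())
        if match:
            yield match.groupdict()


def summarize(lines):
    """Return the number of trace lines per thread id and the last trace event."""
    events = list(parse_trace(lines))
    per_thread = Counter(event["thread"] for event in events)
    last = events[-1] if events else None
    return per_thread, last


# Three lines copied from the log above; the middle one is MAPDL output.
sample = [
    "I0726 07:12:46.155433021 592846 call.cc:3710] grpc_call_start_batch("
    "call=0x150e000051f0, ops=0x150e10bdd220, nops=3, tag=0x150e10bdd5e0, reserved=(nil))",
    ">> Msg Size: 40",
    "I0726 07:12:46.283204648 592766 call.cc:1553] ops[0]: RECV_INITIAL_METADATA ptr=0x150de40062c8",
]
per_thread, last = summarize(sample)
```

Feeding a whole log file through `summarize` gives a quick view of which thread ids were active and what the final gRPC event was before MAPDL aborted.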
Now it doesn't seem to fail... reaching up to 2500 iterations, whereas before it was failing around 30-60. 🧐 Let's keep this stale for the moment.
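For comparing runs like these (failing around 30-60 iterations versus surviving 2500), a small harness that counts completed iterations can help. This is only a sketch under assumptions: `count_iterations_until_failure` and `flaky` are hypothetical names, and the simulated failure stands in for a real MAPDL crash.

```python
def count_iterations_until_failure(run_once, max_iterations=2500):
    """Call run_once(i) repeatedly; return (completed_iterations, error_or_None)."""
    for i in range(max_iterations):
        try:
            run_once(i)
        except Exception as error:  # broad on purpose: any failure ends the run
            return i, error
    return max_iterations, None


def flaky(i):
    """Stand-in for one test iteration; fails on iteration 45 (simulated)."""
    if i == 45:
        raise RuntimeError("simulated crash")


completed, error = count_iterations_until_failure(flaky)
```

With PyMAPDL, `run_once` might wrap something like `mapdl.clear()` followed by `mapdl.input(...)` on an already connected instance; the exact script used for the runs above is not shown here.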
I also tested the v25.1.0 Docker image, and it does not reproduce the error.
I should check with SMP.
Working on Windows
Most of the time, the
One crash
I caught a crash while running
Application error
Windows error reporting
and
Report.wer
From the
From 1
|
Running the
Output
Crash file
PyMAPDL output
This is shown in PyMAPDL as:
|
Analysing the dump file (from: #3313 (comment)) in WinDbg, it seems we hit a breakpoint?
Full dump analysis
|
While running v241 I saw it crashing:
PyMAPDL output
MAPDL output file
Process died.
Event viewer
From this event, it seems it might be a memory leak issue (reference)
Task manager
CPUs
Memory RAM
Dump file analysis:
Again the breakpoint error...
For the record, crashing errors are not logged to thread output files ( I mean crashing errors like:
|
I'm going to use this issue to track and report progress.
Description
MAPDL tends to crash after several iterations.
Prediscussion and assumptions
It has been suggested here that it might be related to a race condition when writing the output file using (/out).
I believe issue #2386 is closely related to all of this, so I'm testing something similar.
I should also check #1114
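To make the suspected failure mode concrete: if one thread writes an output file while another reads it, the reader must wait for the writer to finish, otherwise it may see a partial file. The sketch below shows the safe ordering in plain Python. It is not MAPDL's implementation, which is not shown in this issue; `write_output` and `run_and_read` are hypothetical names.

```python
import os
import tempfile
import threading


def write_output(path, lines):
    """Writer thread body: write every line to the output file."""
    with open(path, "w") as handle:
        for line in lines:
            handle.write(line + "\n")


def run_and_read(path, lines):
    """Start the writer, wait for it to finish, then read the file back."""
    writer = threading.Thread(target=write_output, args=(path, lines))
    writer.start()
    writer.join()  # reading before this join could return a partial file
    with open(path) as handle:
        return handle.read().splitlines()


with tempfile.TemporaryDirectory() as workdir:
    expected = [f"line {i}" for i in range(1000)]
    result = run_and_read(os.path.join(workdir, "out.txt"), expected)
```

The key design point is that the read happens only after `join()` returns. Moving the read before the join would make the result depend on thread scheduling, which is the kind of intermittent behaviour described in this issue.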
Testing framework
Run MAPDL on lyon5.
Connect to it locally and run:
Example results
crash file
MAPDL stdout
Full output