[petsc-dev] [mpich-discuss] turning off MPI abort messages
Munson, Todd S.
tmunson at mcs.anl.gov
Fri Feb 21 13:54:28 CST 2014
Sounds like someone needs to look up the definition of "customer support".
On Feb 21, 2014, at 1:20 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
> Aron told me that I should have said "we look forward to your patch"
> earlier instead of causing you guys to go apoplectic over my different
> definition of the word crash.
>
> Jeff
>
> On Fri, Feb 21, 2014 at 1:13 PM, Barry Smith <bsmith at mcs.anl.gov> wrote:
>>
>> Understood. But I would like to eliminate both sets of error messages and still provide a useful “return code”. Perhaps compile time options to the library?
>>
>> Barry
>>
>> On Feb 21, 2014, at 12:40 PM, Jim Dinan <james.dinan at gmail.com> wrote:
>>
>>> A little more detail -- you're actually getting messages from two sources: (1) the MPICH library ("application called MPI_Abort...") and (2) the job launcher ("BAD TERMINATION..."). You can eliminate the messages from the job launcher by providing an error code of 0 in MPI_Abort.
>>>
>>> ~Jim.
>>>
>>>
>>>
>>>
>>> On Fri, Feb 21, 2014 at 1:19 PM, Jeff Hammond <jeff.science at gmail.com> wrote:
>>>>> Just configure MPICH such that snprintf isn't discovered by configure
>>>>> and you won't see these messages.
>>>>>
>>>>> The other solution is to fix PETSc so that people can't crash it so easily ;-)
>>>>
>>>> Here we go again. It is not CRASHING; it has detected an error conditioning and trying to appropriately and cleanly terminate. The reason it needs to use MPI_Abort() is that often detecting error conditions is not a uniformly collective thing.
>>>>
>>>> Printing a suitable error message and ending is not crashing. But with all the badly formatted “error messages” printed by MPICH I can not control at the end it looks like it is crashing.
>>>
>>> You're returning a non-zero exit code, which I consider crashing. I
>>> apologize if this definition disagrees with yours. If this is just
>>> gentle cleanup, why not exit with code=0 as Jim suggested already?
>>>
>>> Jeff
>>>
>>>>> On Thu, Feb 20, 2014 at 3:19 PM, Jim Dinan <james.dinan at gmail.com> wrote:
>>>>>> If you can find a way to call MPI_Finalize instead, you will portably
>>>>>> eliminate these messages.
>>>>>>
>>>>>> A lesser solution would be to provide an error code of 0 (or MPI_SUCCESS) to
>>>>>> MPI_Abort, e.g. MPI_Comm_abort(MPI_COMM_WORLD, MPI_SUCCESS). This would
>>>>>> eliminate the error message that you are getting from the job launcher.
>>>>>> MPICH could be modified to be quiet about the abort when the application
>>>>>> aborts with an error code of MPI_SUCCESS.
>>>>>>
>>>>>> ~Jim.
>>>>>>
>>>>>>
>>>>>> On Thu, Feb 20, 2014 at 12:33 PM, Barry Smith <bsmith at mcs.anl.gov> wrote:
>>>>>>>
>>>>>>>
>>>>>>> Is there any way to turn off MPICH (and others) printing messages about
>>>>>>> MPI_Abort? We have already prepared and presented useful error messages to
>>>>>>> the user about the situation and would like to avoid having these additional
>>>>>>> messages printed (that often make the situation look worse than it is)
>>>>>>>
>>>>>>> Thanks
>>>>>>>
>>>>>>> Barry
>>>>>>>
>>>>>>> application called MPI_Abort(MPI_COMM_WORLD, 56) - process 0
>>>>>>> [cli_0]: aborting job:
>>>>>>> application called MPI_Abort(MPI_COMM_WORLD, 56) - process 0
>>>>>>>
>>>>>>>
>>>>>>> ==================================================================mailto:discuss at mpich.org=================
>>>>>>> = BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
>>>>>>> = EXIT CODE: 56
>>>>>>> = CLEANING UP REMAINING PROCESSES
>>>>>>> = YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
>>>>>>>
>>>>>>> ===================================================================================
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> _______________________________________________
>>>>>>> discuss mailing list discuss at mpich.org
>>>>>>> To manage subscription options or unsubscribe:
>>>>>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>>>>
>>>>>>
>>>>>>
>>>>>> _______________________________________________
>>>>>> discuss mailing list discuss at mpich.org
>>>>>> To manage subscription options or unsubscribe:
>>>>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Jeff Hammond
>>>>> jeff.science at gmail.com
>>>>> _______________________________________________
>>>>> discuss mailing list discuss at mpich.org
>>>>> To manage subscription options or unsubscribe:
>>>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>>
>>>> _______________________________________________
>>>> discuss mailing list discuss at mpich.org
>>>> To manage subscription options or unsubscribe:
>>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>
>>>
>>>
>>> --
>>> Jeff Hammond
>>> jeff.science at gmail.com
>>> _______________________________________________
>>> discuss mailing list discuss at mpich.org
>>> To manage subscription options or unsubscribe:
>>> https://lists.mpich.org/mailman/listinfo/discuss
>>>
>>> _______________________________________________
>>> discuss mailing list discuss at mpich.org
>>> To manage subscription options or unsubscribe:
>>> https://lists.mpich.org/mailman/listinfo/discuss
>>
>> _______________________________________________
>> discuss mailing list discuss at mpich.org
>> To manage subscription options or unsubscribe:
>> https://lists.mpich.org/mailman/listinfo/discuss
>
>
>
> --
> Jeff Hammond
> jeff.science at gmail.com
More information about the petsc-dev
mailing list