[mpich-discuss] Finalize abort in mpd

Hiatt, Dave M dave.m.hiatt at citi.com
Tue Oct 25 16:24:13 CDT 2011


Roger that, thanks

-----Original Message-----
From: mpich-discuss-bounces at mcs.anl.gov [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Darius Buntinas
Sent: Tuesday, October 25, 2011 3:07 PM
To: mpich-discuss at mcs.anl.gov
Subject: Re: [mpich-discuss] Finalize abort in mpd

I don't believe Hydra will work over Windows (yet).  SMPD is your only option for now.

-d

On Oct 25, 2011, at 3:03 PM, Hiatt, Dave M wrote:

> Actually it does seg fault.  I'll get one and forward it.  Right now I was going to try and build Hydra for the Win 7 environment and see if I can get it to trigger over here as I have VS 2010 and TotalView in this environment.  I presume Hydra will work in Windows.  
> 
> -----Original Message-----
> From: mpich-discuss-bounces at mcs.anl.gov [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Darius Buntinas
> Sent: Tuesday, October 25, 2011 2:58 PM
> To: mpich-discuss at mcs.anl.gov
> Subject: Re: [mpich-discuss] Finalize abort in mpd
> 
> Oops, sorry I thought you said there was a seg fault.  Never mind getting a core dump.  What is the error message you're getting?
> 
> -d
> 
> 
> On Oct 25, 2011, at 2:54 PM, Hiatt, Dave M wrote:
> 
>> Hydra oddly enough exhibits the same behavior.  Smpd appears to be the only one that will run through.  Would that suggest anything?  What is different is that in this version of the app we are sending materially larger and more volume of data back to Node 0.  We develop and test in Windows 7 and we are suing smpd there, and things are bullet proof.  In the past problem were easily recreated, so this is really puzzling.  We are building with gcc and have not had any problems in the past in terms of MPI when we transition between RH and Window so the implications of this are dismaying.  
>> 
>> Here's the odd part, when the problem appears we've essentially completed the run, all the data has been received and pushed to the data files, the compute nodes appear to have all called Finalize and are sitting on the barrier waiting on Node 0 to call Finalize.
>> 
>> One other factoid, this is an OpenMP hybrid application.  But again that has not been an issue in the past.
>> 
>> 
>> -----Original Message-----
>> From: mpich-discuss-bounces at mcs.anl.gov [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Dave Goodell
>> Sent: Tuesday, October 25, 2011 12:11 PM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: Re: [mpich-discuss] Finalize abort in mpd
>> 
>> http://wiki.mcs.anl.gov/mpich2/index.php/Frequently_Asked_Questions#Q:_I_don.27t_like_.3CWHATEVER.3E_about_mpd.2C_or_I.27m_having_a_problem_with_mpdboot.2C_can_you_fix_it.3F
>> 
>> Just use hydra.
>> 
>> -Dave
>> 
>> On Oct 25, 2011, at 11:34 AM CDT, Hiatt, Dave M wrote:
>> 
>>> We have been running with 1.21 for some time with no problems, but with our latest release we now get an error when Finalize is called if we are running mpd.  If we run smpd in RH Linux there is no problem.  I suspect this has probably been seen before but I have had no luck in a Google search so my apologies if this has been answered before.  But could someone be so kind as to tell me what we have done to ourselves if this is a known problem.
>>> 
>>> "So they go on in strange paradox, decided only to be undecided, resolved to be irresolute, adamant for drift, solid for fluidity, all-powerful to be impotent." 
>>> 
>>> Dave M. Hiatt
>>> Director, Risk Analytics
>>> CitiMortgage
>>> 1000 Technology Drive
>>> O'Fallon, MO 63368-2240
>>> 
>>> Telephone:  636-261-1408
>>> 
>>> 
>>> _______________________________________________
>>> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
>>> To manage subscription options or unsubscribe:
>>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>> 
>> _______________________________________________
>> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
>> To manage subscription options or unsubscribe:
>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>> _______________________________________________
>> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
>> To manage subscription options or unsubscribe:
>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
> 
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss

_______________________________________________
mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
To manage subscription options or unsubscribe:
https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss


More information about the mpich-discuss mailing list