[mpich-discuss] errors with MPICH2

Rajeev Thakur thakur at mcs.anl.gov
Wed Feb 18 15:07:16 CST 2009


It's more likely to be a configuration problem on the machines than a timeout.
 
Rajeev
 


  _____  

From: mpich-discuss-bounces at mcs.anl.gov
[mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of
robert.mulloy at shell.com
Sent: Wednesday, February 18, 2009 2:42 PM
To: mpich-discuss at mcs.anl.gov
Subject: Re: [mpich-discuss] errors with MPICH2



Thanks. Could it be a timeout error? Is there a way to increase the time it
waits for a response? 
Robert Mulloy, CCM 
Shell Energy North America 




  _____  

From: mpich-discuss-bounces at mcs.anl.gov <mpich-discuss-bounces at mcs.anl.gov> 
To: mpich-discuss at mcs.anl.gov <mpich-discuss at mcs.anl.gov> 
Sent: Wed Feb 18 14:35:30 2009
Subject: Re: [mpich-discuss] errors with MPICH2 




Probably something with the networking configuration on some of the machines
that prevents processes from contacting each other. You need to run mpdcheck
between every pair of nodes, particularly the ones that show the error.
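
As a rough aid for planning the pairwise checks, here is a minimal Python sketch that lists every ordered pair of nodes and prints the commands to run on each. It assumes the server/client mode of mpdcheck (`mpdcheck -s` on one node, then `mpdcheck -c <host> <port>` on the other, using the host and port that the server prints). The host names `node01` to `node03` and the helper names are hypothetical; substitute your own.

```python
from itertools import permutations


def pairwise_checks(hosts):
    """Return every ordered (server, client) pair of distinct hosts.

    Ordered pairs are used so that each node is tried in both the
    server role and the client role against every other node.
    """
    return list(permutations(hosts, 2))


def print_plan(hosts):
    """Print the commands to run by hand for each pair (nothing is executed)."""
    for server, client in pairwise_checks(hosts):
        # Assumed usage: the server prints a host name and port to pass to the client.
        print(f"on {server}: mpdcheck -s")
        print(f"on {client}: mpdcheck -c {server} <port printed by {server}>")


if __name__ == "__main__":
    # Hypothetical host names, for illustration only.
    print_plan(["node01", "node02", "node03"])
```

The script only prints a checklist; the port has to be filled in by hand from the server's output. For N nodes it lists N*(N-1) checks, which makes it easier to confirm that no pair was skipped when the failing node appears to be random.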
 
Rajeev


  _____  

From: mpich-discuss-bounces at mcs.anl.gov
[mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of
robert.mulloy at shell.com
Sent: Wednesday, February 18, 2009 1:50 PM
To: mpich-discuss at mcs.anl.gov
Subject: [mpich-discuss] errors with MPICH2




I've compiled MPICH2 and am getting frustrating errors. When I run mpdcheck
it works fine on any 1 or 2 nodes, but when I go to more than 2 nodes I keep
getting errors (handle output 393, 384), failed handshake, and it's a random
node that causes the failure.

Has this happened to anyone else before?

Thanks,

Rob
