[MPICH] mpirun timeout and killed by signal-2 error for 64 processor option

Bala cppbala at yahoo.com
Mon Mar 19 01:35:11 CDT 2007


Thanks Rajeev, for the reply, we are using 
rocks cluster-4.2.1 that comes with mpich2 by default.

 But still we are getting this error, we are using 
HP blade servers BL460C is tere any known issues
with blades??

thanks,
-bala-


--- Rajeev Thakur <thakur at mcs.anl.gov> wrote:

> Can you try MPICH2 instead of MPICH-1? It is more
> robust. cpi should run
> with any number of processes.
> 
> Rajeev 
> 
> > -----Original Message-----
> > From: owner-mpich-discuss at mcs.anl.gov 
> > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf
> Of Bala
> > Sent: Saturday, March 17, 2007 8:47 AM
> > To: mpich-discuss at mcs.anl.gov
> > Subject: [MPICH] mpirun timeout and killed by
> signal error 
> > for 64 processor option
> > 
> > Hi All,
> >         we have installed mpich on 16 node Intel
> > X86_64
> > dual CPU and dual core cluster( blade servers).
> > 
> >   when we try to run mpirun with cpi sample for
> > -np 32 option runs fine and gives the output also,
> but
> > 
> > after a while there is message like shown below
> > 
> > -----------------------------
> > pi is approximately 3.1416009869231249, Error is
> > 0.0000083333333318
> > wall clock time = 0.003906
> > Timeout in waiting for processes to exit, 2 left. 
> > This may be due to a defectie rsh program (Some
> > versions of Kerberos rsh have been observed to
> have
> > this problem).
> > This is not a problem with P4 or MPICH but a
> problem
> > with the operating
> > environment.  For many applications, this problem
> will
> > only slow down process termination.
> > -----------------------------------
> > 
> > but when we try to run with -np 64 and above
> options
> > 
> > $mpirun -np 64 -machinefile machines ./cpi
> > we get fails with killed by signal 2 error, in our
> > other cluster we can run with -np 64 option.
> > 
> > pls let us know how to avoid these errors??
> > 
> > Is it cpi is too small for -np 64 option to run??
> > 
> > thanks in advance,
> > -bala-
> > 
> > 
> > 
> > 
> >  
> >
>
______________________________________________________________
> > ______________________
> > Need Mail bonding?
> > Go to the Yahoo! Mail Q&A for great tips from
> Yahoo! Answers users.
> >
>
http://answers.yahoo.com/dir/?link=list&sid=396546091
> > 
> > 
> 
> 




 
____________________________________________________________________________________
Need Mail bonding?
Go to the Yahoo! Mail Q&A for great tips from Yahoo! Answers users.
http://answers.yahoo.com/dir/?link=list&sid=396546091




More information about the mpich-discuss mailing list