<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=us-ascii">
<META content="MSHTML 6.00.6000.16608" name=GENERATOR></HEAD>
<BODY text=#000000 bgColor=#ffffff>
<DIV dir=ltr align=left><SPAN class=308135723-18022008><FONT face=Arial
color=#0000ff size=2>Nothing needs to be on the child nodes except the
executables. When compiling, make sure that no mpif.h files already exist in any
of the application directories. Some Fortran applications come with those files,
and they are not compatible across implementations. Other than that I don't know
what the problem might be. Or you can check with the wrf application
developers.</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=308135723-18022008><FONT face=Arial
color=#0000ff size=2></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=308135723-18022008><FONT face=Arial
color=#0000ff size=2>Rajeev</FONT></SPAN></DIV><BR>
<BLOCKQUOTE
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #0000ff 2px solid; MARGIN-RIGHT: 0px">
<DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> Mina Azer
[mailto:azer@envsci.rutgers.edu] <BR><B>Sent:</B> Monday, February 18, 2008
5:19 PM<BR><B>To:</B> Rajeev Thakur<BR><B>Cc:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> Re: rm_3422: p4_error: interrupt
SIGSEGV: 11<BR></FONT><BR></DIV>
<DIV></DIV>the entire application is compiled using the same MPI. the program
does run with all other 11 child nodes except for the recently installed node
"Centos 5". What "files" need to be on the child nodes for mpirun to work
correctly?<BR><BR>Rajeev Thakur wrote:
<BLOCKQUOTE cite=mid00d201c87282$daa599e0$860add8c@mcs.anl.gov type="cite">
<META content="MSHTML 6.00.6000.16608" name=GENERATOR>
<DIV dir=ltr align=left><SPAN class=884160323-18022008><FONT face=Arial
color=#0000ff size=2>Make sure that there are no mpif.h files lying around
in any of the application directories and make sure that the entire
application is compiled using the same MPI
implementation.</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=884160323-18022008><FONT face=Arial
color=#0000ff size=2></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=884160323-18022008><FONT face=Arial
color=#0000ff size=2>Rajeev</FONT></SPAN></DIV><BR>
<BLOCKQUOTE
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: rgb(0,0,255) 2px solid; MARGIN-RIGHT: 0px">
<DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> Mina Azer [<A
class=moz-txt-link-freetext
href="mailto:azer@envsci.rutgers.edu">mailto:azer@envsci.rutgers.edu</A>]
<BR><B>Sent:</B> Monday, February 18, 2008 4:00 PM<BR><B>To:</B> <A
class=moz-txt-link-abbreviated
href="mailto:mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</A><BR><B>Subject:</B>
rm_3422: p4_error: interrupt SIGSEGV: 11<BR></FONT><BR></DIV>Hello
all,<BR><BR>I am new to MPICH however i have read and google ed about my
question but couldn't find a solution. <BR>we have a cluster consisting of
a head node and 12 child nodes all running redhat AS V3. <BR>head node
<BR>runing pgi + mpich 1.2.6.2+wrf<BR>nfs share directory to all child
nodes where we keep programs we want to run with mpich<BR>we use rsh to
login since the system is on private newtork<BR>a node list with nodes
name is created and saved to the shared nfs direcotry<BR><BR>recently I
reinstalled a node with centos 5 and since we are not able to run mpirun
on it<BR><BR><I>/usr/local/mpich-1.2.6-2/bin/mpirun -p4pg nodes.list
wrf.exe<BR>rm_3422: p4_error: interrupt SIGSEGV: 11<BR>Segmentation
fault<BR>p0_22242: p4_error: Child process exited while making
connection to remote process on c3cn12:
0<BR>/usr/local/mpich-1.2.6-2/bin/mpirun: line 1: 22242 Broken
pipe
/d7/user/Dir7/wrf.exe -p4pg nodes.list -p4wd /d7/user/Dir7<BR>P4 procgroup
file is nodes.list.</I><BR><BR>I have mounted the pgi directory on the
child node and still getting the error message. <BR><BR>Is the error
related to the fact that this child node is Centos5 where everything else
is redhat? or the version of redhat is
old?<BR><BR>Thanks<BR></BLOCKQUOTE></BLOCKQUOTE><BR></BLOCKQUOTE></BODY></HTML>