[mpich-discuss] Can't run an MPI-Programm on more than one host

Dave Goodell goodell at mcs.anl.gov
Wed Oct 24 09:24:44 CDT 2012


I don't see any obvious problems in the output that you sent.  Pavan suggested offline that this could be related to a bad interaction that hydra sometimes has with csh/tcsh.  Perhaps try running under bash or zsh just to check if this is the case?

I'm sorry that I don't have any other suggestions for you at this time.

-Dave

On Oct 16, 2012, at 12:42 PM CDT, Andreas Hauffe wrote:

> Here the output:
> mpiexec -v -f machinefile -n 4 hostname
> host: mlr114u
> host: mlr113u
> 
> ==================================================================================================
> mpiexec options:
> ----------------
>  Base path: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/
>  Launcher: (null)
>  Debug level: 1
>  Enable X: -1
> 
>  Global environment:
>  -------------------
>    LANG=de_DE
>    USER=hauffe
>    LOGNAME=hauffe
>    HOME=/home/lft/mitarbeiter/hauffe
>    PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin
>    MAIL=/var/mail/hauffe
>    SHELL=/usr/bin/tcsh
>    SSH_CLIENT=141.30.52.234 53502 22
>    SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22
>    SSH_TTY=/dev/pts/0
>    TERM=xterm
>    XDG_SESSION_ID=144
>    XDG_RUNTIME_DIR=/run/user/hauffe
>    NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat
>    HOSTTYPE=x86_64
>    VENDOR=suse
>    OSTYPE=linux
>    MACHTYPE=x86_64-suse-linux
>    SHLVL=1
>    PWD=/btmpl/Software
>    GROUP=lftuser
>    HOST=mlr114u
>    CSHEDIT=emacs
>    CPU=x86_64
>    HOSTNAME=mlr114u.mw.tu-dresden.de
>    INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc
>    LESS=-M -I -R
>    LESSOPEN=lessopen.sh %s
>    LESSCLOSE=lessclose.sh %s %s
>    LESS_ADVANCED_PREPROCESSOR=no
>    LESSKEY=/etc/lesskey.bin
>    PAGER=less
>    MORE=-sl
>    MINICOM=-c on
>    MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man
>    XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB
>    XNLSPATH=/usr/share/X11/nls
>    COLORTERM=1
>    SSH_SENDS_LOCALE=yes
>    JAVA_BINDIR=/usr/lib64/jvm/jre/bin
>    JAVA_ROOT=/usr/lib64/jvm/jre
>    JAVA_HOME=/usr/lib64/jvm/jre
>    JRE_HOME=/usr/lib64/jvm/jre
>    CVS_RSH=ssh
>    XCURSOR_THEME=DMZ
>    QT_SYSTEM_DIR=/usr/share/desktop-data
>    LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin
>    NXDIR=/usr/NX
>    FROM_HEADER=
>    NNTPSERVER=news
>    WINDOWMANAGER=/usr/bin/kde4
>    ALSA_CONFIG_PATH=/etc/alsa-pulse.conf
>    SDL_AUDIODRIVER=pulse
>    PYTHONSTARTUP=/etc/pythonstart
>    XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help
>    XDG_CONFIG_DIRS=/etc/xdg
>    G_BROKEN_FILENAMES=1
>    G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252
>    CSHRCREAD=true
>    LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:
>    LS_OPTIONS=-N --color=tty -T 0
>    GPG_TTY=/dev/pts/0
>    MODULE_VERSION=3.2.8
>    MODULE_VERSION_STACK=3.2.8
>    MODULESHOME=/w/appl/lft/Modules/3.2.8
>    MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications
>    LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0
>    _LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0
>    INTEL_LICENSE_FILE=28518 at mlr25s
>    LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64
>    PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin
>    BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:
>    BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:
>    SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349
>    SSH_AGENT_PID=6350
> 
>  Hydra internal environment:
>  ---------------------------
>    GFORTRAN_UNBUFFERED_PRECONNECTED=y
> 
> 
>    Proxy information:
>    *********************
>      [1] proxy: mlr114u (1 cores)
>      Exec list: hostname (1 processes);
> 
>      [2] proxy: mlr113u (3 cores)
>      Exec list: hostname (3 processes);
> 
> 
> ==================================================================================================
> 
> [mpiexec at mlr114u] Timeout set to -1 (-1 means infinite)
> [mpiexec at mlr114u] Got a control port string of mlr114u:46425
> 
> Proxy launch args: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id
> 
> Arguments being passed to proxy 0:
> --version 1.5 --iface-ip-env-name MPICH_INTERFACE_HOSTNAME --hostname mlr114u --global-core-map 0,1,4 --pmi-id-map 0,0 --global-process-count 4 --auto-cleanup 1 --pmi-kvsname kvs_6389_0 --pmi-process-mapping (vector,(0,1,1),(1,1,3)) --ckpoint-num -1 --global-inherited-env 75 'LANG=de_DE' 'USER=hauffe' 'LOGNAME=hauffe' 'HOME=/home/lft/mitarbeiter/hauffe' 'PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin' 'MAIL=/var/mail/hauffe' 'SHELL=/usr/bin/tcsh' 'SSH_CLIENT=141.30.52.234 53502 22' 'SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22' 'SSH_TTY=/dev/pts/0' 'TERM=xterm' 'XDG_SESSION_ID=144' 'XDG_RUNTIME_DIR=/run/user/hauffe' 'NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat' 'HOSTTYPE=x86_64' 'VENDOR=suse' 'OSTYPE=linux' 'MACHTYPE=x86_64-suse-linux' 'SHLVL=1' 'PWD=/btmpl/Software' 'GROUP=lftuser' 'HOST=mlr114u' 'CSHEDIT=emacs' 'CPU=x86_64' 'HOSTNAME=mlr114u.mw.tu-dresden.de' 'INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc' 'LESS=-M -I -R' 'LESSOPEN=lessopen.sh %s' 'LESSCLOSE=lessclose.sh %s %s' 'LESS_ADVANCED_PREPROCESSOR=no' 'LESSKEY=/etc/lesskey.bin' 'PAGER=less' 'MORE=-sl' 'MINICOM=-c on' 'MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man' 'XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB' 'XNLSPATH=/usr/share/X11/nls' 'COLORTERM=1' 'SSH_SENDS_LOCALE=yes' 'JAVA_BINDIR=/usr/lib64/jvm/jre/bin' 'JAVA_ROOT=/usr/lib64/jvm/jre' 'JAVA_HOME=/usr/lib64/jvm/jre' 'JRE_HOME=/usr/lib64/jvm/jre' 'CVS_RSH=ssh' 'XCURSOR_THEME=DMZ' 'QT_SYSTEM_DIR=/usr/share/desktop-data' 'LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'NXDIR=/usr/NX' 'FROM_HEADER=' 'NNTPSERVER=news' 'WINDOWMANAGER=/usr/bin/kde4' 'ALSA_CONFIG_PATH=/etc/alsa-pulse.conf' 'SDL_AUDIODRIVER=pulse' 'PYTHONSTARTUP=/etc/pythonstart' 'XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help' 'XDG_CONFIG_DIRS=/etc/xdg' 'G_BROKEN_FILENAMES=1' 'G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252' 'CSHRCREAD=true' 'LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:' 'LS_OPTIONS=-N --color=tty -T 0' 'GPG_TTY=/dev/pts/0' 'MODULE_VERSION=3.2.8' 'MODULE_VERSION_STACK=3.2.8' 'MODULESHOME=/w/appl/lft/Modules/3.2.8' 'MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications' 'LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0' '_LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0' 'INTEL_LICENSE_FILE=28518 at mlr25s' 'LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64' 'PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:' 'BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:' 'SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349' 'SSH_AGENT_PID=6350' --global-user-env 0 --global-system-env 1 'GFORTRAN_UNBUFFERED_PRECONNECTED=y' --proxy-core-count 1 --exec --exec-appnum 0 --exec-proc-count 1 --exec-local-env 0 --exec-wdir /btmpl/Software --exec-args 1 hostname
> 
> Arguments being passed to proxy 1:
> --version 1.5 --iface-ip-env-name MPICH_INTERFACE_HOSTNAME --hostname mlr113u --global-core-map 0,3,4 --pmi-id-map 0,1 --global-process-count 4 --auto-cleanup 1 --pmi-kvsname kvs_6389_0 --pmi-process-mapping (vector,(0,1,1),(1,1,3)) --ckpoint-num -1 --global-inherited-env 75 'LANG=de_DE' 'USER=hauffe' 'LOGNAME=hauffe' 'HOME=/home/lft/mitarbeiter/hauffe' 'PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin' 'MAIL=/var/mail/hauffe' 'SHELL=/usr/bin/tcsh' 'SSH_CLIENT=141.30.52.234 53502 22' 'SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22' 'SSH_TTY=/dev/pts/0' 'TERM=xterm' 'XDG_SESSION_ID=144' 'XDG_RUNTIME_DIR=/run/user/hauffe' 'NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat' 'HOSTTYPE=x86_64' 'VENDOR=suse' 'OSTYPE=linux' 'MACHTYPE=x86_64-suse-linux' 'SHLVL=1' 'PWD=/btmpl/Software' 'GROUP=lftuser' 'HOST=mlr114u' 'CSHEDIT=emacs' 'CPU=x86_64' 'HOSTNAME=mlr114u.mw.tu-dresden.de' 'INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc' 'LESS=-M -I -R' 'LESSOPEN=lessopen.sh %s' 'LESSCLOSE=lessclose.sh %s %s' 'LESS_ADVANCED_PREPROCESSOR=no' 'LESSKEY=/etc/lesskey.bin' 'PAGER=less' 'MORE=-sl' 'MINICOM=-c on' 'MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man' 'XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB' 'XNLSPATH=/usr/share/X11/nls' 'COLORTERM=1' 'SSH_SENDS_LOCALE=yes' 'JAVA_BINDIR=/usr/lib64/jvm/jre/bin' 'JAVA_ROOT=/usr/lib64/jvm/jre' 'JAVA_HOME=/usr/lib64/jvm/jre' 'JRE_HOME=/usr/lib64/jvm/jre' 'CVS_RSH=ssh' 'XCURSOR_THEME=DMZ' 'QT_SYSTEM_DIR=/usr/share/desktop-data' 'LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'NXDIR=/usr/NX' 'FROM_HEADER=' 'NNTPSERVER=news' 'WINDOWMANAGER=/usr/bin/kde4' 'ALSA_CONFIG_PATH=/etc/alsa-pulse.conf' 'SDL_AUDIODRIVER=pulse' 'PYTHONSTARTUP=/etc/pythonstart' 'XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help' 'XDG_CONFIG_DIRS=/etc/xdg' 'G_BROKEN_FILENAMES=1' 'G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252' 'CSHRCREAD=true' 'LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:' 'LS_OPTIONS=-N --color=tty -T 0' 'GPG_TTY=/dev/pts/0' 'MODULE_VERSION=3.2.8' 'MODULE_VERSION_STACK=3.2.8' 'MODULESHOME=/w/appl/lft/Modules/3.2.8' 'MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications' 'LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0' '_LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0' 'INTEL_LICENSE_FILE=28518 at mlr25s' 'LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64' 'PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:' 'BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:' 'SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349' 'SSH_AGENT_PID=6350' --global-user-env 0 --global-system-env 1 'GFORTRAN_UNBUFFERED_PRECONNECTED=y' --proxy-core-count 3 --exec --exec-appnum 0 --exec-proc-count 3 --exec-local-env 0 --exec-wdir /btmpl/Software --exec-args 1 hostname
> 
> [mpiexec at mlr114u] Launch arguments: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id 0
> [mpiexec at mlr114u] Launch arguments: /usr/bin/ssh -x mlr113u "/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy" --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id 1
> mlr114u
> [mpiexec at mlr114u] control_cb (./pm/pmiserv/pmiserv_cb.c:201): assert (!closed) failed
> [mpiexec at mlr114u] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
> [mpiexec at mlr114u] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:196): error waiting for event
> [mpiexec at mlr114u] main (./ui/mpich/mpiexec.c:325): process manager error waiting for completion
> 
> 
> Zitat von Dave Goodell <goodell at mcs.anl.gov>:
> 
>> Please also send us the output of "mpiexec -v -f machinefile -n 4  hostname" (note the "-v" addition).  This will give us a better idea  of why hydra is having trouble launching the hostname program.
>> 
>> -Dave
>> 
>> On Oct 15, 2012, at 2:50 AM CDT, Andreas Hauffe wrote:
>> 
>>> Hi,
>>> 
>>> I need help to run a MPI-Program with MPICH2 1.5 on more than one host. If I
>>> try the install example:
>>> mpiexec -f machinefile -n <number> hostname
>>> I get the following output:
>>> host> mpiexec -f machinefile -n 4 hostname
>>> mlr114u
>>> [mpiexec at mlr114u] control_cb (./pm/pmiserv/pmiserv_cb.c:201):  assert (!closed)
>>> failed
>>> [mpiexec at mlr114u] HYDT_dmxu_poll_wait_for_event
>>> (./tools/demux/demux_poll.c:77): callback returned error status
>>> [mpiexec at mlr114u] HYD_pmci_wait_for_completion
>>> (./pm/pmiserv/pmiserv_pmci.c:196): error waiting for event
>>> [mpiexec at mlr114u] main (./ui/mpich/mpiexec.c:325): process manager error
>>> waiting for completion
>>> 
>>> I attachted all files but there was no
>>> mpich2-1.5/src/pm/hydra/tools/topo/hwloc/hwloc/config.log
>>> 
>>> The command "mpiexec -info":
>>> 
>>> HYDRA build details:
>>>   Version:                                 1.5
>>>   Release Date:                            Mon Oct  8 14:00:48 CDT 2012
>>>   CC:                              gcc
>>>   CXX:                             c++
>>>   F77:                             ifort
>>>   F90:                             ifort
>>>   Configure options:                       '--disable-option-checking' '--
>>> prefix=/w/appl/lft/mpich/mpich2-1.5.0p0_64bit' '--cache-file=/dev/null' '--
>>> srcdir=.' 'CC=gcc' 'CFLAGS= -O2' 'LDFLAGS= ' 'LIBS=-lrt -lpthread '  'CPPFLAGS=
>>> -I/btmpl/Software/mpich2-1.5/src/mpl/include -
>>> I/btmpl/Software/mpich2-1.5/src/mpl/include -
>>> I/btmpl/Software/mpich2-1.5/src/openpa/src -
>>> I/btmpl/Software/mpich2-1.5/src/openpa/src -
>>> I/btmpl/Software/mpich2-1.5/src/mpi/romio/include'
>>>   Process Manager:                         pmi
>>>   Launchers available:                     ssh rsh fork slurm ll lsf sge
>>> manual persist
>>>   Topology libraries available:            hwloc
>>>   Resource management kernels available:   user slurm ll lsf sge pbs
>>>   Checkpointing libraries available:
>>>   Demux engines available:                 poll select
>>> 
>>> --
>>> Viele Grüße
>>> Andreas Hauffe
>>> Leiter der Arbeitsgruppe "Auslegungsmethoden für Luftfahrzeuge"
>>> 
>>> ----------------------------------------------------------------------------------------------------
>>> Technische Universität Dresden
>>> Institut für Luft- und Raumfahrttechnik / Institute of Aerospace Engineering
>>> Lehrstuhl für Luftfahrzeugtechnik / Chair of Aircraft Engineering
>>> 
>>> D-01062 Dresden
>>> Germany
>>> 
>>> phone : +49 (351) 463 38496
>>> fax :  +49 (351) 463 37263
>>> mail : andreas.hauffe at tu-dresden.de
>>> Website : http://tu-dresden.de/mw/ilr/lft
>>> ----------------------------------------------------------------------------------------------------<files.tar.gz>_______________________________________________
>>> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
>>> To manage subscription options or unsubscribe:
>>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>> 
>> _______________________________________________
>> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
>> To manage subscription options or unsubscribe:
>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>> 
> 
> 
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss



More information about the mpich-discuss mailing list