[mpich-discuss] Can't run an MPI-Programm on more than one host
Dave Goodell
goodell at mcs.anl.gov
Wed Oct 24 09:24:44 CDT 2012
I don't see any obvious problems in the output that you sent. Pavan suggested offline that this could be related to a bad interaction that hydra sometimes has with csh/tcsh. Perhaps try running under bash or zsh just to check if this is the case?
I'm sorry that I don't have any other suggestions for you at this time.
-Dave
On Oct 16, 2012, at 12:42 PM CDT, Andreas Hauffe wrote:
> Here the output:
> mpiexec -v -f machinefile -n 4 hostname
> host: mlr114u
> host: mlr113u
>
> ==================================================================================================
> mpiexec options:
> ----------------
> Base path: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/
> Launcher: (null)
> Debug level: 1
> Enable X: -1
>
> Global environment:
> -------------------
> LANG=de_DE
> USER=hauffe
> LOGNAME=hauffe
> HOME=/home/lft/mitarbeiter/hauffe
> PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin
> MAIL=/var/mail/hauffe
> SHELL=/usr/bin/tcsh
> SSH_CLIENT=141.30.52.234 53502 22
> SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22
> SSH_TTY=/dev/pts/0
> TERM=xterm
> XDG_SESSION_ID=144
> XDG_RUNTIME_DIR=/run/user/hauffe
> NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat
> HOSTTYPE=x86_64
> VENDOR=suse
> OSTYPE=linux
> MACHTYPE=x86_64-suse-linux
> SHLVL=1
> PWD=/btmpl/Software
> GROUP=lftuser
> HOST=mlr114u
> CSHEDIT=emacs
> CPU=x86_64
> HOSTNAME=mlr114u.mw.tu-dresden.de
> INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc
> LESS=-M -I -R
> LESSOPEN=lessopen.sh %s
> LESSCLOSE=lessclose.sh %s %s
> LESS_ADVANCED_PREPROCESSOR=no
> LESSKEY=/etc/lesskey.bin
> PAGER=less
> MORE=-sl
> MINICOM=-c on
> MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man
> XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB
> XNLSPATH=/usr/share/X11/nls
> COLORTERM=1
> SSH_SENDS_LOCALE=yes
> JAVA_BINDIR=/usr/lib64/jvm/jre/bin
> JAVA_ROOT=/usr/lib64/jvm/jre
> JAVA_HOME=/usr/lib64/jvm/jre
> JRE_HOME=/usr/lib64/jvm/jre
> CVS_RSH=ssh
> XCURSOR_THEME=DMZ
> QT_SYSTEM_DIR=/usr/share/desktop-data
> LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin
> NXDIR=/usr/NX
> FROM_HEADER=
> NNTPSERVER=news
> WINDOWMANAGER=/usr/bin/kde4
> ALSA_CONFIG_PATH=/etc/alsa-pulse.conf
> SDL_AUDIODRIVER=pulse
> PYTHONSTARTUP=/etc/pythonstart
> XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help
> XDG_CONFIG_DIRS=/etc/xdg
> G_BROKEN_FILENAMES=1
> G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252
> CSHRCREAD=true
> LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:
> LS_OPTIONS=-N --color=tty -T 0
> GPG_TTY=/dev/pts/0
> MODULE_VERSION=3.2.8
> MODULE_VERSION_STACK=3.2.8
> MODULESHOME=/w/appl/lft/Modules/3.2.8
> MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications
> LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0
> _LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0
> INTEL_LICENSE_FILE=28518 at mlr25s
> LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64
> PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin
> BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:
> BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:
> SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349
> SSH_AGENT_PID=6350
>
> Hydra internal environment:
> ---------------------------
> GFORTRAN_UNBUFFERED_PRECONNECTED=y
>
>
> Proxy information:
> *********************
> [1] proxy: mlr114u (1 cores)
> Exec list: hostname (1 processes);
>
> [2] proxy: mlr113u (3 cores)
> Exec list: hostname (3 processes);
>
>
> ==================================================================================================
>
> [mpiexec at mlr114u] Timeout set to -1 (-1 means infinite)
> [mpiexec at mlr114u] Got a control port string of mlr114u:46425
>
> Proxy launch args: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id
>
> Arguments being passed to proxy 0:
> --version 1.5 --iface-ip-env-name MPICH_INTERFACE_HOSTNAME --hostname mlr114u --global-core-map 0,1,4 --pmi-id-map 0,0 --global-process-count 4 --auto-cleanup 1 --pmi-kvsname kvs_6389_0 --pmi-process-mapping (vector,(0,1,1),(1,1,3)) --ckpoint-num -1 --global-inherited-env 75 'LANG=de_DE' 'USER=hauffe' 'LOGNAME=hauffe' 'HOME=/home/lft/mitarbeiter/hauffe' 'PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin' 'MAIL=/var/mail/hauffe' 'SHELL=/usr/bin/tcsh' 'SSH_CLIENT=141.30.52.234 53502 22' 'SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22' 'SSH_TTY=/dev/pts/0' 'TERM=xterm' 'XDG_SESSION_ID=144' 'XDG_RUNTIME_DIR=/run/user/hauffe' 'NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat' 'HOSTTYPE=x86_64' 'VENDOR=suse' 'OSTYPE=linux' 'MACHTYPE=x86_64-suse-linux' 'SHLVL=1' 'PWD=/btmpl/Software' 'GROUP=lftuser' 'HOST=mlr114u' 'CSHEDIT=emacs' 'CPU=x86_64' 'HOSTNAME=mlr114u.mw.tu-dresden.de' 'INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc' 'LESS=-M -I -R' 'LESSOPEN=lessopen.sh %s' 'LESSCLOSE=lessclose.sh %s %s' 'LESS_ADVANCED_PREPROCESSOR=no' 'LESSKEY=/etc/lesskey.bin' 'PAGER=less' 'MORE=-sl' 'MINICOM=-c on' 'MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man' 'XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB' 'XNLSPATH=/usr/share/X11/nls' 'COLORTERM=1' 'SSH_SENDS_LOCALE=yes' 'JAVA_BINDIR=/usr/lib64/jvm/jre/bin' 'JAVA_ROOT=/usr/lib64/jvm/jre' 'JAVA_HOME=/usr/lib64/jvm/jre' 'JRE_HOME=/usr/lib64/jvm/jre' 'CVS_RSH=ssh' 'XCURSOR_THEME=DMZ' 'QT_SYSTEM_DIR=/usr/share/desktop-data' 'LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'NXDIR=/usr/NX' 'FROM_HEADER=' 'NNTPSERVER=news' 'WINDOWMANAGER=/usr/bin/kde4' 'ALSA_CONFIG_PATH=/etc/alsa-pulse.conf' 'SDL_AUDIODRIVER=pulse' 'PYTHONSTARTUP=/etc/pythonstart' 'XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help' 'XDG_CONFIG_DIRS=/etc/xdg' 'G_BROKEN_FILENAMES=1' 'G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252' 'CSHRCREAD=true' 'LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:' 'LS_OPTIONS=-N --color=tty -T 0' 'GPG_TTY=/dev/pts/0' 'MODULE_VERSION=3.2.8' 'MODULE_VERSION_STACK=3.2.8' 'MODULESHOME=/w/appl/lft/Modules/3.2.8' 'MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications' 'LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0' '_LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0' 'INTEL_LICENSE_FILE=28518 at mlr25s' 'LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64' 'PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:' 'BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:' 'SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349' 'SSH_AGENT_PID=6350' --global-user-env 0 --global-system-env 1 'GFORTRAN_UNBUFFERED_PRECONNECTED=y' --proxy-core-count 1 --exec --exec-appnum 0 --exec-proc-count 1 --exec-local-env 0 --exec-wdir /btmpl/Software --exec-args 1 hostname
>
> Arguments being passed to proxy 1:
> --version 1.5 --iface-ip-env-name MPICH_INTERFACE_HOSTNAME --hostname mlr113u --global-core-map 0,3,4 --pmi-id-map 0,1 --global-process-count 4 --auto-cleanup 1 --pmi-kvsname kvs_6389_0 --pmi-process-mapping (vector,(0,1,1),(1,1,3)) --ckpoint-num -1 --global-inherited-env 75 'LANG=de_DE' 'USER=hauffe' 'LOGNAME=hauffe' 'HOME=/home/lft/mitarbeiter/hauffe' 'PATH=/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin:/w/appl/lft/jdk1.7.0_07-linux-x64/bin:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/bin/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/bin/intel64:/w/appl/lft/Modules/3.2.8/bin:/usr/NX/bin:/usr/lib64/mpi/gcc/openmpi/bin:/home/lft/mitarbeiter/hauffe/bin:/usr/bin:/bin:/usr/sbin:/sbin:/usr/local/bin:/usr/bin/X11:/usr/X11R6/bin:/usr/games:/usr/lib64/jvm/jre/bin:/usr/lib/mit/bin:/usr/lib/mit/sbin:/w/appl/lft/bin' 'MAIL=/var/mail/hauffe' 'SHELL=/usr/bin/tcsh' 'SSH_CLIENT=141.30.52.234 53502 22' 'SSH_CONNECTION=141.30.52.234 53502 141.30.156.114 22' 'SSH_TTY=/dev/pts/0' 'TERM=xterm' 'XDG_SESSION_ID=144' 'XDG_RUNTIME_DIR=/run/user/hauffe' 'NLSPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64/locale/en_US:/usr/share/locale/%L/LC_MESSAGES/%N.cat' 'HOSTTYPE=x86_64' 'VENDOR=suse' 'OSTYPE=linux' 'MACHTYPE=x86_64-suse-linux' 'SHLVL=1' 'PWD=/btmpl/Software' 'GROUP=lftuser' 'HOST=mlr114u' 'CSHEDIT=emacs' 'CPU=x86_64' 'HOSTNAME=mlr114u.mw.tu-dresden.de' 'INPUTRC=/home/lft/mitarbeiter/hauffe/.inputrc' 'LESS=-M -I -R' 'LESSOPEN=lessopen.sh %s' 'LESSCLOSE=lessclose.sh %s %s' 'LESS_ADVANCED_PREPROCESSOR=no' 'LESSKEY=/etc/lesskey.bin' 'PAGER=less' 'MORE=-sl' 'MINICOM=-c on' 'MANPATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/man/en_US:/w/appl/lft/Modules/3.2.8/share/man:/usr/lib64/mpi/gcc/openmpi/man:/usr/share/man:/usr/local/man' 'XKEYSYMDB=/usr/X11R6/lib/X11/XKeysymDB' 'XNLSPATH=/usr/share/X11/nls' 'COLORTERM=1' 'SSH_SENDS_LOCALE=yes' 'JAVA_BINDIR=/usr/lib64/jvm/jre/bin' 'JAVA_ROOT=/usr/lib64/jvm/jre' 'JAVA_HOME=/usr/lib64/jvm/jre' 'JRE_HOME=/usr/lib64/jvm/jre' 'CVS_RSH=ssh' 'XCURSOR_THEME=DMZ' 'QT_SYSTEM_DIR=/usr/share/desktop-data' 'LD_LIBRARY_PATH=/w/appl/lft/jdk1.7.0_07-linux-x64/jre/lib/amd64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/mpirt/lib/intel64:/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64:/usr/lib64/mpi/gcc/openmpi/lib64:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'NXDIR=/usr/NX' 'FROM_HEADER=' 'NNTPSERVER=news' 'WINDOWMANAGER=/usr/bin/kde4' 'ALSA_CONFIG_PATH=/etc/alsa-pulse.conf' 'SDL_AUDIODRIVER=pulse' 'PYTHONSTARTUP=/etc/pythonstart' 'XDG_DATA_DIRS=/usr/local/share:/usr/share:/usr/share/gnome/help' 'XDG_CONFIG_DIRS=/etc/xdg' 'G_BROKEN_FILENAMES=1' 'G_FILENAME_ENCODING=@locale,UTF-8,ISO-8859-1,CP1252' 'CSHRCREAD=true' 'LS_COLORS=no=00:fi=00:di=01;34:ln=00;36:pi=40;33:so=01;35:do=01;35:bd=40;33;01:cd=40;33;01:or=41;33;01:ex=00;32:*.cmd=00;32:*.exe=01;32:*.com=01;32:*.bat=01;32:*.btm=01;32:*.dll=01;32:*.tar=00;31:*.tbz=00;31:*.tgz=00;31:*.rpm=00;31:*.deb=00;31:*.arj=00;31:*.taz=00;31:*.lzh=00;31:*.lzma=00;31:*.zip=00;31:*.zoo=00;31:*.z=00;31:*.Z=00;31:*.gz=00;31:*.bz2=00;31:*.tb2=00;31:*.tz2=00;31:*.tbz2=00;31:*.xz=00;31:*.avi=01;35:*.bmp=01;35:*.fli=01;35:*.gif=01;35:*.jpg=01;35:*.jpeg=01;35:*.mng=01;35:*.mov=01;35:*.mpg=01;35:*.pcx=01;35:*.pbm=01;35:*.pgm=01;35:*.png=01;35:*.ppm=01;35:*.tga=01;35:*.tif=01;35:*.xbm=01;35:*.xpm=01;35:*.dl=01;35:*.gl=01;35:*.wmv=01;35:*.aiff=00;32:*.au=00;32:*.mid=00;32:*.mp3=00;32:*.ogg=00;32:*.voc=00;32:*.wav=00;32:' 'LS_OPTIONS=-N --color=tty -T 0' 'GPG_TTY=/dev/pts/0' 'MODULE_VERSION=3.2.8' 'MODULE_VERSION_STACK=3.2.8' 'MODULESHOME=/w/appl/lft/Modules/3.2.8' 'MODULEPATH=/w/appl/lft/Modules/versions:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/lft-apps:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/compilers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/debuggers:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/libraries:/w/appl/lft/Modules/$MODULE_VERSION/modulefiles/applications' 'LOADEDMODULES=modules:lft:nedit/5.5:ifc/11.8.273:paraview/3.14.1:jdk/1.7.0_u7:netbeans/7.2:paraview/3.14.1_batch:mpich2/1.5.0p0' '_LMFILES_=/w/appl/lft/Modules/default/modulefiles/modules:/w/appl/lft/Modules/default/modulefiles/lft:/w/appl/lft/Modules/3.2.8/modulefiles/applications/nedit/5.5:/w/appl/lft/Modules/3.2.8/modulefiles/compilers/ifc/11.8.273:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1:/w/appl/lft/Modules/3.2.8/modulefiles/applications/jdk/1.7.0_u7:/w/appl/lft/Modules/3.2.8/modulefiles/applications/netbeans/7.2:/w/appl/lft/Modules/3.2.8/modulefiles/applications/paraview/3.14.1_batch:/w/appl/lft/Modules/3.2.8/modulefiles/libraries/mpich2/1.5.0p0' 'INTEL_LICENSE_FILE=28518 at mlr25s' 'LIBRARY_PATH=/mnt/appl/lft/composer_xe_2011_sp1.8.273/compiler/lib/intel64' 'PYTHONPATH=/w/appl/x86_64/paraview/3.14.1_batch/Utilities/VTKPythonWrapping/site-packages:/w/appl/x86_64/paraview/3.14.1_batch/bin' 'BSTINPUTS=/home/lft/mitarbeiter/hauffe/myLatex/styles/BibTeX//:' 'BIBINPUTS=/home/lft/mitarbeiter/hauffe/Documents/Bibliographien/:.:' 'SSH_AUTH_SOCK=/tmp/ssh-rjXtIUUZ6349/agent.6349' 'SSH_AGENT_PID=6350' --global-user-env 0 --global-system-env 1 'GFORTRAN_UNBUFFERED_PRECONNECTED=y' --proxy-core-count 3 --exec --exec-appnum 0 --exec-proc-count 3 --exec-local-env 0 --exec-wdir /btmpl/Software --exec-args 1 hostname
>
> [mpiexec at mlr114u] Launch arguments: /mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id 0
> [mpiexec at mlr114u] Launch arguments: /usr/bin/ssh -x mlr113u "/mnt/appl/lft/mpich/mpich2-1.5.0p0_64bit/bin/hydra_pmi_proxy" --control-port mlr114u:46425 --debug --rmk user --launcher ssh --demux poll --pgid 0 --retries 10 --usize -2 --proxy-id 1
> mlr114u
> [mpiexec at mlr114u] control_cb (./pm/pmiserv/pmiserv_cb.c:201): assert (!closed) failed
> [mpiexec at mlr114u] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
> [mpiexec at mlr114u] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:196): error waiting for event
> [mpiexec at mlr114u] main (./ui/mpich/mpiexec.c:325): process manager error waiting for completion
>
>
> Zitat von Dave Goodell <goodell at mcs.anl.gov>:
>
>> Please also send us the output of "mpiexec -v -f machinefile -n 4 hostname" (note the "-v" addition). This will give us a better idea of why hydra is having trouble launching the hostname program.
>>
>> -Dave
>>
>> On Oct 15, 2012, at 2:50 AM CDT, Andreas Hauffe wrote:
>>
>>> Hi,
>>>
>>> I need help to run a MPI-Program with MPICH2 1.5 on more than one host. If I
>>> try the install example:
>>> mpiexec -f machinefile -n <number> hostname
>>> I get the following output:
>>> host> mpiexec -f machinefile -n 4 hostname
>>> mlr114u
>>> [mpiexec at mlr114u] control_cb (./pm/pmiserv/pmiserv_cb.c:201): assert (!closed)
>>> failed
>>> [mpiexec at mlr114u] HYDT_dmxu_poll_wait_for_event
>>> (./tools/demux/demux_poll.c:77): callback returned error status
>>> [mpiexec at mlr114u] HYD_pmci_wait_for_completion
>>> (./pm/pmiserv/pmiserv_pmci.c:196): error waiting for event
>>> [mpiexec at mlr114u] main (./ui/mpich/mpiexec.c:325): process manager error
>>> waiting for completion
>>>
>>> I attachted all files but there was no
>>> mpich2-1.5/src/pm/hydra/tools/topo/hwloc/hwloc/config.log
>>>
>>> The command "mpiexec -info":
>>>
>>> HYDRA build details:
>>> Version: 1.5
>>> Release Date: Mon Oct 8 14:00:48 CDT 2012
>>> CC: gcc
>>> CXX: c++
>>> F77: ifort
>>> F90: ifort
>>> Configure options: '--disable-option-checking' '--
>>> prefix=/w/appl/lft/mpich/mpich2-1.5.0p0_64bit' '--cache-file=/dev/null' '--
>>> srcdir=.' 'CC=gcc' 'CFLAGS= -O2' 'LDFLAGS= ' 'LIBS=-lrt -lpthread ' 'CPPFLAGS=
>>> -I/btmpl/Software/mpich2-1.5/src/mpl/include -
>>> I/btmpl/Software/mpich2-1.5/src/mpl/include -
>>> I/btmpl/Software/mpich2-1.5/src/openpa/src -
>>> I/btmpl/Software/mpich2-1.5/src/openpa/src -
>>> I/btmpl/Software/mpich2-1.5/src/mpi/romio/include'
>>> Process Manager: pmi
>>> Launchers available: ssh rsh fork slurm ll lsf sge
>>> manual persist
>>> Topology libraries available: hwloc
>>> Resource management kernels available: user slurm ll lsf sge pbs
>>> Checkpointing libraries available:
>>> Demux engines available: poll select
>>>
>>> --
>>> Viele Grüße
>>> Andreas Hauffe
>>> Leiter der Arbeitsgruppe "Auslegungsmethoden für Luftfahrzeuge"
>>>
>>> ----------------------------------------------------------------------------------------------------
>>> Technische Universität Dresden
>>> Institut für Luft- und Raumfahrttechnik / Institute of Aerospace Engineering
>>> Lehrstuhl für Luftfahrzeugtechnik / Chair of Aircraft Engineering
>>>
>>> D-01062 Dresden
>>> Germany
>>>
>>> phone : +49 (351) 463 38496
>>> fax : +49 (351) 463 37263
>>> mail : andreas.hauffe at tu-dresden.de
>>> Website : http://tu-dresden.de/mw/ilr/lft
>>> ----------------------------------------------------------------------------------------------------<files.tar.gz>_______________________________________________
>>> mpich-discuss mailing list mpich-discuss at mcs.anl.gov
>>> To manage subscription options or unsubscribe:
>>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>>
>> _______________________________________________
>> mpich-discuss mailing list mpich-discuss at mcs.anl.gov
>> To manage subscription options or unsubscribe:
>> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>>
>
>
> _______________________________________________
> mpich-discuss mailing list mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
More information about the mpich-discuss
mailing list