Error during mpdboot

Bruno Nyffeler bruno.nyffeler at isb-sib.ch
Tue May 31 04:39:21 CDT 2005


Hello

I have been installing MPICH2 on several clusters and it worked flawless 
on almost all machines. Just on one small cluster (Redhat 7.3) mpdboot 
does not work and produces an error. So, I tried to start mpd directly 
on two of the machines (like it is described in the installers guide), 
using sth like:

host1> mpd -e &
47684
host2> mpd -h host1 -p 47684

The server (on host1) prints this message:

    host1_47684 (_handle_new_connection 940): INVALID msg from new 
connection :('10.255.255.254', 50674): msg=:{}:

while mpd on host2 quits with:

    host2_50673 failed ; cause: invalid challenge msg: {}
        traceback: [('/tmp/mpich2/mpich2-1.0.1/bin/mpd', '1158', 
'_enter_existing_ring'),     ('/tmp/mpich2/mpich2-1.0.1/bin/mpd', '175', 
'_mpd_init'), ('/tmp/mpich2/mpich2-1.0.1/bin/mpd', '1398', '?')]


I also tried the following using mpdcheck:

host1> mpdcheck -s
server listening at INADDR_ANY on: host1 47700

host2> mpdcheck -c host1 47700

which exits on host1 with:

    server has conn on <socket object, fd=5, family=2, type=1, 
protocol=0> from ('10.255.255.254', 50677)
    server successfully recvd msg from client: hello_from_client_to_server

and on host2 with:

    client successfully recvd ack from server: ack_from_server_to_client



The machines are based on Intel Xeon CPUs and the OS is RedHat 7.3, 
kernel 2.4.18-27.
Do you have any ideas about this?

Thank you in advance,

Bruno Nyffeler




-- 
=============================================
Dr. Bruno Nyffeler
Swiss Institute of Bioinformatics
Vital-IT UNIL - BEP
1015 Lausanne Switzerland
email    bruno.nyffeler at isb-sib.ch
phone    +41 21 6924033
mobile  +41 79 5154556
fax    +41 21 6924065
=============================================




More information about the mpich-discuss mailing list