Hi,<br><br>I have just installed MPICH2 in my Xen-based virtual machines.<br><br>My hardware configuration is as follows:<br><br>Processor: Intel Pentium Dual Core E6300 @ 2.8 GHz<br>Motherboard: Intel Desktop Board DQ45CB BIOS 0093<br>
Memory: 4X 2GB Kingston DDR2-800 CL5<br><br>My software configuration is as follows:<br><br>Xen Hypervisor / Virtual Machine Monitor Version: 3.5-unstable<br>Jeremy Fitzhardinge's pv-ops dom0 kernel: 2.6.31.4<br>Host Operating System: Fedora Linux 11 x86-64 (SELinux disabled)<br>
Guest Operating Systems: Fedora Linux 11 x86-64 paravirtualized (PV) domU guests (SELinux disabled)<br><br>I have successfully configured, built and installed MPICH2 in a F11 PV guest OS master compute node 1 with NFS server (MPICH2 bin subdirectory exported). The rest of the 5 compute nodes have access to the MPICH2 binaries by mounting NFS share from node 1. Please see attached c.txt, m.txt and mi.txt. With Xen virtualization, I have created 6 F11 linux PV guests to simulate 6 HPC compute nodes. The network adapter (NIC) in each guest OS is virtual. The Xen networking type is bridged. Running "lspci -v" and lsusb in each guest OS does not show up anything.<br>
<br>According to Appendix A troubleshooting section of the MPICH2 install guide, I have verified that the 2-node test scenario with "mpdcheck -s" and "mpdcheck -c" is working. The 2 nodes each acting as server and client respectively can communicate with each other without problems. Both nodes can communicate with each other in server and client modes respectively. I have also tested mpdboot with the 2-node scenario and the ring of mpd is working.<br>
<br>After the troubleshooting process, I have successfully created a ring of mpd involving 6 compute nodes. "mpdtrace -l" successfully lists all the 6 nodes. However, when I want to run a job with mpiexec, it gives me the following error:<br>
<br>[enming@enming-f11-pv-hpc-node0001 ~]$ mpiexec -n 2 examples/cpi<br>mpiexec_enming-f11-pv-hpc-node0001 (mpiexec 392): no msg recvd from mpd when expecting ack of request<br><br>I have also tried starting the mpd ring with the root user but I still encounter the same error above.<br>
<br>Thank you.<br><br>PS. config.log is also attached.<br clear="all"><br>-- <br>Mr. Teo En Ming (Zhang Enming) Dip(Mechatronics) BEng(Hons)(Mechanical Engineering)<br>Alma Maters:<br>(1) Singapore Polytechnic<br>(2) National University of Singapore<br>
My blog URL: <a href="http://teo-en-ming-aka-zhang-enming.blogspot.com">http://teo-en-ming-aka-zhang-enming.blogspot.com</a><br>My Youtube videos: <a href="http://www.youtube.com/user/enmingteo">http://www.youtube.com/user/enmingteo</a><br>
Email: <a href="mailto:space.time.universe@gmail.com">space.time.universe@gmail.com</a><br>MSN: <a href="mailto:teoenming@hotmail.com">teoenming@hotmail.com</a><br>Mobile Phone (SingTel): +65-9648-9798<br>Mobile Phone (Starhub Prepaid): +65-8369-2618<br>
Age: 31 (as at 30 Oct 2009)<br>Height: 1.78 meters<br>Race: Chinese<br>Dialect: Hokkien<br>Street: Bedok Reservoir Road<br>Country: Singapore<br>